Machine Learning for Physics

Name: Machine Learning for Physics
Start: 2025-03-13T08:50:00+00:00
End: 2025-03-14T19:00:00+00:00
Location: LIP Lisboa

13–14 Mar 2025

LIP Lisboa

Europe/Lisbon timezone

Contact

lisbon-ml-workshop@cern.ch

Contribution List

11. Introduction to the course and welcome

Michele Gallinaro (LIP)

13/03/2025, 08:55

1. Introduction to Machine Learning for Physics (lecture)

Prof. Pietro Vischia (Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA))

13/03/2025, 09:00

This lecture will introduce the broad concept of Machine Learning, its connection to Artificial Intelligence, and will broadly review the use of ML in High Energy Physics.

2. Automatic Differentiation and Supervised Learning (lecture)

Prof. Pietro Vischia (Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA))

13/03/2025, 10:30

This lecture will focus on supervised learning, a setting where the training data set is “labelled”, that is the target quantity of learning is known. Automatic Differentiation, the technique that powers up modern machine learning frameworks will then be explained in detail, together with its connection to differentiable programming.

4. Exercise 1: Network structure and inductive bias in Higgs physics (ttH)

Prof. Pietro Vischia (Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA))

13/03/2025, 13:30

Inductive bias refers to the process of encoding into the learning process some properties of the data known a priori: this can happen by manipulating the training data (augmentation), by modifying the structure of the algorithm (e.g. dense vs convolutional networks), or by modifying the learning target (loss function). The exercise will consist in comparing the performance of generic...

3. Exercise 2: Classification and anomaly detection in S-top searches

Cristóvão da Cruz e Silva (LIP)

13/03/2025, 16:00

Classification is a category of supervised learning where the goal is to classify the data into different categories. For the CMS search of the supersymmetric partner of the top quark in the compressed mass scenario a Boosted Decision Tree (BDT) algorithm was used to distinguish between signal-like and background-like events. In this exercise, a neural network will be implemented to achieve...

5. Into the belly of Transformers: mathematical formalism and inner workings (lecture)

Prof. Pietro Vischia (Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA))

14/03/2025, 09:00

Transformers are an architecture that powers up most Large Language Models in the market nowadays. This lecture will explain the inner structure of a transformer.

6. Exercise 3: Flavour tagging with Transformers

Inês Ochoa (LIP)

14/03/2025, 09:30

Flavour tagging allows us to identify jets that originate from b- and c-quarks, and is a crucial tool for the physics programme of LHC experiments. The jet flavour can be predicted based on the characteristics of the charged particle tracks associated with it. This set of variable number and unordered tracks lends itself to a graph representation, which can be exploited by transformers. In...

8. Unsupervised learning (lecture)

Prof. Pietro Vischia (Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA))

14/03/2025, 11:00

When the data set is unlabelled, that is when the target quantity for learning is not known, traditional supervised learning techniques cannot be used. This lecture will explain the corresponding techniques to obtain learning algorithms without an explicitly known target, such as reinforcement learning.

9. Exercise 4: probing the substructure of boosted jets with unsupervised learning

Inês Ochoa (LIP)

14/03/2025, 13:30

10. Data challenge!!!

Cristóvão da Cruz e Silva (LIP), Inês Ochoa (LIP), Prof. Pietro Vischia (Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA))

14/03/2025, 16:00

The data challenge will consist in solving a machine learning problem on a given data set.

The participants will be provided access to the data set, and skeleton code to set up the study.

Participants will have to submit a series of predictions for an evaluation data set, as well as the code and an explanation of the logic behind it.

The models faring the best in the evaluation...

7. Wrap-up, Awards, and Group Photo

14/03/2025, 18:00

Choose timezone

Machine Learning for Physics

Contact