Machine Learning for Physics
from
Thursday 13 March 2025 (08:50)
to
Friday 14 March 2025 (19:00)
Monday 10 March 2025
¶
Tuesday 11 March 2025
¶
Wednesday 12 March 2025
¶
Thursday 13 March 2025
¶
08:55
Introduction to the course and welcome
-
Michele Gallinaro
(
LIP
)
Introduction to the course and welcome
Michele Gallinaro
(
LIP
)
08:55 - 09:00
Room: LIP-Lisboa/3-311 - Sala de Seminários
09:00
Introduction to Machine Learning for Physics (lecture)
-
Pietro Vischia
(
Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)
)
Introduction to Machine Learning for Physics (lecture)
Pietro Vischia
(
Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)
)
09:00 - 10:00
Room: LIP-Lisboa/3-311 - Sala de Seminários
This lecture will introduce the broad concept of Machine Learning, its connection to Artificial Intelligence, and will broadly review the use of ML in High Energy Physics.
10:00
Coffee Break
Coffee Break
10:00 - 10:30
Room: LIP-Lisboa/3-311 - Sala de Seminários
10:30
Automatic Differentiation and Supervised Learning (lecture)
-
Pietro Vischia
(
Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)
)
Automatic Differentiation and Supervised Learning (lecture)
Pietro Vischia
(
Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)
)
10:30 - 12:00
Room: LIP-Lisboa/3-311 - Sala de Seminários
This lecture will focus on supervised learning, a setting where the training data set is “labelled”, that is the target quantity of learning is known. Automatic Differentiation, the technique that powers up modern machine learning frameworks will then be explained in detail, together with its connection to differentiable programming.
12:00
Lunch break (lunch not provided)
Lunch break (lunch not provided)
12:00 - 13:30
13:30
Exercise 1: Network structure and inductive bias in Higgs physics (ttH)
-
Pietro Vischia
(
Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)
)
Exercise 1: Network structure and inductive bias in Higgs physics (ttH)
Pietro Vischia
(
Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)
)
13:30 - 15:30
Room: LIP-Lisboa/3-311 - Sala de Seminários
Inductive bias refers to the process of encoding into the learning process some properties of the data known a priori: this can happen by manipulating the training data (augmentation), by modifying the structure of the algorithm (e.g. dense vs convolutional networks), or by modifying the learning target (loss function). The exercise will consist in comparing the performance of generic algorithms with that of algorithms targeted to specific structures (e.g. convolutional networks).
15:30
Coffee Break
Coffee Break
15:30 - 16:00
Room: LIP-Lisboa/3-311 - Sala de Seminários
16:00
Exercise 2: Classification and anomaly detection in S-top searches
-
Cristóvão da Cruz e Silva
(
LIP
)
Exercise 2: Classification and anomaly detection in S-top searches
Cristóvão da Cruz e Silva
(
LIP
)
16:00 - 18:00
Room: LIP-Lisboa/3-311 - Sala de Seminários
Classification is a category of supervised learning where the goal is to classify the data into different categories. For the CMS search of the supersymmetric partner of the top quark in the compressed mass scenario a Boosted Decision Tree (BDT) algorithm was used to distinguish between signal-like and background-like events. In this exercise, a neural network will be implemented to achieve this task and performance will be compared with the BDT approach. If time allows, a further attempt will be made with an autoencoder neural network, where only background simulated events are used for training, the performance of this approach will also be compared to the previous two approaches. For comparison of the approaches a simplified limit setting via the pyhf library will be used.
Friday 14 March 2025
¶
09:00
Into the belly of Transformers: mathematical formalism and inner workings (lecture)
-
Pietro Vischia
(
Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)
)
Into the belly of Transformers: mathematical formalism and inner workings (lecture)
Pietro Vischia
(
Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)
)
09:00 - 09:30
Room: LIP-Lisboa/3-311 - Sala de Seminários
Transformers are an architecture that powers up most Large Language Models in the market nowadays. This lecture will explain the inner structure of a transformer.
09:30
Exercise 3: Flavour tagging with Transformers
-
Inês Ochoa
(
LIP
)
Exercise 3: Flavour tagging with Transformers
Inês Ochoa
(
LIP
)
09:30 - 10:30
Room: LIP-Lisboa/3-311 - Sala de Seminários
Flavour tagging allows us to identify jets that originate from b- and c-quarks, and is a crucial tool for the physics programme of LHC experiments. The jet flavour can be predicted based on the characteristics of the charged particle tracks associated with it. This set of variable number and unordered tracks lends itself to a graph representation, which can be exploited by transformers. In this exercise, we train and evaluate a transformer for identifying b-jets.
10:30
Coffee Break
Coffee Break
10:30 - 11:00
Room: LIP-Lisboa/3-311 - Sala de Seminários
11:00
Unsupervised learning (lecture)
-
Pietro Vischia
(
Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)
)
Unsupervised learning (lecture)
Pietro Vischia
(
Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)
)
11:00 - 12:00
Room: LIP-Lisboa/3-311 - Sala de Seminários
When the data set is unlabelled, that is when the target quantity for learning is not known, traditional supervised learning techniques cannot be used. This lecture will explain the corresponding techniques to obtain learning algorithms without an explicitly known target, such as reinforcement learning.
12:00
Lunch break (lunch not provided)
Lunch break (lunch not provided)
12:00 - 13:30
13:30
Exercise 4: probing the substructure of boosted jets with unsupervised learning
-
Inês Ochoa
(
LIP
)
Exercise 4: probing the substructure of boosted jets with unsupervised learning
Inês Ochoa
(
LIP
)
13:30 - 15:30
Room: LIP-Lisboa/3-311 - Sala de Seminários
15:30
Coffee Break
Coffee Break
15:30 - 16:00
Room: LIP-Lisboa/3-311 - Sala de Seminários
16:00
Data challenge!!!
-
Cristóvão da Cruz e Silva
(
LIP
)
Inês Ochoa
(
LIP
)
Pietro Vischia
(
Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)
)
Data challenge!!!
Cristóvão da Cruz e Silva
(
LIP
)
Inês Ochoa
(
LIP
)
Pietro Vischia
(
Universidad de Oviedo and Instituto de Ciencias y Tecnologías Espaciales de Asturias (ICTEA)
)
16:00 - 17:40
Room: LIP-Lisboa/3-311 - Sala de Seminários
The data challenge will consist in solving a machine learning problem on a given data set. The participants will be provided access to the data set, and skeleton code to set up the study. Participants will have to submit a series of predictions for an evaluation data set, as well as the code and an explanation of the logic behind it. The models faring the best in the evaluation dataset will be declared winner: the first three models will be presented by their developers (1st: 5 minutes, 2nd and 3rd: 2 minutes) during the awards ceremony. Certificates for the first three classified will be provided.
17:40
Break (evaluation of challenge outcomes)
Break (evaluation of challenge outcomes)
17:40 - 18:00
Room: LIP-Lisboa/3-311 - Sala de Seminários
18:00
Wrap-up, Awards, and Group Photo
Wrap-up, Awards, and Group Photo
18:00 - 18:30
Room: LIP-Lisboa/3-311 - Sala de Seminários