28–30 Oct 2024
Porto
Europe/Lisbon timezone

DataLake Applications: Lattice QCD

29 Oct 2024, 16:30
20m
Auditório (Centro de Investigação Médica (CIM-FMUP))

Auditório

Centro de Investigação Médica (CIM-FMUP)

Presentation (15' + 5' for questions) Development, implementation and operation of Data Lakes IBERGRID

Speaker

Mr Gaurav Ray (IFCA-CSIC)

Description

In the current era of Big Data data management practices are an increasingly important consideration when doing scientific research. The Scientific community's aspiration for FAIR data depends on good data management practices and policies, and interTwin's DataLake has been designed with these goals in mind. I will present the status of an application of the DataLake to the particular field of Lattice QCD simulations within physics. I will discuss how the DataLake is being co-developed with the lattice use case, focusing on how the lattice community's data requirements have influenced its development. I will talk about the integration of the DataLake with the external ILDG metadata and file catalogues. I will end by discussing the potential ways in which the DataLake concept could change and improve how lattice collaborations do their research in future.

Primary author

Mr Gaurav Ray (IFCA-CSIC)

Presentation materials