Speaker
Description
In the current era of Big Data data management practices are an increasingly important consideration when doing scientific research. The Scientific community's aspiration for FAIR data depends on good data management practices and policies, and interTwin's DataLake has been designed with these goals in mind. I will present the status of an application of the DataLake to the particular field of Lattice QCD simulations within physics. I will discuss how the DataLake is being co-developed with the lattice use case, focusing on how the lattice community's data requirements have influenced its development. I will talk about the integration of the DataLake with the external ILDG metadata and file catalogues. I will end by discussing the potential ways in which the DataLake concept could change and improve how lattice collaborations do their research in future.