This thesis presents a case study on the FAIRification of scientific data within the context of the NFFA-DI initiative, which promotes open science and data standardization in materials science and nanotechnology. The goal was to enhance the quality, usability, and long-term accessibility of legacy laboratory data collected at the HMMBE Laboratory, a molecular beam epitaxy synthesis facility. To achieve this, a complete workflow was developed to convert fragmented, unstructured text files into structured, interoperable datasets using the NeXus format, a hierarchical standard based on HDF5. A new application definition and two supporting base classes were created to model the MBE deposition process. Parsing functions were implemented to extract, transform, and enrich data from heterogeneous sources, resulting in fully structured NeXus files. Consequently, the converted files were validated both in the structure and content. The transformed data were then uploaded in a local NOMAD repository. A custom plugin and dedicated NOMAD app were developed to enable proper ingestion, indexing, and visualization of the new data format. The outcome demonstrates how isolated experimental records can be converted into reusable and interoperable scientific assets, contributing to the advancement of FAIR data practices and supporting broader goals in open, collaborative research.

Data Curation for Optimizing Molecular Beam Epitaxial Growth of III-V Semiconductor Samples / Musini, Leonardo. - (2025 May 27).

Data Curation for Optimizing Molecular Beam Epitaxial Growth of III-V Semiconductor Samples

MUSINI, LEONARDO
2025-05-27

Abstract

This thesis presents a case study on the FAIRification of scientific data within the context of the NFFA-DI initiative, which promotes open science and data standardization in materials science and nanotechnology. The goal was to enhance the quality, usability, and long-term accessibility of legacy laboratory data collected at the HMMBE Laboratory, a molecular beam epitaxy synthesis facility. To achieve this, a complete workflow was developed to convert fragmented, unstructured text files into structured, interoperable datasets using the NeXus format, a hierarchical standard based on HDF5. A new application definition and two supporting base classes were created to model the MBE deposition process. Parsing functions were implemented to extract, transform, and enrich data from heterogeneous sources, resulting in fully structured NeXus files. Consequently, the converted files were validated both in the structure and content. The transformed data were then uploaded in a local NOMAD repository. A custom plugin and dedicated NOMAD app were developed to enable proper ingestion, indexing, and visualization of the new data format. The outcome demonstrates how isolated experimental records can be converted into reusable and interoperable scientific assets, contributing to the advancement of FAIR data practices and supporting broader goals in open, collaborative research.
27-mag-2025
Curcio, Davide
Data Curation for Optimizing Molecular Beam Epitaxial Growth of III-V Semiconductor Samples / Musini, Leonardo. - (2025 May 27).
File in questo prodotto:
File Dimensione Formato  
Musini_Leonardo_Thesis_MDMC2024-2025.pdf

accesso aperto

Tipologia: Tesi
Licenza: Non specificato
Dimensione 1.82 MB
Formato Adobe PDF
1.82 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11767/148550
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact