Divide-and-conquer potentials enable scalable and accurate predictions of forces and energies in atomistic systems / Zeni, Claudio; Anelli, Andrea; Glielmo, Aldo; de Gironcoli, Stefano; Rossi, Kevin. - In: DIGITAL DISCOVERY. - ISSN 2635-098X. - 3:1(2024), pp. 113-121. [10.1039/d3dd00155e]

Divide-and-conquer potentials enable scalable and accurate predictions of forces and energies in atomistic systems

Zeni, Claudio; Anelli, Andrea; Glielmo, Aldo; de Gironcoli, Stefano; Rossi, Kevin
2024-01-01

Abstract

In committee-of-experts strategies, small datasets are extracted from a larger one and utilised to train multiple models. The models' predictions are then weighted so that each estimate is dominated by the model(s) best informed in the corresponding domain of the data manifold. Here, we show how this divide-and-conquer philosophy provides a route to building machine learning potentials for atomistic systems that is general across systems of different natures and efficiently scalable by construction. We benchmark this approach on various datasets and demonstrate that divide-and-conquer linear potentials are more accurate than their single-model counterparts, while incurring little to no extra computational cost.

A divide-and-conquer strategy, in which small datasets are extracted from a larger one and utilised to train multiple models that are then carefully combined for prediction, provides an avenue for accurate machine learning potentials.
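The weighting idea in the abstract can be illustrated with a toy one-dimensional example. The sketch below is not the authors' implementation: the data, the nearest-centre split, the Gaussian weights, the `width` parameter, and all function names are assumptions chosen only to show how several linear models, each trained on a subset, can be blended so that the locally best-informed model dominates.

```python
import numpy as np

def features(x):
    """Features of a linear model: a bias term and x (illustrative choice)."""
    return np.stack([np.ones_like(x), x], axis=1)

def fit_ridge(X, y, reg=1e-8):
    """Closed-form ridge regression; returns the coefficient vector."""
    A = X.T @ X + reg * np.eye(X.shape[1])
    return np.linalg.solve(A, X.T @ y)

def fit_committee(x, y, centres):
    """Divide: assign each sample to its nearest centre.
    Conquer: fit one linear model per subset."""
    labels = np.argmin(np.abs(x[:, None] - centres[None, :]), axis=1)
    return [fit_ridge(features(x[labels == k]), y[labels == k])
            for k in range(len(centres))]

def predict_committee(x, centres, models, width=0.5):
    """Blend the experts with normalised Gaussian weights (an assumed scheme),
    so the model whose centre is closest to x dominates the estimate."""
    d2 = (x[:, None] - centres[None, :]) ** 2
    logits = -d2 / (2.0 * width ** 2)
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    w = np.exp(logits)
    w /= w.sum(axis=1, keepdims=True)
    preds = np.stack([features(x) @ m for m in models], axis=1)
    return (w * preds).sum(axis=1)

# Toy target that no single linear model can represent: y = |x|.
rng = np.random.default_rng(0)
x = rng.uniform(-3.0, 3.0, 400)
y = np.abs(x)

centres = np.array([-1.5, 1.5])        # hypothetical domain centres
models = fit_committee(x, y, centres)  # two local linear models
single = fit_ridge(features(x), y)     # one global linear model

x_test = np.linspace(-3.0, 3.0, 200)
y_test = np.abs(x_test)
err_committee = np.sqrt(np.mean(
    (predict_committee(x_test, centres, models) - y_test) ** 2))
err_single = np.sqrt(np.mean((features(x_test) @ single - y_test) ** 2))
print(f"RMSE committee: {err_committee:.4f}, single model: {err_single:.4f}")
```

In this toy setting each expert is as cheap to evaluate as the single global model, and prediction adds only the weight computation, which mirrors in spirit the low overhead the abstract reports for linear potentials.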
Year: 2024
Volume: 3
Issue: 1
Pages: 113-121
DOI: 10.1039/d3dd00155e
Files in this item:
File: d3dd00155e.pdf
Access: open access
Type: Publisher's version (PDF)
License: Creative Commons
Size: 1.79 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.11767/137571