In committee of experts strategies, small datasets are extracted from a larger one and utilised for the training of multiple models. These models' predictions are then carefully weighted so as to obtain estimates which are dominated by the model(s) that are most informed in each domain of the data manifold. Here, we show how this divide-and-conquer philosophy provides an avenue in the making of machine learning potentials for atomistic systems, which is general across systems of different natures and efficiently scalable by construction. We benchmark this approach on various datasets and demonstrate that divide-and-conquer linear potentials are more accurate than their single model counterparts, while incurring little to no extra computational cost.A divide-and-conquer strategy - where small datasets are extracted from a larger one and utilised to train multiple models, which are then carefully combined for prediction - provides an avenue for accurate machine learning potentials.
Divide-and-conquer potentials enable scalable and accurate predictions of forces and energies in atomistic systems / Zeni, Claudio; Anelli, Andrea; Glielmo, Aldo; de Gironcoli, Stefano; Rossi, Kevin. - In: DIGITAL DISCOVERY. - ISSN 2635-098X. - 3:1(2024), pp. 113-121. [10.1039/d3dd00155e]
Divide-and-conquer potentials enable scalable and accurate predictions of forces and energies in atomistic systems
Zeni, Claudio;Anelli, Andrea;Glielmo, Aldo;de Gironcoli, Stefano;Rossi, Kevin
2024-01-01
Abstract
In committee of experts strategies, small datasets are extracted from a larger one and utilised for the training of multiple models. These models' predictions are then carefully weighted so as to obtain estimates which are dominated by the model(s) that are most informed in each domain of the data manifold. Here, we show how this divide-and-conquer philosophy provides an avenue in the making of machine learning potentials for atomistic systems, which is general across systems of different natures and efficiently scalable by construction. We benchmark this approach on various datasets and demonstrate that divide-and-conquer linear potentials are more accurate than their single model counterparts, while incurring little to no extra computational cost.A divide-and-conquer strategy - where small datasets are extracted from a larger one and utilised to train multiple models, which are then carefully combined for prediction - provides an avenue for accurate machine learning potentials.File | Dimensione | Formato | |
---|---|---|---|
d3dd00155e.pdf
accesso aperto
Tipologia:
Versione Editoriale (PDF)
Licenza:
Creative commons
Dimensione
1.79 MB
Formato
Adobe PDF
|
1.79 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.