Unsupervised Learning Universal Critical Behavior via the Intrinsic Dimension / Mendes-Santos, T.; Turkeshi, X.; Dalmonte, M.; Rodriguez, Alex. - In: PHYSICAL REVIEW. X. - ISSN 2160-3308. - 11:1(2021), pp. 1-17. [10.1103/PhysRevX.11.011040]

Unsupervised Learning Universal Critical Behavior via the Intrinsic Dimension

Mendes-Santos, T.; Turkeshi, X.; Dalmonte, M.; Rodriguez, Alex
2021-01-01

Abstract

The identification of universal properties from minimally processed data sets is one goal of machine learning techniques applied to statistical physics. Here, we study how the intrinsic dimension (Id), defined as the minimum number of variables needed to accurately describe the important features of a data set, behaves in the vicinity of phase transitions. We employ state-of-the-art nearest-neighbors-based Id estimators to compute the Id of raw Monte Carlo thermal configurations across different phase transitions: first-order, second-order, and Berezinskii-Kosterlitz-Thouless. In all the cases considered, we find that the Id uniquely characterizes the transition regime. The finite-size analysis of the Id allows us not only to identify critical points with an accuracy comparable to methods that rely on a priori identification of order parameters, but also to determine the corresponding critical exponent ν in the case of continuous transitions. For topological transitions, this analysis overcomes the reported limitations affecting other unsupervised learning methods. Our work reveals how raw data sets display unique signatures of universal behavior in the absence of any dimensional reduction scheme, and suggests a direct parallelism between conventional order parameters in real space and the intrinsic dimension in the data space.
Year: 2021
Volume: 11
Issue: 1
Pages: 1-17
Article number: 011040
DOI: 10.1103/PhysRevX.11.011040
arXiv: https://arxiv.org/abs/2006.12953
Authors: Mendes-Santos, T.; Turkeshi, X.; Dalmonte, M.; Rodriguez, Alex
Files in this record:
File: PhysRevX.11.011040.pdf
Access: open access
Description: publisher's PDF
Type: Publisher's version (PDF)
License: Creative Commons
Size: 3.58 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.11767/124771
Citations
  • Scopus: 37
  • Web of Science (ISI): 37