Understanding which parts of a dynamical system cause each other is extremely relevant in fundamental and applied sciences. However, inferring causal links from observational data, namely, without direct manipulations of the system, is still computationally challenging, especially if the data are high dimensional. In this Letter we introduce a framework for constructing causal graphs from high-dimensional time series, whose computational cost scales linearly with the number of variables. The approach is based on the automatic identification of dynamical communities, groups of variables which mutually influence each other and can therefore be described as a single node in a causal graph. These communities are efficiently identified by optimizing the information imbalance, a statistical quantity that assigns a weight to each putative causal variable based on its information content relative to a target variable. The communities are then ordered starting from the fully autonomous ones, whose evolution is independent from all the others, to those that are progressively dependent on other communities, building in this manner a community causal graph. We demonstrate the computational efficiency and the accuracy of our approach on discrete-time and continuous-time dynamical systems including up to 80 variables.
Linear Scaling Causal Discovery from High-Dimensional Time Series by Dynamical Community Detection / Allione, Matteo; Del Tatto, Vittorio; Laio, Alessandro. - In: PHYSICAL REVIEW LETTERS. - ISSN 0031-9007. - 135:4(2025). [10.1103/kd73-93cg]
Linear Scaling Causal Discovery from High-Dimensional Time Series by Dynamical Community Detection
Allione, Matteo;Del Tatto, Vittorio;Laio, Alessandro
2025-01-01
Abstract
Understanding which parts of a dynamical system cause each other is extremely relevant in fundamental and applied sciences. However, inferring causal links from observational data, namely, without direct manipulations of the system, is still computationally challenging, especially if the data are high dimensional. In this Letter we introduce a framework for constructing causal graphs from high-dimensional time series, whose computational cost scales linearly with the number of variables. The approach is based on the automatic identification of dynamical communities, groups of variables which mutually influence each other and can therefore be described as a single node in a causal graph. These communities are efficiently identified by optimizing the information imbalance, a statistical quantity that assigns a weight to each putative causal variable based on its information content relative to a target variable. The communities are then ordered starting from the fully autonomous ones, whose evolution is independent from all the others, to those that are progressively dependent on other communities, building in this manner a community causal graph. We demonstrate the computational efficiency and the accuracy of our approach on discrete-time and continuous-time dynamical systems including up to 80 variables.| File | Dimensione | Formato | |
|---|---|---|---|
|
kd73-93cg.pdf
non disponibili
Descrizione: pdf editoriale
Tipologia:
Versione Editoriale (PDF)
Licenza:
Non specificato
Dimensione
512.96 kB
Formato
Adobe PDF
|
512.96 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


