Understanding which parts of a dynamical system cause each other is extremely relevant in fundamental and applied sciences. However, inferring causal links from observational data, namely, without direct manipulations of the system, is still computationally challenging, especially if the data are high dimensional. In this Letter we introduce a framework for constructing causal graphs from high-dimensional time series, whose computational cost scales linearly with the number of variables. The approach is based on the automatic identification of dynamical communities, groups of variables which mutually influence each other and can therefore be described as a single node in a causal graph. These communities are efficiently identified by optimizing the information imbalance, a statistical quantity that assigns a weight to each putative causal variable based on its information content relative to a target variable. The communities are then ordered starting from the fully autonomous ones, whose evolution is independent from all the others, to those that are progressively dependent on other communities, building in this manner a community causal graph. We demonstrate the computational efficiency and the accuracy of our approach on discrete-time and continuous-time dynamical systems including up to 80 variables.

Linear Scaling Causal Discovery from High-Dimensional Time Series by Dynamical Community Detection / Allione, Matteo; Del Tatto, Vittorio; Laio, Alessandro. - In: PHYSICAL REVIEW LETTERS. - ISSN 0031-9007. - 135:4(2025). [10.1103/kd73-93cg]

Linear Scaling Causal Discovery from High-Dimensional Time Series by Dynamical Community Detection

Allione, Matteo;Del Tatto, Vittorio;Laio, Alessandro
2025-01-01

Abstract

Understanding which parts of a dynamical system cause each other is extremely relevant in fundamental and applied sciences. However, inferring causal links from observational data, namely, without direct manipulations of the system, is still computationally challenging, especially if the data are high dimensional. In this Letter we introduce a framework for constructing causal graphs from high-dimensional time series, whose computational cost scales linearly with the number of variables. The approach is based on the automatic identification of dynamical communities, groups of variables which mutually influence each other and can therefore be described as a single node in a causal graph. These communities are efficiently identified by optimizing the information imbalance, a statistical quantity that assigns a weight to each putative causal variable based on its information content relative to a target variable. The communities are then ordered starting from the fully autonomous ones, whose evolution is independent from all the others, to those that are progressively dependent on other communities, building in this manner a community causal graph. We demonstrate the computational efficiency and the accuracy of our approach on discrete-time and continuous-time dynamical systems including up to 80 variables.
2025
135
4
047401
https://arxiv.org/abs/2501.10886
Allione, Matteo; Del Tatto, Vittorio; Laio, Alessandro
File in questo prodotto:
File Dimensione Formato  
kd73-93cg.pdf

non disponibili

Descrizione: pdf editoriale
Tipologia: Versione Editoriale (PDF)
Licenza: Non specificato
Dimensione 512.96 kB
Formato Adobe PDF
512.96 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11767/147172
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact