Motivation: Hi-C matrices are cornerstones for qualitative and quantitative studies of genome folding, from its territorial organization to compartments and topological domains. The high dynamic range of genomic distances probed in Hi-C assays reflects in an inherent stochastic background of the interactions matrices, which inevitably convolve the features of interest with largely non-specific ones. Results: Here we introduce and discuss essHi-C, a method to isolate the specific, or essential component of Hi-C matrices from the non-specific portion of the spectrum that is compatible with random matrices. Systematic comparisons show that essHi-C improves the clarity of the interaction patterns, enhances the robustness against sequencing depth of topologically associating domains identification, allows the unsupervised clustering of experiments in different cell lines and recovers the cell-cycle phasing of single-cells based on Hi-C data. Thus, essHi-C provides means for isolating significant biological and physical features from Hi-C matrices. Availability: The essHi-C software package is available at: https://github.com/stefanofranzini/essHIC . Supplementary information: Supplementary data are available at Bioinformatics online.
essHi-C: Essential component analysis of Hi-C matrices / Franzini, Stefano; Di Stefano, Marco; Micheletti, Cristian. - In: BIOINFORMATICS. - ISSN 1367-4803. - 37:15(2021), pp. 2088-2094. [10.1093/bioinformatics/btab062]
essHi-C: Essential component analysis of Hi-C matrices
Franzini, Stefano;Micheletti, Cristian
2021-01-01
Abstract
Motivation: Hi-C matrices are cornerstones for qualitative and quantitative studies of genome folding, from its territorial organization to compartments and topological domains. The high dynamic range of genomic distances probed in Hi-C assays reflects in an inherent stochastic background of the interactions matrices, which inevitably convolve the features of interest with largely non-specific ones. Results: Here we introduce and discuss essHi-C, a method to isolate the specific, or essential component of Hi-C matrices from the non-specific portion of the spectrum that is compatible with random matrices. Systematic comparisons show that essHi-C improves the clarity of the interaction patterns, enhances the robustness against sequencing depth of topologically associating domains identification, allows the unsupervised clustering of experiments in different cell lines and recovers the cell-cycle phasing of single-cells based on Hi-C data. Thus, essHi-C provides means for isolating significant biological and physical features from Hi-C matrices. Availability: The essHi-C software package is available at: https://github.com/stefanofranzini/essHIC . Supplementary information: Supplementary data are available at Bioinformatics online.File | Dimensione | Formato | |
---|---|---|---|
btab062.pdf
non disponibili
Descrizione: pdf editoriale
Tipologia:
Versione Editoriale (PDF)
Licenza:
Non specificato
Dimensione
597.2 kB
Formato
Adobe PDF
|
597.2 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.