Motivation: Hi-C matrices are cornerstones for qualitative and quantitative studies of genome folding, from its territorial organization to compartments and topological domains. The high dynamic range of genomic distances probed in Hi-C assays reflects in an inherent stochastic background of the interactions matrices, which inevitably convolve the features of interest with largely non-specific ones. Results: Here we introduce and discuss essHi-C, a method to isolate the specific, or essential component of Hi-C matrices from the non-specific portion of the spectrum that is compatible with random matrices. Systematic comparisons show that essHi-C improves the clarity of the interaction patterns, enhances the robustness against sequencing depth of topologically associating domains identification, allows the unsupervised clustering of experiments in different cell lines and recovers the cell-cycle phasing of single-cells based on Hi-C data. Thus, essHi-C provides means for isolating significant biological and physical features from Hi-C matrices. Availability: The essHi-C software package is available at: https://github.com/stefanofranzini/essHIC . Supplementary information: Supplementary data are available at Bioinformatics online.

essHi-C: Essential component analysis of Hi-C matrices / Franzini, Stefano; Di Stefano, Marco; Micheletti, Cristian. - In: BIOINFORMATICS. - ISSN 1367-4803. - 37:15(2021), pp. 2088-2094. [10.1093/bioinformatics/btab062]

essHi-C: Essential component analysis of Hi-C matrices

Franzini, Stefano;Micheletti, Cristian
2021-01-01

Abstract

Motivation: Hi-C matrices are cornerstones for qualitative and quantitative studies of genome folding, from its territorial organization to compartments and topological domains. The high dynamic range of genomic distances probed in Hi-C assays reflects in an inherent stochastic background of the interactions matrices, which inevitably convolve the features of interest with largely non-specific ones. Results: Here we introduce and discuss essHi-C, a method to isolate the specific, or essential component of Hi-C matrices from the non-specific portion of the spectrum that is compatible with random matrices. Systematic comparisons show that essHi-C improves the clarity of the interaction patterns, enhances the robustness against sequencing depth of topologically associating domains identification, allows the unsupervised clustering of experiments in different cell lines and recovers the cell-cycle phasing of single-cells based on Hi-C data. Thus, essHi-C provides means for isolating significant biological and physical features from Hi-C matrices. Availability: The essHi-C software package is available at: https://github.com/stefanofranzini/essHIC . Supplementary information: Supplementary data are available at Bioinformatics online.
2021
37
15
2088
2094
10.1093/bioinformatics/btab062
https://academic.oup.com/bioinformatics/article/37/15/2088/6125382?login=true
https://arxiv.org/abs/2101.10645
Franzini, Stefano; Di Stefano, Marco; Micheletti, Cristian
File in questo prodotto:
File Dimensione Formato  
btab062.pdf

non disponibili

Descrizione: pdf editoriale
Tipologia: Versione Editoriale (PDF)
Licenza: Non specificato
Dimensione 597.2 kB
Formato Adobe PDF
597.2 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11767/126391
Citazioni
  • ???jsp.display-item.citation.pmc??? 0
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact