Studying evolutionary correlations in alignments of homologous sequences by means of an inverse Potts model has proven useful to obtain residue-residue contact energies and to predict contacts in proteins. The quality of the results depend much on several choices of the detailed model and on the algorithms used. We built, in a very controlled way, synthetic alignments with statistical properties similar to those of real proteins, and used them to assess the performance of different inversion algorithms and of their variants. Realistic synthetic alignments display typical features of low-temperature phases of disordered systems, a feature that affects the inversion algorithms. We showed that a Boltzmann-learning algorithm is computationally feasible and performs well in predicting the energy of native contacts. However, all algorithms, when applied to alignments of realistic size, suffer of false positives quite equally, making the quality of the prediction of native contacts with the different algorithm much system-dependent.

Statistical mechanical properties of sequence space determine the efficiency of the various algorithms to predict interaction energies and native contacts from protein coevolution / Franco, G.; Cagiada, M.; Bussi, G.; Tiana, G.. - In: PHYSICAL BIOLOGY. - ISSN 1478-3967. - 16:4(2019), pp. 1-14. [10.1088/1478-3975/ab1c15]

Statistical mechanical properties of sequence space determine the efficiency of the various algorithms to predict interaction energies and native contacts from protein coevolution

Bussi G.;
2019-01-01

Abstract

Studying evolutionary correlations in alignments of homologous sequences by means of an inverse Potts model has proven useful to obtain residue-residue contact energies and to predict contacts in proteins. The quality of the results depend much on several choices of the detailed model and on the algorithms used. We built, in a very controlled way, synthetic alignments with statistical properties similar to those of real proteins, and used them to assess the performance of different inversion algorithms and of their variants. Realistic synthetic alignments display typical features of low-temperature phases of disordered systems, a feature that affects the inversion algorithms. We showed that a Boltzmann-learning algorithm is computationally feasible and performs well in predicting the energy of native contacts. However, all algorithms, when applied to alignments of realistic size, suffer of false positives quite equally, making the quality of the prediction of native contacts with the different algorithm much system-dependent.
2019
16
4
1
14
046007
https://arxiv.org/abs/1902.01155
http://hdl.handle.net/2434/647912
https://air.unimi.it/retrieve/handle/2434/647912/1235885/SuppMat.pdf
Franco, G.; Cagiada, M.; Bussi, G.; Tiana, G.
File in questo prodotto:
File Dimensione Formato  
Franco_2019_Phys._Biol._16_046007.pdf

non disponibili

Tipologia: Versione Editoriale (PDF)
Licenza: Non specificato
Dimensione 1.57 MB
Formato Adobe PDF
1.57 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11767/95976
Citazioni
  • ???jsp.display-item.citation.pmc??? 2
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 4
social impact