We introduce a simple model that describes the average occurrence of point variations in a generic protein sequence. This model is based on the idea that mutations are more likely to be fixed at sites in contact with others that have mutated in the recent past. Therefore, we extend the usual assumptions made in protein coevolution by introducing a time dumping on the effect of a substitution on its surrounding and makes correlated substitutions happen in avalanches localized in space and time. The model correctly predicts the average correlation of substitutions as a function of their distance along the sequence. At the same time, it predicts an among-site distribution of the number of substitutions per site highly compatible with a negative binomial, consistently with experimental data. The promising outcomes achieved with this model encourage the application of the same ideas in the field of pairwise and multiple sequence alignment.

A common root for coevolution and substitution rate variability in protein sequence evolution / Rizzato, F.; Zamuner, S.; Pagnani, A.; Laio, A.. - In: SCIENTIFIC REPORTS. - ISSN 2045-2322. - 9:1(2019), pp. 1-10. [10.1038/s41598-019-53958-w]

A common root for coevolution and substitution rate variability in protein sequence evolution

Rizzato F.;Zamuner S.;Pagnani A.;Laio A.
2019-01-01

Abstract

We introduce a simple model that describes the average occurrence of point variations in a generic protein sequence. This model is based on the idea that mutations are more likely to be fixed at sites in contact with others that have mutated in the recent past. Therefore, we extend the usual assumptions made in protein coevolution by introducing a time dumping on the effect of a substitution on its surrounding and makes correlated substitutions happen in avalanches localized in space and time. The model correctly predicts the average correlation of substitutions as a function of their distance along the sequence. At the same time, it predicts an among-site distribution of the number of substitutions per site highly compatible with a negative binomial, consistently with experimental data. The promising outcomes achieved with this model encourage the application of the same ideas in the field of pairwise and multiple sequence alignment.
2019
9
1
1
10
18032
https://www.nature.com/articles/s41598-019-53958-w#rightslink
Rizzato, F.; Zamuner, S.; Pagnani, A.; Laio, A.
File in questo prodotto:
File Dimensione Formato  
s41598-019-53958-w.pdf

accesso aperto

Descrizione: DOAJ Open Access
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 2.67 MB
Formato Adobe PDF
2.67 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11767/110828
Citazioni
  • ???jsp.display-item.citation.pmc??? 3
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 4
social impact