We introduce a simple model that describes the average occurrence of point variations in a generic protein sequence. This model is based on the idea that mutations are more likely to be fixed at sites in contact with others that have mutated in the recent past. Therefore, we extend the usual assumptions made in protein coevolution by introducing a time damping on the effect of a substitution on its surroundings, which makes correlated substitutions happen in avalanches localized in space and time. The model correctly predicts the average correlation of substitutions as a function of their distance along the sequence. At the same time, it predicts an among-site distribution of the number of substitutions per site that is highly compatible with a negative binomial, consistent with experimental data. The promising outcomes achieved with this model encourage the application of the same ideas in the field of pairwise and multiple sequence alignment.
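The mechanism described in the abstract can be illustrated with a toy simulation. The sketch below is not the authors' model: it assumes that "contact" can be approximated by proximity along the sequence, that the time damping is exponential, and that time advances in discrete steps. The function name `simulate_substitutions` and all parameters (`base_rate`, `boost`, `tau`, `contact_range`) are hypothetical and chosen only for illustration.

```python
import math
import random


def simulate_substitutions(n_sites=100, n_steps=2000, base_rate=0.002,
                           boost=0.01, tau=10.0, contact_range=2, seed=0):
    """Toy sketch (all parameters are illustrative assumptions).

    A substitution at site j transiently raises the substitution probability
    of sites within `contact_range` of j; the effect decays as exp(-dt / tau).
    Returns the number of substitutions accumulated at each site.
    """
    rng = random.Random(seed)
    last_hit = [None] * n_sites  # time of the most recent substitution per site
    counts = [0] * n_sites       # substitutions accumulated per site
    for t in range(n_steps):
        hits = []
        for i in range(n_sites):
            rate = base_rate
            lo = max(0, i - contact_range)
            hi = min(n_sites, i + contact_range + 1)
            for j in range(lo, hi):
                if j != i and last_hit[j] is not None:
                    # Damped contribution of the neighbour's latest substitution.
                    rate += boost * math.exp(-(t - last_hit[j]) / tau)
            if rng.random() < min(rate, 1.0):
                hits.append(i)
        # Apply hits after the sweep so sites do not influence each other
        # within the same time step.
        for i in hits:
            last_hit[i] = t
            counts[i] += 1
    return counts
```

Calling `simulate_substitutions()` returns one count per site. Comparing the mean and variance of these counts, for example with `statistics.mean` and `statistics.pvariance`, gives a rough sense of how far the among-site distribution departs from a Poisson distribution under these assumptions.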
A common root for coevolution and substitution rate variability in protein sequence evolution / Rizzato, F.; Zamuner, S.; Pagnani, A.; Laio, A. - In: SCIENTIFIC REPORTS. - ISSN 2045-2322. - 9:1(2019), pp. 1-10. [10.1038/s41598-019-53958-w]
A common root for coevolution and substitution rate variability in protein sequence evolution
Rizzato F.; Zamuner S.; Pagnani A.; Laio A.
2019-01-01
File | Size | Format
---|---|---
s41598-019-53958-w.pdf (open access). Description: DOAJ Open Access. Type: Publisher's version (PDF). License: Creative Commons | 2.67 MB | Adobe PDF