Journal articles

Efficient selection of branch-specific models of sequence evolution.

Abstract : The analysis of extant sequences shows that molecular evolution has been heterogeneous through time and among lineages. However, for a given sequence alignment, it is often difficult to uncover what factors caused this heterogeneity. In fact, identifying and characterizing heterogeneous patterns of molecular evolution along a phylogenetic tree is very challenging, for lack of appropriate methods. Users either have to a priori define groups of branches along which they believe molecular evolution has been similar or have to allow each branch to have its own pattern of molecular evolution. The first approach assumes prior knowledge that is seldom available, and the second requires estimating an unreasonably large number of parameters. Here we propose a convenient and reliable approach where branches get clustered by their pattern of molecular evolution alone, with no need for prior knowledge about the data set under study. Model selection is achieved in a statistical framework and therefore avoids overparameterization. We rely on substitution mapping for efficiency and present two clustering approaches, depending on whether or not we expect neighbouring branches to share more similar patterns of sequence evolution than distant branches. We validate our method on simulations and test it on four previously published data sets. We find that our method correctly groups branches sharing similar equilibrium GC contents in a data set of ribosomal RNAs and recovers expected footprints of selection through dN/dS. Importantly, it also uncovers a new pattern of relaxed selection in a phylogeny of Mantellid frogs, which we are able to correlate to life-history traits. This shows that our programs should be very useful to study patterns of molecular evolution and reveal new correlations between sequence and species evolution. 
Our programs can run on DNA, RNA, codon, or amino acid sequences with a large set of possible models of substitutions and are available at http://biopp.univ-montp2.fr/forge/testnh.
https://hal.archives-ouvertes.fr/hal-00965698
Contributor : Laurence Naiglin
Submitted on : Tuesday, June 15, 2021 - 11:42:51 AM
Last modification on : Friday, October 22, 2021 - 2:58:35 PM
Long-term archiving on : Thursday, September 16, 2021 - 6:30:49 PM

File

mss059.pdf
Publisher files allowed on an open archive

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Citation

Julien Y Dutheil, Nicolas Galtier, Jonathan Romiguier, Emmanuel Douzery, Vincent Ranwez, et al. Efficient selection of branch-specific models of sequence evolution. Molecular Biology and Evolution, Oxford University Press (OUP), 2012, 29 (7), pp.1861-1874. ⟨10.1093/molbev/mss059⟩. ⟨hal-00965698⟩
