Journal article, Molecular Biology and Evolution. Year: 2012

Efficient selection of branch-specific models of sequence evolution.


The analysis of extant sequences shows that molecular evolution has been heterogeneous through time and among lineages. For a given sequence alignment, however, it is often difficult to uncover which factors caused this heterogeneity. Identifying and characterizing heterogeneous patterns of molecular evolution along a phylogenetic tree remains very challenging for lack of appropriate methods. Users must either define a priori groups of branches along which they believe molecular evolution has been similar, or allow each branch to have its own pattern of molecular evolution. The first approach assumes prior knowledge that is seldom available, and the second requires estimating an unreasonably large number of parameters. Here we propose a convenient and reliable approach in which branches are clustered by their pattern of molecular evolution alone, with no need for prior knowledge about the data set under study. Model selection is performed in a statistical framework and therefore avoids overparameterization. We rely on substitution mapping for efficiency and present two clustering approaches, depending on whether or not neighbouring branches are expected to share more similar patterns of sequence evolution than distant branches. We validate our method on simulations and test it on four previously published data sets. We find that our method correctly groups branches sharing similar equilibrium GC contents in a data set of ribosomal RNAs and recovers expected footprints of selection through dN/dS. Importantly, it also uncovers a new pattern of relaxed selection in a phylogeny of Mantellid frogs, which we are able to correlate with life-history traits. These results suggest that our programs should be useful for studying patterns of molecular evolution and for revealing new correlations between sequence and species evolution.
Our programs can run on DNA, RNA, codon, or amino acid sequences with a large set of substitution models and are available at http://biopp.univ-montp2.fr/forge/testnh.
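As a rough illustration of the general idea of clustering branches by their substitution pattern and stopping with a statistical criterion, here is a minimal toy sketch. It is not the authors' algorithm and does not use their programs. Everything in it is an assumption made for illustration: the per-branch counts are hypothetical (for each branch, the number of GC-increasing substitutions out of all mapped substitutions), branches in a cluster are assumed to share one binomial proportion, clusters are merged greedily without regard to tree topology, and AIC stands in for whatever model selection criterion the paper actually uses.

```python
import math
from itertools import combinations


def binom_loglik(k, n):
    """Maximized binomial log-likelihood for k successes out of n trials.

    The binomial coefficient is dropped: it depends only on the per-branch
    counts, so it is the same for every partition being compared.
    """
    if n == 0 or k == 0 or k == n:
        return 0.0
    p = k / n
    return k * math.log(p) + (n - k) * math.log(1 - p)


def aic(clusters):
    """AIC with one free proportion per cluster."""
    loglik = sum(binom_loglik(k, n) for _, k, n in clusters)
    return 2 * len(clusters) - 2 * loglik


def cluster_branches(counts):
    """Greedy agglomerative clustering of branches (toy, topology-free).

    counts: dict mapping a branch name to (k, n), hypothetical mapped counts.
    Returns (partition, aic_value) for the partition with the lowest AIC
    encountered along the merge path.
    """
    # Start with one cluster per branch: (member names, pooled k, pooled n).
    clusters = [([name], k, n) for name, (k, n) in counts.items()]
    best = ([sorted(c[0]) for c in clusters], aic(clusters))

    while len(clusters) > 1:
        def loss(pair):
            # Log-likelihood lost by forcing two clusters to share one proportion.
            (_, ki, ni), (_, kj, nj) = clusters[pair[0]], clusters[pair[1]]
            return (binom_loglik(ki, ni) + binom_loglik(kj, nj)
                    - binom_loglik(ki + kj, ni + nj))

        i, j = min(combinations(range(len(clusters)), 2), key=loss)
        merged = (clusters[i][0] + clusters[j][0],
                  clusters[i][1] + clusters[j][1],
                  clusters[i][2] + clusters[j][2])
        clusters = [c for idx, c in enumerate(clusters) if idx not in (i, j)]
        clusters.append(merged)

        score = aic(clusters)
        if score < best[1]:
            best = ([sorted(c[0]) for c in clusters], score)

    return best


# Hypothetical counts: three GC-rich branches and two GC-poor branches.
toy_counts = {
    "a": (80, 100), "b": (78, 100), "c": (82, 100),
    "d": (30, 100), "e": (28, 100),
}
partition, score = cluster_branches(toy_counts)
```

On this toy input the lowest-AIC partition should separate the two groups of branches, since merging within a group costs almost no likelihood while merging across groups costs far more than the saved parameter is worth. This loosely mirrors the trade-off the abstract describes between one model per branch and too few models.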
Main file: mss059.pdf (3.8 MB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-00965698, version 1 (15-06-2021)





Julien Y. Dutheil, Nicolas Galtier, Jonathan Romiguier, Emmanuel J.P. Douzery, Vincent Ranwez, et al. Efficient selection of branch-specific models of sequence evolution. Molecular Biology and Evolution, 2012, 29 (7), pp.1861-1874. ⟨10.1093/molbev/mss059⟩. ⟨hal-00965698⟩


