Parsimony and maximum-likelihood phylogenetic analyses of morphology do not generally integrate uncertainty in inferring evolutionary history: a response to Brown et al.
2017; Royal Society; Volume: 284; Issue: 1864 Linguagem: Inglês
10.1098/rspb.2017.1636
ISSN1471-2954
AutoresMark N. Puttick, Joseph O’Reilly, Derek Oakley, Alistair R. Tanner, James F. Fleming, James Clark, Lucy Holloway, Jesús Lozano-Fernández, Luke A. Parry, James E. Tarver, Davide Pisani, Philip C. J. Donoghue,
Tópico(s)Genetic diversity and population structure
ResumoYou have accessMoreSectionsView PDF ToolsAdd to favoritesDownload CitationsTrack Citations ShareShare onFacebookTwitterLinked InRedditEmail Cite this article Puttick Mark N., O'Reilly Joseph E., Oakley Derek, Tanner Alistair R., Fleming James F., Clark James, Holloway Lucy, Lozano-Fernandez Jesus, Parry Luke A., Tarver James E., Pisani Davide and Donoghue Philip C. J. 2017Parsimony and maximum-likelihood phylogenetic analyses of morphology do not generally integrate uncertainty in inferring evolutionary history: a response to Brown et al.Proc. R. Soc. B.2842017163620171636http://doi.org/10.1098/rspb.2017.1636SectionYou have accessInvited replyParsimony and maximum-likelihood phylogenetic analyses of morphology do not generally integrate uncertainty in inferring evolutionary history: a response to Brown et al. Mark N. Puttick Mark N. Puttick http://orcid.org/0000-0002-1011-3442 School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Department of Life Sciences, The Natural History Museum, Cromwell Road, South Kensington, London SW7 5BD, UK Google Scholar Find this author on PubMed Search for more papers by this author , Joseph E. O'Reilly Joseph E. O'Reilly http://orcid.org/0000-0001-9775-253X School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed Search for more papers by this author , Derek Oakley Derek Oakley School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed Search for more papers by this author , Alistair R. Tanner Alistair R. Tanner School of Biological Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed Search for more papers by this author , James F. Fleming James F. Fleming School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed Search for more papers by this author , James Clark James Clark http://orcid.org/0000-0003-2896-1631 School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed Search for more papers by this author , Lucy Holloway Lucy Holloway http://orcid.org/0000-0003-1603-2296 School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed Search for more papers by this author , Jesus Lozano-Fernandez Jesus Lozano-Fernandez http://orcid.org/0000-0003-3597-1221 School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed Search for more papers by this author , Luke A. Parry Luke A. Parry http://orcid.org/0000-0002-3910-0346 School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed Search for more papers by this author , James E. Tarver James E. Tarver School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed Search for more papers by this author , Davide Pisani Davide Pisani School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK School of Biological Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK [email protected] Google Scholar Find this author on PubMed Search for more papers by this author and Philip C. J. Donoghue Philip C. J. Donoghue http://orcid.org/0000-0003-3116-7463 School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK [email protected] Google Scholar Find this author on PubMed Search for more papers by this author Mark N. Puttick Mark N. Puttick http://orcid.org/0000-0002-1011-3442 School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Department of Life Sciences, The Natural History Museum, Cromwell Road, South Kensington, London SW7 5BD, UK Google Scholar Find this author on PubMed , Joseph E. O'Reilly Joseph E. O'Reilly http://orcid.org/0000-0001-9775-253X School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed , Derek Oakley Derek Oakley School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed , Alistair R. Tanner Alistair R. Tanner School of Biological Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed , James F. Fleming James F. Fleming School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed , James Clark James Clark http://orcid.org/0000-0003-2896-1631 School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed , Lucy Holloway Lucy Holloway http://orcid.org/0000-0003-1603-2296 School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed , Jesus Lozano-Fernandez Jesus Lozano-Fernandez http://orcid.org/0000-0003-3597-1221 School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed , Luke A. Parry Luke A. Parry http://orcid.org/0000-0002-3910-0346 School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed , James E. Tarver James E. Tarver School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK Google Scholar Find this author on PubMed , Davide Pisani Davide Pisani School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK School of Biological Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK [email protected] Google Scholar Find this author on PubMed and Philip C. J. Donoghue Philip C. J. Donoghue http://orcid.org/0000-0003-3116-7463 School of Earth Sciences, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK [email protected] Google Scholar Find this author on PubMed Published:11 October 2017https://doi.org/10.1098/rspb.2017.1636Our recent study evaluated the performance of parsimony and probabilistic models of phylogenetic inference based on categorical data [1]. We found that a Bayesian implementation of a probabilistic Markov model produced more accurate results than either of the competing parsimony approaches (the main method currently employed), and the maximum-likelihood implementation of the same model. This occurs principally because the results of Bayesian analyses are less resolved (less precise) as a measure of topological uncertainty is intrinsically recovered in this MCMC-based approach and can be used to construct a majority-rule consensus tree that reflects this. Of the three main methods, maximum likelihood performed the worst of all as a single exclusively bifurcating tree is estimated in this framework which does not integrate an intrinsic measure of support.In their comment on our article, Brown et al. [2] argue that our experiments are invalid because we did not employ uncertainty measures after obtaining a maximum-likelihood estimate of the topology. When bootstrapping is employed, a 50% consensus tree constructed from the bootstrap distribution is often indistinguishable from the majority-rule consensus tree constructed from the posterior sample obtained in a Bayesian analysis. This result is not entirely unexpected, as the maximum-likelihood and Bayesian statistical frameworks share many statistical similarities, including a dependence on a likelihood function that incorporates the Mk model in this context. On this basis, Brown and colleagues conclude that they cannot advocate one method of phylogenetic inference over another: Bayesian, maximum-likelihood and parsimony methods differ, and thoughtful consideration is required in order to choose among these methods. Unfortunately, their analyses do not wholly support this conclusion because they exclusively focus on the performance of the two implementations of the same probabilistic model, without considering their performance relative to parsimony. This was a key aspect of our study comparing the primary methods of phylogenetic reconstruction as they are commonly implemented. Our and other previous studies [1,3,4] reject parsimony in favour of a Bayesian MCMC framework in which uncertainty is incorporated, further drawing into question the veracity of Brown and colleagues' assertion that there is equivalent performance among methods.The principle thrust of the argument presented by Brown et al. [2] is that the experiments performed by us [1] did not allow for a fair comparison between phylogenetic methods: the Bayesian implementation intrinsically integrates uncertainty, while it is common practice to evaluate uncertainty post hoc for maximum-likelihood and parsimony inferences using bootstrap methodology. In our study [1], we explicitly addressed this issue in two ways. The first argument was that bootstrapping is not an intrinsic aspect of maximum-likelihood estimation or parsimony phylogenetic analysis. Thus, we did not need to consider support values in our analyses. Using Bayesian estimation, it is intractable to analytically estimate topology using the Mk model and so it is necessary to use an MCMC sampling procedure to produce a posterior sample of trees. From this approximation of the posterior distribution, it is straightforward to interpret a 50% majority-rule consensus tree and clade support measures (posterior probabilities), unlike analogous measures produced from bootstrapping [5]. Our second argument was that bootstrapping is arguably unsuited to analysis of morphological data because its statistical expectations are not met, viz. that the phylogenetic signal is not independently and identically distributed through the data, which is a view common to phylogenetic textbooks (e.g. [6–8]). Brown et al. [2] correctly highlight that this is an issue shared by both Bayesian and maximum-likelihood implementations of the Mk model, as independence is assumed when summing the log-likelihood of individual characters. However, the interpretation of posterior probabilities as the probability of observing a clade given the morphological data is straightforward, whereas the exact meaning of a bootstrap proportion is still equivocal, with numerous proposed interpretations [9], all of which are contingent on the maximum-likelihood estimate of topology.We agree that bootstrapping has been used commonly in phylogenetic reconstruction, including analyses based on morphological traits, to assign a level of support to the constituent nodes of a most parsimonious or maximum-likelihood topology estimate. In this sense, our experiments could be viewed as failing to faithfully simulate common practice. However, while it is common practice to measure support for the clades through bootstrapping in maximum-likelihood and parsimony phylogenetic analyses of morphological traits, most studies present these support measures on fully resolved topology estimates that include nodes with negligible support, rather than collapsing nodes that exhibit less than 50% support into soft polytomies, as Brown et al. suggest [2]. To underline the prevalence of this approach, we reviewed studies citing Lewis [10], the originator of the Mk model, published since the start of this year, as recorded in Web of Science (census date 14 June 2017). Of the 48 citing articles (see the electronic supplementary material), 31 phylogenetic studies were based on morphological traits, in whole or in part. Of the 11 studies that employed maximum likelihood, 10 evaluated bootstrap support, all of which resolved nodes with less than 50% support. The same pattern is seen in parsimony analyses where, among 18 studies, only 12 evaluated bootstrap support, of which eight resolved nodes with less than 50% support—though these nodes were usually supported by other metrics like Bremer support. Resolution of unsupported nodes is less prevalent in Bayesian analyses where, among the 29 studies examined (27 of which presented posterior probabilities), only 12 resolved unsupported nodes; many of these were in maximum clade credibility trees. Unsupported nodes were present in Bayesian trees in only two of the nine studies that employed both maximum-likelihood and Bayesian analyses. Thus, while many of these studies present maximum-likelihood- and parsimony-based trees that are more fully resolved than their support measures should perhaps permit, when they are associated with parallel Bayesian analyses, these are invariably summarized by majority rule consensus.Hence, the experiments presented in our paper [1] followed common practice, as demonstrated by the literature. Brown et al. [2] are correct in their view that measures of support are widely employed in phylogenetics, and poorly supported clades should be collapsed in maximum-likelihood or maximum parsimony topologies. However, most maximum-likelihood- and parsimony-based studies effectively ignore post hoc topological support measures in their inferences of evolutionary history, which are most often based on more fully resolved, maximum-likelihood and parsimony trees. Practice shows that the same is not true of Bayesian analyses, which are usually summarized by the majority rule consensus (though some studies also seek further resolution using other methods for summarizing a distribution of trees, such as maximum clade credibility). Therefore, based on current use of phylogenetic models, our support for Bayesian inference is validated based on the current practice used by phylogeneticists.In effect, Brown et al. [2] have not addressed the core questions of our study. Rather, they have extended the experiments we undertook, with a different aim, and they have extended the conclusions. They observe that when clade support is considered, maximum-likelihood and Bayesian implementations of the Mk model perform equally well. This is an important observation that will provide some confidence in maximum-likelihood-based analyses of morphological trait data—just as soon as common practice catches up with the need to control for topological uncertainty when inferring evolutionary history.Brown et al. [2] close out their manuscript without advocating a method of phylogenetic inference and, indeed, argue that there is no superior method. Suitable methods, they argue, should be identified in each instance given the biological question at hand. In so doing, they explicitly draw parsimony back into consideration—despite the fact that their analyses do not address this method. This declaration ignores previous studies that highlight the inaccuracy of parsimony [1,3,4], to which they present no counter-evidence. The focus of our study was an objective comparison of the efficacy of the primary methods of phylogenetic reconstruction, including parsimony, as commonly implemented by practitioners. Our experimental design, focused on such common practices, is valid, as are the results, interpretations and conclusions that we derived from our experiments.FootnotesThe accompanying comment can be viewed at http://dx.doi.org/10.1098/rspb.2017.0986.Electronic supplementary material is available online at https://dx.doi.org/10.6084/m9.figshare.c.3888106.© 2017 The Author(s)Published by the Royal Society. All rights reserved.References1Puttick MNet al.2017Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data. Proc. R. Soc. B 284, 20162290. (doi:10.1098/rspb.2016.2290) Link, ISI, Google Scholar2Brown JW, Parins-Fukuchi C, Stull GW, Vargas OM, Smith SA. 2017Bayesian and likelihood phylogenetic reconstructions of morphological traits are not discordant when taking uncertainty into consideration: a comment on Puttick et al.Proc. R. Soc. B 284, 20170986. (doi:10.1098/rspb.2017.0986) Link, ISI, Google Scholar3O'Reilly JE, Puttick MN, Parry LA, Tanner AR, Tarver JE, Fleming J, Pisani D, Donoghue PCJ. 2016Bayesian methods outperform parsimony but at the expense of precision in the estimation of phylogeny from discrete morphological data. Biol. Lett. 12, 20160081. (doi:10.1098/rsbl.2016.0081) Link, ISI, Google Scholar4Wright AM, Hillis DM. 2014Bayesian analysis using a simple likelihood model outperforms parsimony for estimation of phylogeny from discrete morphological data. PLoS ONE 9, e109210. (doi:10.1371/journal.pone.0109210) Crossref, PubMed, ISI, Google Scholar5Yang Z, Rannala B. 2005Branch-length prior influences Bayesian posterior probability of phylogeny. Syst. Biol. 54, 455–470. (doi:10.1080/10635150590945313) Crossref, PubMed, ISI, Google Scholar6Felsenstein J. 2004Inferring phylogenies. Sunderland, MA: Sinauer. Google Scholar7Kitching IJ, Forey PL, Humphries CJ, Williams DM. 1998Cladistics: the theory and practice of parsimony analysis, 2nd edn. Oxford, UK: Oxford University Press. Google Scholar8Schuh RT. 2000Biological systematics: principles and applications. Ithaca, NY: Constock. Google Scholar9Yang Z. 2006Computational molecular evolution. Oxford, UK: Ocford University Press. Crossref, Google Scholar10Lewis PO. 2001A likelihood approach to estimating phylogeny from discrete morphological character data. Syst. Biol. 50, 913–925. Crossref, PubMed, ISI, Google Scholar Previous ArticleNext Article VIEW FULL TEXT DOWNLOAD PDF FiguresRelatedReferencesDetailsCited byWu R, Pisani D and Donoghue P (2023) The unbearable uncertainty of panarthropod relationships, Biology Letters, 19:1, Online publication date: 1-Jan-2023. Murphy J, Puttick M, O'Reilly J, Pisani D, Donoghue P and Ruta M (2021) Empirical distributions of homoplasy in morphological data, Palaeontology, 10.1111/pala.12535, 64:4, (505-518), Online publication date: 1-Jul-2021. Howard R, Edgecombe G, Shi X, Hou X and Ma X (2020) Ancestral morphology of Ecdysozoa constrained by an early Cambrian stem group ecdysozoan, BMC Evolutionary Biology, 10.1186/s12862-020-01720-6, 20:1, Online publication date: 1-Dec-2020. Keating J, Sansom R, Sutton M, Knight C, Garwood R and Serb J (2020) Morphological Phylogenetics Evaluated Using Novel Evolutionary Simulations, Systematic Biology, 10.1093/sysbio/syaa012, 69:5, (897-912), Online publication date: 1-Sep-2020. Luque J, Feldmann R, Vernygora O, Schweitzer C, Cameron C, Kerr K, Vega F, Duque A, Strange M, Palmer A and Jaramillo C (2019) Exceptional preservation of mid-Cretaceous marine arthropods and the evolution of novel forms via heterochrony, Science Advances, 10.1126/sciadv.aav3875, 5:4, (eaav3875), Online publication date: 1-Apr-2019. Allen B, Stubbs T, Benton M, Puttick M and Mannion P (2018) Archosauromorph extinction selectivity during the Triassic-Jurassic mass extinction, Palaeontology, 10.1111/pala.12399, 62:2, (211-224), Online publication date: 1-Mar-2019. Smith M (2019) Bayesian and parsimony approaches reconstruct informative trees from simulated morphological datasets, Biology Letters, 15:2, Online publication date: 1-Feb-2019. Caldas I and Schrago C (2019) Data partitioning and correction for ascertainment bias reduce the uncertainty of placental mammal divergence times inferred from the morphological clock, Ecology and Evolution, 10.1002/ece3.4921, 9:4, (2255-2262), Online publication date: 1-Feb-2019. Puttick M, O'Reilly J, Pisani D, Donoghue P and Rahman I (2018) Probabilistic methods outperform parsimony in the phylogenetic analysis of data simulated without a probabilistic model, Palaeontology, 10.1111/pala.12388, 62:1, (1-17), Online publication date: 1-Jan-2019. Schrago C, Aguiar B and Mello B (2018) Comparative evaluation of maximum parsimony and Bayesian phylogenetic reconstruction using empirical morphological data, Journal of Evolutionary Biology, 10.1111/jeb.13344, 31:10, (1477-1484), Online publication date: 1-Oct-2018. Coiro M, Chomicki G and Doyle J (2018) Experimental signal dissection and method sensitivity analyses reaffirm the potential of fossils and morphology in the resolution of the relationship of angiosperms and Gnetales, Paleobiology, 10.1017/pab.2018.23, 44:3, (490-510), Online publication date: 1-Aug-2018. O'Reilly J, Puttick M, Pisani D, Donoghue P and Smith A (2018) Empirical realism of simulated data is more important than the model used to generate it: a reply to Goloboff et al . , Palaeontology, 10.1111/pala.12361, 61:4, (631-635), Online publication date: 1-Jul-2018. Goloboff P, Torres Galvis A, Arias J and Smith A (2018) Parsimony and model-based phylogenetic methods for morphological data: comments on O'Reilly et al . , Palaeontology, 10.1111/pala.12353, 61:4, (625-630), Online publication date: 1-Jul-2018. This Issue11 October 2017Volume 284Issue 1864 Article InformationDOI:https://doi.org/10.1098/rspb.2017.1636PubMed:29021176Published by:Royal SocietyOnline ISSN:1471-2954History: Manuscript received21/07/2017Manuscript accepted11/09/2017Published online11/10/2017Published in print11/10/2017 License:© 2017 The Author(s)Published by the Royal Society. All rights reserved. Citations and impact Subjectsevolutionpalaeontologytaxonomy and systematics Large datasets are available through Proceedings B's partnership with Dryad
Referência(s)