Skip to main content
Log in

Accuracy of estimated phylogenetic trees from molecular data

I. Distantly Related Species

  • Published:
Journal of Molecular Evolution Aims and scope Submit manuscript

Summary

The accuracies and efficiencies of four different methods for constructing phylogenetic trees from molecular data were examined by using computer simulation. The methods examined are UPGMA, Fitch and Margoliash's (1967) (F/M) method, Farris' (1972) method, and the modified Farris method (Tateno, Nei, and Tajima, this paper). In the computer simulation, eight OTUs (32 OTUs in one case) were assumed to evolve according to a given model tree, and the evolutionary change of a sequence of 300 nucleotides was followed. The nucleotide substitution in this sequence was assumed to occur following the Poisson distribution, negative binomial distribution or a model of temporally varying rate. Estimates of nucleotide substitutions (genetic distances) were then computed for all pairs of the nucleotide sequences that were generated at the end of the evolution considered, and from these estimates a phylogenetic tree was reconstructed and compared with the true model tree. The results of this comparison indicate that when the coefficient of variation of branch length is large the Farris and modified Farris methods tend to be better than UPGMA and the F/M method for obtaining a good topology. For estimating the number of nucleotide substitutions for each branch of the tree, however, the modified Farris method shows a better performance than the Farris method. When the coefficient of variation of branch length is small, however, UPGMA shows the best performance among the four methods examined. Nevertheless, any tree-making method is likely to make errors in obtaining the correct topology with a high probability, unless all branch lengths of the true tree are sufficiently long. It is also shown that the agreement between patristic and observed genetic distances is not a good indicator of the goodness of the tree obtained.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Cavalli-Sforza LL, Edwards AWF (1967) Phylogenetic analysis: models and estimation procedures. Am J Hum Gen 19:233–257

    Google Scholar 

  • Chakraborty R (1977) Estimation of time of divergence from phylogenetic studies. Can J Genet Cytol 19:217–223

    Google Scholar 

  • Dayhoff MO (ed) (1969) Atlas of protein sequence and structure, Vol. 4. Natl Biomed Res Found, Silver Spring, MD

    Google Scholar 

  • Doolittle RF, Blombäck B (1964) Amino-acid sequence investigations of fibrinopeptides from various mammals: evolutionary implications. Nature 202:147–152

    Google Scholar 

  • Edwards AWF, Cavalli-Sforza LL (1965) A method for cluster analysis. Biometrics 21:362–375

    Google Scholar 

  • Farris JS (1972) Estimating phylogenetic trees from distance matrices. Am Nat 106:645–668

    Google Scholar 

  • Farris JS (1979) On the naturalness of phylogenetic classification. Syst Zool 28:200–214

    Google Scholar 

  • Farris JS, Kluge AG, Mickevich MF (1979) Paraphyly of theRana boylii species group. Syst Zool 28:627–634

    Google Scholar 

  • Fitch WM, Margoliash E (1967) Construction of phylogenetic trees. Science 155:279–284

    Google Scholar 

  • Goodman M, Moore GW, Barnabas J, Matsuda G (1974) The phylogeny of human globin genes investigated by the maximum parsimony method. J Mol Evol 3:1–48

    Google Scholar 

  • Jukes TH, Cantor CR (1969) Evolution of protein molecules. In: Munro HN (ed) Mammalian protein metabolism. Academic Press, New York, pp 21–123

    Google Scholar 

  • Kimura M (1969) The rate of molecular evolution considered from the standpoint of population genetics. Proc Natl Acad Sci USA 63:1181–1188

    Google Scholar 

  • King, JL, Jukes TH (1969) Non-Darwinian evolution. Science 164:788–798

    Google Scholar 

  • Langley CH, Fitch WM (1974) An examination of the constancy of the rate of molecular evolution. J Mol Evol 3:161–177

    Google Scholar 

  • Li W (1981) Simple method for constructing phylogenetic trees from distance matrices. Proc Natl Acad Sci USA 78:1085–1089

    Google Scholar 

  • Margoliash E, Smith EL (1965) Structural and functional aspects of cytochrome c in relation to evolution. In: Bryson V, Vogel HJ (eds) Evolving genes and proteins. Academic Press, New York, pp 221–242

    Google Scholar 

  • Moore GW, Barnabas J, Goodman M (1973a) A method for constructing maximum parsimony ancestral amino acid sequences on a given network. J Theor Biol 38:459–485

    Google Scholar 

  • Moore GW, Goodman M, Barnabas J (1973b) An iterative approach from the standpoint of the additive hypothesis to the dendrogram problem posed by molecular data sets. J Theor Biol 38:423–457

    Google Scholar 

  • Nei M (1972) Genetic distance between populations. Am Nat 106:283–292

    Google Scholar 

  • Nei M (1975) Molecular population genetics and evolution. North Holland, Amsterdam and New York

    Google Scholar 

  • Nei M (1977) Standard error of immunological dating of evolutionary time. J Mol Evol 9:203–211

    Google Scholar 

  • Nei M (1978) Genetic distance and molecular taxonomy. Abstract in: Proc XIV Intl Cong Genet. Nauka Publishing Office, Moscow, pp 84–85

    Google Scholar 

  • Nei M, Tateno Y (1978) Nonrandom amino acid substitution and estimation of the number of nucleotide substitutions in evolution. J Mol Evol 11:333–347

    Google Scholar 

  • Ohta T (1976) Simulation studies on the evolution of amino acid sequences. J Mol Evol 8:1–12

    Google Scholar 

  • Ohta T, Kimura M (1971) On the constancy of the evolutionary rate of cistrons. J Mol Evol 1:18–25

    Google Scholar 

  • Peacock D, Boulter D (1975) Use of amino acid sequence data in phylogeny and evaluation of methods using computer simulation. J Mol Biol 95:513–527

    Google Scholar 

  • Prager EM, Wilson AC (1971) The dependence of immunological cross-reactivity upon sequence resemblance among lysozymes. J Biol Chem 246:5978–5989

    Google Scholar 

  • Prager EM, Wilson AC (1978) Construction of phylogenetic trees for proteins and nucleic acids: empirical evaluation of alternative matrix methods. J Mol Evol 11:129–142

    Google Scholar 

  • Robinson DF, Foulds LR (1981) Comparison of phylogenetic trees. Math Biosci 53:131–147

    Google Scholar 

  • Sarich VM, Wilson AC (1966) Quantitative immunochemistry and the evolution of primate albumins: micro-complement fixation. Science 154:1563–1566

    Google Scholar 

  • Sneath PHA, Sokal RR (1973) Numerical taxonomy. WH Freeman, San Francisco

    Google Scholar 

  • Sokal RR, Michener CD (1958) A statistical method for evaluating systematic relationships. Univ Kansas Sci Bull 28: 1409–1438

    Google Scholar 

  • Swofford DL (1981) On the utility of the distance Wagner procedure. In: Funk VA, Brooks DR (eds) Advances in cladistics. Cladistics Publications, Bronx, New York

    Google Scholar 

  • Tateno Y (1978) Statistical studies on the evolutionary changes of macromolecules. Ph.D. Dissertation, University of Texas at Houston

  • Tateno Y, Nei M (1978) Goodman et al.'s method for augmenting the number of nucleotide substitutions. J Mol Evol 11:67–73

    Google Scholar 

  • Waterman MS, Smith TF (1978) On the similarity of dendrograms. J Theor Biol 73:789–800

    Google Scholar 

  • Wilson AC, Carlson SS, White TJ (1977) Biochemical evolution. Ann Rev Biochem 46:573–639

    Google Scholar 

  • Zuckerkandl E, Pauling L (1962) Molecular disease, evolution, and genetic heterogeneity. In: Kasha M, Pullman B (eds) Horizons in biochemistry. Academic Press, New York, pp 189–225

    Google Scholar 

  • Zuckerkandl E, Pauling L (1965) Evolutionary divergence and convergence in proteins. In: Bryson V, Vogel HJ (eds) Evolving genes and proteins. Academic Press, New York, pp 97–166

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tateno, Y., Nei, M. & Tajima, F. Accuracy of estimated phylogenetic trees from molecular data. J Mol Evol 18, 387–404 (1982). https://doi.org/10.1007/BF01840887

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01840887

Key words

Navigation