In biology, phylogenetics (/ˌfləˈnɛtɪks, -lə-/)[1][2][3] is the study of the evolutionary history of life using genetics, which is known as phylogenetic inference. It establishes the relationship between organisms with the empirical data and observed heritable traits of DNA sequences, protein amino acid sequences, and morphology. The results are a phylogenetic tree—a diagram setting the hypothetical relationships between organisms and their evolutionary history.[4]

The tips of a phylogenetic tree can be living taxa or fossils, which represent the present time or "end" of an evolutionary lineage, respectively. A phylogenetic diagram can be rooted or unrooted. A rooted tree diagram indicates the hypothetical common ancestor of the tree. An unrooted tree diagram (a network) makes no assumption about the ancestral line, and does not show the origin or "root" of the taxa in question or the direction of inferred evolutionary transformations.[5]

In addition to their use for inferring phylogenetic patterns among taxa, phylogenetic analyses are often employed to represent relationships among genes or individual organisms. Such uses have become central to understanding biodiversity, evolution, ecology, and genomes.

Phylogenetics is a component of systematics that uses similarities and differences of the characteristics of species to interpret their evolutionary relationships and origins. Phylogenetics focuses on whether the characteristics of a species reinforce a phylogenetic inference that it diverged from the most recent common ancestor of a taxonomic group.[6]

In the field of cancer research, phylogenetics can be used to study the clonal evolution of tumors and molecular chronology, predicting and showing how cell populations vary throughout the progression of the disease and during treatment, using whole genome sequencing techniques.[7] The evolutionary processes behind cancer progression are quite different from those in most species and are important to phylogenetic inference; these differences manifest in several areas: the types of aberrations that occur, the rates of mutation, the high heterogeneity (variability) of tumor cell subclones, and the absence of genetic recombination.[8][9]

Phylogenetics can also aid in drug design and discovery. Phylogenetics allows scientists to organize species and can show which species are likely to have inherited particular traits that are medically useful, such as producing biologically active compounds - those that have effects on the human body. For example, in drug discovery, venom-producing animals are particularly useful. Venoms from these animals produce several important drugs, e.g., ACE inhibitors and Prialt (Ziconotide). To find new venoms, scientists turn to phylogenetics to screen for closely related species that may have the same useful traits. The phylogenetic tree shows which species of fish have an origin of venom, and related fish they may contain the trait. Using this approach in studying venomous fish, biologists are able to identify the fish species that may be venomous. Biologist have used this approach in many species such as snakes and lizards.[10] In forensic science, phylogenetic tools are useful to assess DNA evidence for court cases. The simple phylogenetic tree of viruses A-E shows the relationships between viruses e.g., all viruses are descendants of Virus A.

HIV forensics uses phylogenetic analysis to track the differences in HIV genes and determine the relatedness of two samples. Phylogenetic analysis has been used in criminal trials to exonerate or hold individuals. HIV forensics does have its limitations, i.e., it cannot be the sole proof of transmission between individuals and phylogenetic analysis which shows transmission relatedness does not indicate direction of transmission.[11]

Taxonomy and classification

edit
 
One small clade of fish, showing how venom has evolved multiple times.[10]

Taxonomy is the identification, naming, and classification of organisms. Compared to systemization, classification emphasizes whether a species has characteristics of a taxonomic group.[6] The Linnaean classification system developed in the 1700s by Carolus Linnaeus is the foundation for modern classification methods. Linnaean classification relies on an organism's phenotype or physical characteristics to group and organize species.[12] With the emergence of biochemistry, organism classifications are now usually based on phylogenetic data, and many systematists contend that only monophyletic taxa should be recognized as named groups. The degree to which classification depends on inferred evolutionary history differs depending on the school of taxonomy: phenetics ignores phylogenetic speculation altogether, trying to represent the similarity between organisms instead; cladistics (phylogenetic systematics) tries to reflect phylogeny in its classifications by only recognizing groups based on shared, derived characters (synapomorphies); evolutionary taxonomy tries to take into account both the branching pattern and "degree of difference" to find a compromise between them.

Inference of a phylogenetic tree

edit

Usual methods of phylogenetic inference involve computational approaches implementing the optimality criteria and methods of parsimony, maximum likelihood (ML), and MCMC-based Bayesian inference. All these depend upon an implicit or explicit mathematical model describing the evolution of characters observed.[13]

Phenetics, popular in the mid-20th century but now largely obsolete, used distance matrix-based methods to construct trees based on overall similarity in morphology or similar observable traits (i.e. in the phenotype or the overall similarity of DNA, not the DNA sequence), which was often assumed to approximate phylogenetic relationships.

Prior to 1950, phylogenetic inferences were generally presented as narrative scenarios. Such methods are often ambiguous and lack explicit criteria for evaluating alternative hypotheses.[14][15][16]

Impacts of taxon sampling

edit

In phylogenetic analysis, taxon sampling selects a small group of taxa to represent the evolutionary history of its broader population.[17] This process is also known as stratified sampling or clade-based sampling.[18] The practice occurs given limited resources to compare and analyze every species within a target population.[17] Based on the representative group selected, the construction and accuracy of phylogenetic trees vary, which impacts derived phylogenetic inferences.[18]

Unavailable datasets, such as an organism's incomplete DNA and protein amino acid sequences in genomic databases, directly restrict taxonomic sampling.[18] Consequently, a significant source of error within phylogenetic analysis occurs due to inadequate taxon samples. Accuracy may be improved by increasing the number of genetic samples within its monophyletic group. Conversely, increasing sampling from outgroups extraneous to the target stratified population may decrease accuracy. Long branch attraction is an attributed theory for this occurrence, where nonrelated branches are incorrectly classified together, insinuating a shared evolutionary history.[17]

 
Percentage of inter-ordinal branches reconstructed with a constant number of bases and four phylogenetic tree construction models; neighbor-joining (NJ), minimum evolution (ME), unweighted maximum parsimony (MP), and maximum likelihood (ML). Demonstrates phylogenetic analysis with fewer taxa and more genes per taxon matches more often with the replicable consensus tree. The dotted line demonstrates an equal accuracy increase between the two taxon sampling methods. Figure is property of Michael S. Rosenberg and Sudhir Kumar as presented in the journal article Taxon Sampling, Bioinformatics, and Phylogenomics.[18]

There are debates if increasing the number of taxa sampled improves phylogenetic accuracy more than increasing the number of genes sampled per taxon. Differences in each method's sampling impact the number of nucleotide sites utilized in a sequence alignment, which may contribute to disagreements. For example, phylogenetic trees constructed utilizing a more significant number of total nucleotides are generally more accurate, as supported by phylogenetic trees' bootstrapping replicability from random sampling.

The graphic presented in Taxon Sampling, Bioinformatics, and Phylogenomics, compares the correctness of phylogenetic trees generated using fewer taxa and more sites per taxon on the x-axis to more taxa and fewer sites per taxon on the y-axis. With fewer taxa, more genes are sampled amongst the taxonomic group; in comparison, with more taxa added to the taxonomic sampling group, fewer genes are sampled. Each method has the same total number of nucleotide sites sampled. Furthermore, the dotted line represents a 1:1 accuracy between the two sampling methods. As seen in the graphic, most of the plotted points are located below the dotted line, which indicates gravitation toward increased accuracy when sampling fewer taxa with more sites per taxon. The research performed utilizes four different phylogenetic tree construction models to verify the theory; neighbor-joining (NJ), minimum evolution (ME), unweighted maximum parsimony (MP), and maximum likelihood (ML). In the majority of models, sampling fewer taxon with more sites per taxon demonstrated higher accuracy.

Generally, with the alignment of a relatively equal number of total nucleotide sites, sampling more genes per taxon has higher bootstrapping replicability than sampling more taxa. However, unbalanced datasets within genomic databases make increasing the gene comparison per taxon in uncommonly sampled organisms increasingly difficult.[18]

History

edit

Overview

edit

The term "phylogeny" derives from the German Phylogenie, introduced by Haeckel in 1866,[19] and the Darwinian approach to classification became known as the "phyletic" approach.[20] It can be traced back to Aristotle, who wrote in his Posterior Analytics, "We may assume the superiority ceteris paribus [other things being equal] of the demonstration which derives from fewer postulates or hypotheses."

Ernst Haeckel's recapitulation theory

edit

The modern concept of phylogenetics evolved primarily as a disproof of a previously widely accepted theory. During the late 19th century, Ernst Haeckel's recapitulation theory, or "biogenetic fundamental law", was widely popular.[21] It was often expressed as "ontogeny recapitulates phylogeny", i.e. the development of a single organism during its lifetime, from germ to adult, successively mirrors the adult stages of successive ancestors of the species to which it belongs. But this theory has long been rejected.[22][23] Instead, ontogeny evolves – the phylogenetic history of a species cannot be read directly from its ontogeny, as Haeckel thought would be possible, but characters from ontogeny can be (and have been) used as data for phylogenetic analyses; the more closely related two species are, the more apomorphies their embryos share.

Timeline of key points

edit
 
Branching tree diagram from Heinrich Georg Bronn's work (1858)
 
Phylogenetic tree suggested by Haeckel (1866)
  • 14th century, lex parsimoniae (parsimony principle), William of Ockam, English philosopher, theologian, and Franciscan friar, but the idea actually goes back to Aristotle, as a precursor concept. He introduced the concept of Occam's razor, which is the problem solving principle that recommends searching for explanations constructed with the smallest possible set of elements. Though he did not use these exact words, the principle can be summarized as "Entities must not be multiplied beyond necessity." The principle advocates that when presented with competing hypotheses about the same prediction, one should prefer the one that requires fewest assumptions.
  • 1763, Bayesian probability, Rev. Thomas Bayes,[24] a precursor concept. Bayesian probability began a resurgence in the 1950s, allowing scientists in the computing field to pair traditional Bayesian statistics with other more modern techniques. It is now used as a blanket term for several related interpretations of probability as an amount of epistemic confidence.
  • 18th century, Pierre Simon (Marquis de Laplace), perhaps first to use ML (maximum likelihood), precursor concept. His work gave way to the Laplace distribution, which can be directly linked to least absolute deviations.
  • 1809, evolutionary theory, Philosophie Zoologique, Jean-Baptiste de Lamarck, precursor concept, foreshadowed in the 17th century and 18th century by Voltaire, Descartes, and Leibniz, with Leibniz even proposing evolutionary changes to account for observed gaps suggesting that many species had become extinct, others transformed, and different species that share common traits may have at one time been a single race,[25] also foreshadowed by some early Greek philosophers such as Anaximander in the 6th century BC and the atomists of the 5th century BC, who proposed rudimentary theories of evolution[26]
  • 1837, Darwin's notebooks show an evolutionary tree[27]
  • 1840, American Geologist Edward Hitchcock published what is considered to be the first paleontological "Tree of Life". Many critiques, modifications, and explanations would follow.[28]
     
    This chart displays one of the first published attempts at a paleontological "Tree of Life" by Geologist Edward Hitchcock. (1840)
  • 1843, distinction between homology and analogy (the latter now referred to as homoplasy), Richard Owen, precursor concept. Homology is the term used to characterize the similarity of features that can be parsimoniously explained by common ancestry. Homoplasy is the term used to describe a feature that has been gained or lost independently in separate lineages over the course of evolution.
  • 1858, Paleontologist Heinrich Georg Bronn (1800–1862) published a hypothetical tree to illustrating the paleontological "arrival" of new, similar species. following the extinction of an older species. Bronn did not propose a mechanism responsible for such phenomena, precursor concept.[29]
  • 1858, elaboration of evolutionary theory, Darwin and Wallace,[30] also in Origin of Species by Darwin the following year, precursor concept.
  • 1866, Ernst Haeckel, first publishes his phylogeny-based evolutionary tree, precursor concept. Haeckel introduces the now-disproved recapitulation theory. He introduced the term "Cladus" as a taxonomic category just below subphylum.[31]
  • 1893, Dollo's Law of Character State Irreversibility,[32] precursor concept. Dollo's Law of Irreversibility states that "an organism never comes back exactly to its previous state due to the indestructible nature of the past, it always retains some trace of the transitional stages through which it has passed."[33]
  • 1912, ML (maximum likelihood recommended, analyzed, and popularized by Ronald Fisher, precursor concept. Fisher is one of the main contributors to the early 20th-century revival of Darwinism, and has been called the "greatest of Darwin's successors" for his contributions to the revision of the theory of evolution and his use of mathematics to combine Mendelian genetics and natural selection in the 20th century "modern synthesis".
  • 1921, Tillyard uses term "phylogenetic" and distinguishes between archaic and specialized characters in his classification system.[34]
  • 1940, Lucien Cuénot coined the term "clade" in 1940: "terme nouveau de clade (du grec κλάδοςç, branche) [A new term clade (from the Greek word klados, meaning branch)]".[35] He used it for evolutionary branching.[36]
  • 1947, Bernhard Rensch introduced the term Kladogenesis in his German book Neuere Probleme der Abstammungslehre Die transspezifische Evolution,[37] translated into English in 1959 as Evolution Above the Species Level (still using the same spelling).[38]
  • 1949, Jackknife resampling, Maurice Quenouille (foreshadowed in '46 by Mahalanobis and extended in '58 by Tukey), precursor concept.
  • 1950, Willi Hennig's classic formalization.[39] Hennig is considered the founder of phylogenetic systematics, and published his first works in German of this year. He also asserted a version of the parsimony principle, stating that the presence of amorphous characters in different species 'is always reason for suspecting kinship, and that their origin by convergence should not be presumed a priori'. This has been considered a foundational view of phylogenetic inference.
  • 1952, William Wagner's ground plan divergence method.[40]
  • 1957, Julian Huxley adopted Rensch's terminology as "cladogenesis" with a full definition: "Cladogenesis I have taken over directly from Rensch, to denote all splitting, from subspeciation through adaptive radiation to the divergence of phyla and kingdoms." With it he introduced the word "clades", defining it as: "Cladogenesis results in the formation of delimitable monophyletic units, which may be called clades."[41][36]
  • 1960, Arthur Cain and Geoffrey Ainsworth Harrison coined "cladistic" to mean evolutionary relationship,[42]
  • 1963, first attempt to use ML (maximum likelihood) for phylogenetics, Edwards and Cavalli-Sforza.[43]
  • 1965
    • Camin-Sokal parsimony, first parsimony (optimization) criterion and first computer program/algorithm for cladistic analysis both by Camin and Sokal.[44]
    • Character compatibility method, also called clique analysis, introduced independently by Camin and Sokal (loc. cit.) and E. O. Wilson.[45]
  • 1966
    • English translation of Hennig.[46]
    • "Cladistics" and "cladogram" coined (Webster's, loc. cit.)
  • 1969
    • Dynamic and successive weighting, James Farris.[47]
    • Wagner parsimony, Kluge and Farris.[48]
    • CI (consistency index), Kluge and Farris.[48]
    • Introduction of pairwise compatibility for clique analysis, Le Quesne.[49]
  • 1970, Wagner parsimony generalized by Farris.[50]
  • 1971
    • First successful application of ML (maximum likelihood) to phylogenetics (for protein sequences), Neyman.[51]
    • Fitch parsimony, Walter M. Fitch.[52] These gave way to the most basic ideas of maximum parsimony. Fitch is known for his work on reconstructing phylogenetic trees from protein and DNA sequences. His definition of orthologous sequences has been referenced in many research publications.
    • NNI (nearest neighbour interchange), first branch-swapping search strategy, developed independently by Robinson[53] and Moore et al.
    • ME (minimum evolution), Kidd and Sgaramella-Zonta[54] (it is unclear if this is the pairwise distance method or related to ML as Edwards and Cavalli-Sforza call ML "minimum evolution").
  • 1972, Adams consensus, Adams.[55]
  • 1976, prefix system for ranks, Farris.[56]
  • 1977, Dollo parsimony, Farris.[57]
  • 1979
  • 1980, PHYLIP, first software package for phylogenetic analysis, Joseph Felsenstein. A free computational phylogenetics package of programs for inferring evolutionary trees (phylogenies). One such example tree created by PHYLIP, called a "drawgram", generates rooted trees. This image shown in the figure below shows the evolution of phylogenetic trees over time.
  • 1981
    • Majority consensus, Margush and MacMorris.[61]
    • Strict consensus, Sokal and Rohlf[62]
       
      This image depicts a PHYLIP generated drawgram. This drawgram is an example of one of the possible trees the software is capable of generating.
      first computationally efficient ML (maximum likelihood) algorithm.[63] Felsenstein created the Felsenstein Maximum Likelihood method, used for the inference of phylogeny which evaluates a hypothesis about evolutionary history in terms of the probability that the proposed model and the hypothesized history would give rise to the observed data set.
  • 1982
    • PHYSIS, Mikevich and Farris
    • Branch and bound, Hendy and Penny[64]
  • 1985
    • First cladistic analysis of eukaryotes based on combined phenotypic and genotypic evidence Diana Lipscomb.[65]
    • First issue of Cladistics.
    • First phylogenetic application of bootstrap, Felsenstein.[66]
    • First phylogenetic application of jackknife, Scott Lanyon.[67]
  • 1986, MacClade, Maddison and Maddison.
  • 1987, neighbor-joining method Saitou and Nei[68]
  • 1988, Hennig86 (version 1.5), Farris
    • Bremer support (decay index), Bremer.[69]
  • 1989
    • RI (retention index), RCI (rescaled consistency index), Farris.[70]
    • HER (homoplasy excess ratio), Archie.[71]
  • 1990
    • combinable components (semi-strict) consensus, Bremer.[72]
    • SPR (subtree pruning and regrafting), TBR (tree bisection and reconnection), Swofford and Olsen.[73]
  • 1991
    • DDI (data decisiveness index), Goloboff.[74][75]
    • First cladistic analysis of eukaryotes based only on phenotypic evidence, Lipscomb.
  • 1993, implied weighting Goloboff.[76]
  • 1994, reduced consensus: RCC (reduced cladistic consensus) for rooted trees, Wilkinson.[77]
  • 1995, reduced consensus RPC (reduced partition consensus) for unrooted trees, Wilkinson.[78]
  • 1996, first working methods for BI (Bayesian Inference) independently developed by Li,[79] Mau,[80] and Rannala and Yang[81] and all using MCMC (Markov chain-Monte Carlo).
  • 1998, TNT (Tree Analysis Using New Technology), Goloboff, Farris, and Nixon.
  • 1999, Winclada, Nixon.
  • 2003, symmetrical resampling, Goloboff.[82]
  • 2004, 2005, similarity metric (using an approximation to Kolmogorov complexity) or NCD (normalized compression distance), Li et al.,[83] Cilibrasi and Vitanyi.[84]

Uses of phylogenetic analysis

edit

Pharmacology

edit

One use of phylogenetic analysis involves the pharmacological examination of closely related groups of organisms. Advances in cladistics analysis through faster computer programs and improved molecular techniques have increased the precision of phylogenetic determination, allowing for the identification of species with pharmacological potential.

Historically, phylogenetic screens for pharmacological purposes were used in a basic manner, such as studying the Apocynaceae family of plants, which includes alkaloid-producing species like Catharanthus, known for producing vincristine, an antileukemia drug. Modern techniques now enable researchers to study close relatives of a species to uncover either a higher abundance of important bioactive compounds (e.g., species of Taxus for taxol) or natural variants of known pharmaceuticals (e.g., species of Catharanthus for different forms of vincristine or vinblastine).[85]

Biodiversity

edit

Phylogenetic analysis has also been applied to biodiversity studies within the fungi family. Phylogenetic analysis helps understand the evolutionary history of various groups of organisms, identify relationships between different species, and predict future evolutionary changes. Emerging imagery systems and new analysis techniques allow for the discovery of more genetic relationships in biodiverse fields, which can aid in conservation efforts by identifying rare species that could benefit ecosystems globally.

 
Phylogenetic Subtree of fungi containing different biodiverse sections of the fungi group.

Infectious disease epidemiology

edit

Whole-genome sequence data from outbreaks or epidemics of infectious diseases can provide important insights into transmission dynamics and inform public health strategies. Traditionally, studies have combined genomic and epidemiological data to reconstruct transmission events. However, recent research has explored deducing transmission patterns solely from genomic data using phylodynamics, which involves analyzing the properties of pathogen phylogenies. Phylodynamics uses theoretical models to compare predicted branch lengths with actual branch lengths in phylogenies to infer transmission patterns. Additionally, coalescent theory, which describes probability distributions on trees based on population size, has been adapted for epidemiological purposes. Another source of information within phylogenies that has been explored is "tree shape." These approaches, while computationally intensive, have the potential to provide valuable insights into pathogen transmission dynamics.[86]

 
Pathogen Transmission Trees

The structure of the host contact network significantly impacts the dynamics of outbreaks, and management strategies rely on understanding these transmission patterns. Pathogen genomes spreading through different contact network structures, such as chains, homogeneous networks, or networks with super-spreaders, accumulate mutations in distinct patterns, resulting in noticeable differences in the shape of phylogenetic trees, as illustrated in Fig. 1. Researchers have analyzed the structural characteristics of phylogenetic trees generated from simulated bacterial genome evolution across multiple types of contact networks. By examining simple topological properties of these trees, researchers can classify them into chain-like, homogeneous, or super-spreading dynamics, revealing transmission patterns. These properties form the basis of a computational classifier used to analyze real-world outbreaks. Computational predictions of transmission dynamics for each outbreak often align with known epidemiological data.

 
Graphical Representation of Phylogenetic Tree analysis

Different transmission networks result in quantitatively different tree shapes. To determine whether tree shapes captured information about underlying disease transmission patterns, researchers simulated the evolution of a bacterial genome over three types of outbreak contact networks—homogeneous, super-spreading, and chain-like. They summarized the resulting phylogenies with five metrics describing tree shape. Figures 2 and 3 illustrate the distributions of these metrics across the three types of outbreaks, revealing clear differences in tree topology depending on the underlying host contact network.

Super-spreader networks give rise to phylogenies with higher Colless imbalance, longer ladder patterns, lower Δw, and deeper trees than those from homogeneous contact networks. Trees from chain-like networks are less variable, deeper, more imbalanced, and narrower than those from other networks.

Scatter plots can be used to visualize the relationship between two variables in pathogen transmission analysis, such as the number of infected individuals and the time since infection. These plots can help identify trends and patterns, such as whether the spread of the pathogen is increasing or decreasing over time, and can highlight potential transmission routes or super-spreader events. Box plots displaying the range, median, quartiles, and potential outliers datasets can also be valuable for analyzing pathogen transmission data, helping to identify important features in the data distribution. They may be used to quickly identify differences or similarities in the transmission data.[86]

Disciplines other than biology

edit
 
Phylogeny of Indo-European languages[87]

Phylogenetic tools and representations (trees and networks) can also be applied to philology, the study of the evolution of oral languages and written text and manuscripts, such as in the field of quantitative comparative linguistics.[88]

Computational phylogenetics can be used to investigate a language as an evolutionary system. The evolution of human language closely corresponds with human's biological evolution which allows phylogenetic methods to be applied. The concept of a "tree" serves as an efficient way to represent relationships between languages and language splits. It also serves as a way of testing hypotheses about the connections and ages of language families. For example, relationships among languages can be shown by using cognates as characters.[89][90] The phylogenetic tree of Indo-European languages shows the relationships between several of the languages in a timeline, as well as the similarity between words and word order.

There are three types of criticisms about using phylogenetics in philology, the first arguing that languages and species are different entities, therefore you can not use the same methods to study both. The second being how phylogenetic methods are being applied to linguistic data. And the third, discusses the types of data that is being used to construct the trees.[89]

Bayesian phylogenetic methods, which are sensitive to how treelike the data is, allow for the reconstruction of relationships among languages, locally and globally. The main two reasons for the use of Bayesian phylogenetics are that (1) diverse scenarios can be included in calculations and (2) the output is a sample of trees and not a single tree with true claim.[91]

The same process can be applied to texts and manuscripts. In Paleography, the study of historical writings and manuscripts, texts were replicated by scribes who copied from their source and alterations - i.e., 'mutations' - occurred when the scribe did not precisely copy the source.[92]

Phylogenetics has been applied to archaeological artefacts such as the early hominin hand-axes,[93] late Palaeolithic figurines,[94] Neolithic stone arrowheads,[95] Bronze Age ceramics,[96] and historical-period houses.[97] Bayesian methods have also been employed by archaeologists in an attempt to quantify uncertainty in the tree topology and divergence times of stone projectile point shapes in the European Final Palaeolithic and earliest Mesolithic.[98]

See also

edit

References

edit
  1. ^ "phylogenetic". Dictionary.com Unabridged (Online). n.d.
  2. ^ "phylogenetic". Merriam-Webster.com Dictionary. Merriam-Webster.
  3. ^ from Greek φυλή/φῦλον [phylé/phylon] "tribe, clan, race", and γενετικός [genetikós] "origin, source, birth" Liddell, Henry George; Scott, Robert; Jones, Henry Stuart (1968). A Greek-English lexicon (9 ed.). Oxford: Clarendon Press. p. 1961.
  4. ^ "phylogeny". Biology online. Retrieved 15 February 2013.
  5. ^ Itzik, Peer (1 January 2001). "Phylogenetic Trees". www.cs.tau.ac.il.
  6. ^ a b Harris, Katherine (23 June 2019). Taxonomy & Phylogeny. Biology LibreTexts. Retrieved 19 April 2023.
  7. ^ Herberts, Cameron; Annala, Matti; Sipola, Joonatan; Ng, Sarah W. S.; Chen, Xinyi E.; Nurminen, Anssi; Korhonen, Olga V.; Munzur, Aslı D.; Beja, Kevin; Schönlau, Elena; Bernales, Cecily Q.; Ritch, Elie; Bacon, Jack V. W.; Lack, Nathan A.; Nykter, Matti (August 2022). "Deep whole-genome ctDNA chronology of treatment-resistant prostate cancer". Nature. 608 (7921): 199–208. Bibcode:2022Natur.608..199H. doi:10.1038/s41586-022-04975-9. ISSN 1476-4687. PMID 35859180. S2CID 250730778.
  8. ^ Schwartz, Russell; Schäffer, Alejandro A. (April 2017). "The evolution of tumour phylogenetics: principles and practice". Nature Reviews Genetics. 18 (4): 213–229. doi:10.1038/nrg.2016.170. ISSN 1471-0056. PMC 5886015. PMID 28190876.
  9. ^ Ní Leathlobhair, Máire; Lenski, Richard E. (2022). "Population genetics of clonally transmissible cancers". Nature Ecology & Evolution. 6 (8): 1077–1089. Bibcode:2022NatEE...6.1077N. doi:10.1038/s41559-022-01790-3. ISSN 2397-334X. PMID 35879542.
  10. ^ a b "Drug discovery - Understanding Evolution". 7 July 2021. Retrieved 23 April 2023.
  11. ^ Bernard, EJ; Azad, Y; Vandamme, AM; Weait, M; Geretti, AM (2007). "HIV forensics: pitfalls and acceptable standards in the use of phylogenetic analysis as evidence in criminal investigations of HIV transmission". HIV Medicine. 8 (6): 382–387. doi:10.1111/j.1468-1293.2007.00486.x. ISSN 1464-2662. PMID 17661846. S2CID 38883310.
  12. ^ CK-12 Foundation (6 March 2021). Linnaean Classification. Biology LibreTexts. Retrieved 19 April 2023.{{cite book}}: CS1 maint: numeric names: authors list (link)
  13. ^ Phylogenetic Inference. Metaphysics Research Lab, Stanford University. 15 February 2024. {{cite book}}: |website= ignored (help)
  14. ^ Richard C. Brusca & Gary J. Brusca (2003). Invertebrates (2nd ed.). Sunderland, Massachusetts: Sinauer Associates. ISBN 978-0-87893-097-5.
  15. ^ Bock, W. J. (2004). Explanations in systematics. Pp. 49–56. In Williams, D. M. and Forey, P. L. (eds) Milestones in Systematics. London: Systematics Association Special Volume Series 67. CRC Press, Boca Raton, Florida.
  16. ^ Auyang, Sunny Y. (1998). Narratives and Theories in Natural History. In: Foundations of complex-system theories: in economics, evolutionary biology, and statistical physics. Cambridge, U.K.; New York: Cambridge University Press.[page needed]
  17. ^ a b c Rosenberg, Michael (28 August 2001). "Incomplete taxon sampling is not a problem for phylogenetic inference". Proceedings of the National Academy of Sciences. 98 (19): 10751–10756. Bibcode:2001PNAS...9810751R. doi:10.1073/pnas.191248498. PMC 58547. PMID 11526218.
  18. ^ a b c d e Rosenberg, Michael; Kumar, Sudhir (1 February 2023). "Taxon Sampling, Bioinformatics, and Phylogenetics". Evolutionary Journal of the Linnean Society. 52 (1): 119–124. doi:10.1080/10635150390132894. PMC 2796430. PMID 12554445. Retrieved 19 April 2023.
  19. ^ Harper, Douglas (2010). "Phylogeny". Online Etymology Dictionary.
  20. ^ Stuessy 2009.
  21. ^ "Early Evolution and Development: Ernst Haeckel". UC Museum of Paleontology.
  22. ^ Blechschmidt, Erich (1977) The Beginnings of Human Life. Springer-Verlag Inc., p. 32: "The so-called basic law of biogenetics is wrong. No buts or ifs can mitigate this fact. It is not even a tiny bit correct or correct in a different form, making it valid in a certain percentage. It is totally wrong."
  23. ^ Ehrlich, Paul; Richard Holm; Dennis Parnell (1963) The Process of Evolution. New York: McGraw–Hill, p. 66: "Its shortcomings have been almost universally pointed out by modern authors, but the idea still has a prominent place in biological mythology. The resemblance of early vertebrate embryos is readily explained without resort to mysterious forces compelling each individual to reclimb its phylogenetic tree."
  24. ^ Bayes, Mr; Price, Mr (1763). "An Essay towards Solving a Problem in the Doctrine of Chances. By the Late Rev. Mr. Bayes, F. R. S. Communicated by Mr. Price, in a Letter to John Canton, A. M. F. R. S". Philosophical Transactions of the Royal Society of London. 53: 370–418. doi:10.1098/rstl.1763.0053.
  25. ^ Strickberger, Monroe. 1996. Evolution, 2nd. ed. Jones & Bartlett.[page needed]
  26. ^ The Theory of Evolution, Teaching Company course, Lecture 1
  27. ^ Darwin's Tree of Life Archived 13 March 2014 at the Wayback Machine
  28. ^ Archibald, J. David (1 August 2009). "Edward Hitchcock's Pre-Darwinian (1840) "Tree of Life"". Journal of the History of Biology. 42 (3): 561–592. doi:10.1007/s10739-008-9163-y. ISSN 1573-0387. PMID 20027787. S2CID 16634677.
  29. ^ Archibald, J. David (2008). "Edward Hitchcock's Pre-Darwinian (1840) 'Tree of Life'". Journal of the History of Biology. 42 (3): 561–92. CiteSeerX 10.1.1.688.7842. doi:10.1007/s10739-008-9163-y. PMID 20027787. S2CID 16634677.
  30. ^ Darwin, Charles; Wallace, Alfred (1858). "On the Tendency of Species to form Varieties; and on the Perpetuation of Varieties and Species by Natural Means of Selection". Journal of the Proceedings of the Linnean Society of London. Zoology. 3 (9): 45–62. doi:10.1111/j.1096-3642.1858.tb02500.x.
  31. ^ Cavalier-Smith, Thomas (12 January 2010). "Deep phylogeny, ancestral groups and the four ages of life". Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 365 (1537): 111–132. doi:10.1098/rstb.2009.0161. ISSN 1471-2970. PMC 2842702. PMID 20008390.
  32. ^ Dollo, Louis. 1893. Les lois de l'évolution. Bull. Soc. Belge Géol. Paléont. Hydrol. 7: 164–66.
  33. ^ Galis, Frietson; Arntzen, Jan W.; Lande, Russell (2010). "Dollo's Law and the Irreversibility of Digit Loss in Bachia". Evolution. 64 (8): 2466–76, discussion 2477–85. doi:10.1111/j.1558-5646.2010.01041.x. PMID 20500218. S2CID 24520027. Retrieved 23 April 2023.
  34. ^ Tillyard, R. J (2012). "A New Classification of the Order Perlaria". The Canadian Entomologist. 53 (2): 35–43. doi:10.4039/Ent5335-2. S2CID 90171163.
  35. ^ Cuénot, Lucien (1940). "Remarques sur un essai d'arbre genealogique du regne animal". Comptes rendus Academie des sciences 0210 (in French). Paris (France): Académie des sciences (France). p. 24.
  36. ^ a b Tassy, P.; Fischer, M. S. (2021). ""Cladus" and clade: a taxonomic odyssey". Theory in Biosciences. 140 (1): 77–85. doi:10.1007/s12064-020-00326-2. ISSN 1431-7613. PMC 7583691.
  37. ^ Waddington, C. H. (1948). "Neuere Probleme der Abstammungslehre Die Transspezifische Evolution". Nature. 162 (4130): 979–980. doi:10.1038/162979a0. ISSN 1476-4687.
  38. ^ Elton, C. S. (1960). "Evolutionary Divergence". Nature. 187 (4736): 446. doi:10.1038/187446a0. ISSN 1476-4687.
  39. ^ Hennig, Willi (1950). Grundzüge einer Theorie der Phylogenetischen Systematik [Basic features of a theory of phylogenetic systematics] (in German). Berlin: Deutscher Zentralverlag. OCLC 12126814.[page needed]
  40. ^ Wagner, Warren Herbert (1952). "The fern genus Diellia: structure, affinities, and taxonomy". University of California Publications in Botany. 26 (1–6): 1–212. OCLC 4228844.
  41. ^ Huxley, Julian (1957). "The Three Types of Evolutionary Process". Nature. 180 (4584): 454–455. doi:10.1038/180454a0. ISSN 1476-4687.
  42. ^ Cain, A. J; Harrison, G. A (2009). "Phyletic Weighting". Proceedings of the Zoological Society of London. 135 (1): 1–31. doi:10.1111/j.1469-7998.1960.tb05828.x.
  43. ^ "The reconstruction of evolution" in "Abstracts of Papers". Annals of Human Genetics. 27 (1): 103–5. 1963. doi:10.1111/j.1469-1809.1963.tb00786.x.
  44. ^ Camin, Joseph H; Sokal, Robert R (1965). "A Method for Deducing Branching Sequences in Phylogeny". Evolution. 19 (3): 311–26. doi:10.1111/j.1558-5646.1965.tb01722.x. S2CID 20957422.
  45. ^ Wilson, Edward O (1965). "A Consistency Test for Phylogenies Based on Contemporaneous Species". Systematic Zoology. 14 (3): 214–20. doi:10.2307/2411550. JSTOR 2411550.
  46. ^ Hennig. W. (1966). Phylogenetic systematics. Illinois University Press, Urbana.[page needed]
  47. ^ Farris, James S (1969). "A Successive Approximations Approach to Character Weighting". Systematic Zoology. 18 (4): 374–85. doi:10.2307/2412182. JSTOR 2412182.
  48. ^ a b Kluge, A. G; Farris, J. S (1969). "Quantitative Phyletics and the Evolution of Anurans". Systematic Biology. 18 (1): 1–32. doi:10.1093/sysbio/18.1.1.
  49. ^ Quesne, Walter J. Le (1969). "A Method of Selection of Characters in Numerical Taxonomy". Systematic Zoology. 18 (2): 201–205. doi:10.2307/2412604. JSTOR 2412604.
  50. ^ Farris, J. S (1970). "Methods for Computing Wagner Trees". Systematic Biology. 19: 83–92. doi:10.1093/sysbio/19.1.83.
  51. ^ Neyman, Jerzy (1971). "Molecular studies of evolution: a source of novel statistical problems". Statistical Decision Theory and Related Topics. pp. 1–27. doi:10.1016/B978-0-12-307550-5.50005-8. ISBN 978-0-12-307550-5.
  52. ^ Fitch, W. M (1971). "Toward Defining the Course of Evolution: Minimum Change for a Specific Tree Topology". Systematic Biology. 20 (4): 406–16. doi:10.1093/sysbio/20.4.406. JSTOR 2412116.
  53. ^ Robinson, D.F (1971). "Comparison of labeled trees with valency three". Journal of Combinatorial Theory. Series B. 11 (2): 105–19. doi:10.1016/0095-8956(71)90020-7.
  54. ^ Kidd, K. K; Sgaramella-Zonta, L. A (1971). "Phylogenetic analysis: Concepts and methods". American Journal of Human Genetics. 23 (3): 235–52. PMC 1706731. PMID 5089842.
  55. ^ Adams, E. N (1972). "Consensus Techniques and the Comparison of Taxonomic Trees". Systematic Biology. 21 (4): 390–397. doi:10.1093/sysbio/21.4.390.
  56. ^ Farris, James S (1976). "Phylogenetic Classification of Fossils with Recent Species". Systematic Zoology. 25 (3): 271–282. doi:10.2307/2412495. JSTOR 2412495.
  57. ^ Farris, J. S (1977). "Phylogenetic Analysis Under Dollo's Law". Systematic Biology. 26: 77–88. doi:10.1093/sysbio/26.1.77.
  58. ^ Nelson, G (1979). "Cladistic Analysis and Synthesis: Principles and Definitions, with a Historical Note on Adanson's Familles Des Plantes (1763-1764)". Systematic Biology. 28: 1–21. doi:10.1093/sysbio/28.1.1.
  59. ^ Gordon, A. D (1979). "A Measure of the Agreement between Rankings". Biometrika. 66 (1): 7–15. doi:10.1093/biomet/66.1.7. JSTOR 2335236.
  60. ^ Efron B. (1979). Bootstrap methods: another look at the jackknife. Ann. Stat. 7: 1–26.
  61. ^ Margush, T; McMorris, F (1981). "Consensus-trees". Bulletin of Mathematical Biology. 43 (2): 239. doi:10.1016/S0092-8240(81)90019-7 (inactive 1 November 2024).{{cite journal}}: CS1 maint: DOI inactive as of November 2024 (link)
  62. ^ Sokal, Robert R; Rohlf, F. James (1981). "Taxonomic Congruence in the Leptopodomorpha Re-Examined". Systematic Zoology. 30 (3): 309. doi:10.2307/2413252. JSTOR 2413252.
  63. ^ Felsenstein, Joseph (1981). "Evolutionary trees from DNA sequences: A maximum likelihood approach". Journal of Molecular Evolution. 17 (6): 368–76. Bibcode:1981JMolE..17..368F. doi:10.1007/BF01734359. PMID 7288891. S2CID 8024924.
  64. ^ Hendy, M.D; Penny, David (1982). "Branch and bound algorithms to determine minimal evolutionary trees". Mathematical Biosciences. 59 (2): 277. doi:10.1016/0025-5564(82)90027-X.
  65. ^ Lipscomb, Diana (1985). "The Eukaryotic Kingdoms". Cladistics. 1 (2): 127–40. doi:10.1111/j.1096-0031.1985.tb00417.x. PMID 34965673. S2CID 84151309.
  66. ^ Felsenstein, J (1985). "Confidence limits on phylogenies: an approach using the bootstrap". Evolution. 39 (4): 783–791. doi:10.2307/2408678. JSTOR 2408678. PMID 28561359.
  67. ^ Lanyon, S. M (1985). "Detecting Internal Inconsistencies in Distance Data". Systematic Biology. 34 (4): 397–403. CiteSeerX 10.1.1.1000.3956. doi:10.1093/sysbio/34.4.397.
  68. ^ Saitou, N.; Nei, M. (1987). "The neighbor-joining method: A new method for reconstructing phylogenetic trees". Molecular Biology and Evolution. 4 (4): 406–25. doi:10.1093/oxfordjournals.molbev.a040454. PMID 3447015.
  69. ^ Bremer, K��re (1988). "The Limits of Amino Acid Sequence Data in Angiosperm Phylogenetic Reconstruction". Evolution. 42 (4): 795–803. doi:10.1111/j.1558-5646.1988.tb02497.x. PMID 28563878. S2CID 13647124.
  70. ^ Farris, James S (1989). "The Retention Index and the Rescaled Consistency Index". Cladistics. 5 (4): 417–419. doi:10.1111/j.1096-0031.1989.tb00573.x. PMID 34933481. S2CID 84287895.
  71. ^ Archie, James W (1989). "Homoplasy Excess Ratios: New Indices for Measuring Levels of Homoplasy in Phylogenetic Systematics and a Critique of the Consistency Index". Systematic Zoology. 38 (3): 253–269. doi:10.2307/2992286. JSTOR 2992286.
  72. ^ Bremer, Kåre (1990). "Combinable Component Consensus". Cladistics. 6 (4): 369–372. doi:10.1111/j.1096-0031.1990.tb00551.x. PMID 34933485. S2CID 84151348.
  73. ^ D. L. Swofford and G. J. Olsen. 1990. Phylogeny reconstruction. In D. M. Hillis and G. Moritz (eds.), Molecular Systematics, pages 411–501. Sinauer Associates, Sunderland, Mass.
  74. ^ Goloboff, Pablo A (1991). "Homoplasy and the Choice Among Cladograms". Cladistics. 7 (3): 215–232. doi:10.1111/j.1096-0031.1991.tb00035.x. PMID 34933469. S2CID 85418697.
  75. ^ Goloboff, Pablo A (1991). "Random Data, Homoplasy and Information". Cladistics. 7 (4): 395–406. doi:10.1111/j.1096-0031.1991.tb00046.x. S2CID 85132346.
  76. ^ Goloboff, Pablo A (1993). "Estimating Character Weights During Tree Search". Cladistics. 9 (1): 83–91. doi:10.1111/j.1096-0031.1993.tb00209.x. PMID 34929936. S2CID 84231334.
  77. ^ Wilkinson, M (1994). "Common Cladistic Information and its Consensus Representation: Reduced Adams and Reduced Cladistic Consensus Trees and Profiles". Systematic Biology. 43 (3): 343–368. doi:10.1093/sysbio/43.3.343.
  78. ^ Wilkinson, Mark (1995). "More on Reduced Consensus Methods". Systematic Biology. 44 (3): 435–439. doi:10.2307/2413604. JSTOR 2413604.
  79. ^ Li, Shuying; Pearl, Dennis K; Doss, Hani (2000). "Phylogenetic Tree Construction Using Markov Chain Monte Carlo". Journal of the American Statistical Association. 95 (450): 493. CiteSeerX 10.1.1.40.4461. doi:10.1080/01621459.2000.10474227. JSTOR 2669394. S2CID 122459537.
  80. ^ Mau, Bob; Newton, Michael A; Larget, Bret (1999). "Bayesian Phylogenetic Inference via Markov Chain Monte Carlo Methods". Biometrics. 55 (1): 1–12. CiteSeerX 10.1.1.139.498. doi:10.1111/j.0006-341X.1999.00001.x. JSTOR 2533889. PMID 11318142. S2CID 932887.
  81. ^ Rannala, Bruce; Yang, Ziheng (1996). "Probability distribution of molecular evolutionary trees: A new method of phylogenetic inference". Journal of Molecular Evolution. 43 (3): 304–11. Bibcode:1996JMolE..43..304R. doi:10.1007/BF02338839. PMID 8703097. S2CID 8269826.
  82. ^ Goloboff, P (2003). "Improvements to resampling measures of group support". Cladistics. 19 (4): 324–32. doi:10.1111/j.1096-0031.2003.tb00376.x. hdl:11336/101057. S2CID 55516104.
  83. ^ Li, M.; Chen, X.; Li, X.; Ma, B.; Vitanyi, P.M.B. (December 2004). "The Similarity Metric". IEEE Transactions on Information Theory. 50 (12): 3250–3264. doi:10.1109/TIT.2004.838101. S2CID 221927.
  84. ^ Cilibrasi, R.; Vitanyi, P.M.B. (April 2005). "Clustering by Compression". IEEE Transactions on Information Theory. 51 (4): 1523–1545. arXiv:cs/0312044. doi:10.1109/TIT.2005.844059. S2CID 911.
  85. ^ Alam, M. Masidur; Naeem, M.; Khan, M. Masroor A.; Uddin, Moin (2017), Naeem, M.; Aftab, Tariq; Khan, M. Masroor A. (eds.), "Vincristine and Vinblastine Anticancer Catharanthus Alkaloids: Pharmacological Applications and Strategies for Yield Improvement", Catharanthus roseus: Current Research and Future Prospects, Cham: Springer International Publishing, pp. 277–307, doi:10.1007/978-3-319-51620-2_11, ISBN 978-3-319-51620-2, retrieved 23 July 2024
  86. ^ a b Colijn, Caroline; Gardy, Jennifer (1 January 2014). "Phylogenetic tree shapes resolve disease transmission patterns". Evolution, Medicine, and Public Health. 2014 (1): 96–108. doi:10.1093/emph/eou018. ISSN 2050-6201. PMC 4097963. PMID 24916411.
  87. ^ Pagel, Mark (2017). "Darwinian perspectives on the evolution of human languages". Psychonomic Bulletin & Review. 24 (1): 151–157. doi:10.3758/s13423-016-1072-z. ISSN 1069-9384. PMC 5325856. PMID 27368626.
  88. ^ Heggarty, Paul (2006). "Interdisciplinary Indiscipline? Can Phylogenetic Methods Meaningfully Be Applied to Language Data — and to Dating Language?" (PDF). In Peter Forster; Colin Renfrew (eds.). Phylogenetic Methods and the Prehistory of Languages. McDonald Institute Monographs. McDonald Institute for Archaeological Research. Archived from the original (PDF) on 28 January 2021. Retrieved 19 January 2021.
  89. ^ a b Bowern, Claire (14 January 2018). "Computational Phylogenetics". Annual Review of Linguistics. 4 (1): 281–296. doi:10.1146/annurev-linguistics-011516-034142. ISSN 2333-9683.
  90. ^ Retzlaff, Nancy; Stadler, Peter F. (2018). "Phylogenetics beyond biology". Theory in Biosciences. 137 (2): 133–143. doi:10.1007/s12064-018-0264-7. ISSN 1431-7613. PMC 6208858. PMID 29931521.
  91. ^ Hoffmann, Konstantin; Bouckaert, Remco; Greenhill, Simon J; Kühnert, Denise (25 November 2021). "Bayesian phylogenetic analysis of linguistic data using BEAST". Journal of Language Evolution. 6 (2): 119–135. doi:10.1093/jole/lzab005. hdl:1885/311145. ISSN 2058-458X.
  92. ^ Spencer, Matthew; Davidson, Elizabeth A; Barbrook, Adrian C; Howe, Christopher J (21 April 2004). "Phylogenetics of artificial manuscripts". Journal of Theoretical Biology. 227 (4): 503–511. Bibcode:2004JThBi.227..503S. doi:10.1016/j.jtbi.2003.11.022. ISSN 0022-5193. PMID 15038985.
  93. ^ Lycett, Stephen J. (14 October 2009). "Understanding Ancient Hominin Dispersals Using Artefactual Data: A Phylogeographic Analysis of Acheulean Handaxes". PLOS ONE. 4 (10): e7404. Bibcode:2009PLoSO...4.7404L. doi:10.1371/journal.pone.0007404. PMC 2756619. PMID 19826473.
  94. ^ Tripp, Allison (2016). "A Cladistics Analysis Exploring Regional Patterning of the Anthropomorphic Figurines from the Gravettian". Cultural Phylogenetics. Interdisciplinary Evolution Research. Vol. 4. pp. 179–202. doi:10.1007/978-3-319-25928-4_8. ISBN 978-3-319-25926-0. {{cite book}}: |journal= ignored (help)
  95. ^ Marwick, Ben; Matzig, David; Riede, Felix. "Bayesian inference of material culture phylogenies using continuous traits: A birth–death model for Late Neolithic/Early Bronze Age arrowheads from Northwestern Europe". Osf.io. doi:10.31235/osf.io/j2kva.
  96. ^ Manem, Sébastien (1 December 2020). "Modeling the Evolution of Ceramic Traditions Through a Phylogenetic Analysis of the Chaînes Opératoires: the European Bronze Age as a Case Study". Journal of Archaeological Method and Theory. 27 (4): 992–1039. doi:10.1007/s10816-019-09434-w.
  97. ^ O'Brien, Michael J.; Lyman, R. Lee (1 March 2000). "Darwinian Evolutionism Is Applicable to Historical Archaeology". International Journal of Historical Archaeology. 4 (1): 71–112. doi:10.1023/A:1009556427520.
  98. ^ Matzig, David N.; Marwick, Ben; Riede, Felix; Warnock, Rachel C. M. (August 2024). "A macroevolutionary analysis of European Late Upper Palaeolithic stone tool shape using a Bayesian phylodynamic framework". Royal Society Open Science. 11 (8). Bibcode:2024RSOS...1140321M. doi:10.1098/rsos.240321. PMC 11321859. PMID 39144489.

Bibliography

edit
edit