×

Probability distributions of ancestries and genealogical distances on stochastically generated rooted binary trees. (English) Zbl 1397.92508

J. Theor. Biol. 280, 139-145 (2011); corrigendum ibid. 314, 216-217 (2012).
Summary: The stationary birth-only, or Yule-Furry, process for rooted binary trees has been analysed with a view to developing explicit expressions for two fundamental statistical distributions: the probability that a randomly selected leaf is preceded by \(N\) nodes, or “ancestors”, and the probability that two randomly selected leaves are separated by \(N\) nodes. For continuous-time Yule processes, the first of these distributions is presented in closed analytical form as a function of time, with time being measured with respect to the moment of “birth” of the common ancestor (which is essentially inaccessible to phylogenetic analysis), or with respect to the instant at which the first bifurcation occurred.
The second distribution is shown to follow in an iterative manner from a hierarchy of second-order ordinary differential equations.
For Yule trees of a given number \(n\) of tips, expressions have been derived for the mean and variance for each of these distributions as functions of \(n\), as well as for the distributions themselves.
In addition, it is shown how the methods developed to obtain these distributions can be employed to find, with minor effort, expressions for the expectation values of two statistics on Yule trees, the Sackin index (sum over all root-to-leaf distances), and the sum over all leaf-to-leaf distances.

MSC:

92D15 Problems related to evolution
05C90 Applications of graph theory
05C05 Trees
62P10 Applications of statistics to biology and medical sciences; meta analysis
Full Text: DOI

References:

[1] Abramowitz, M.; Stegun., I., Handbook of mathematical functions and formulas, graphs, and mathematical tables, (1972), Dover New York, p. 824 · Zbl 0543.33001
[2] Aldous, D., Stochastic models and descriptive statistics for phylogenetic trees, from Yule to today, Stat. sci., 16, 23-34, (2001) · Zbl 1127.60313
[3] Bailey, N.T.J., The elements of stochastic processes with applications to the natural sciences, (1964), Wiley New York · Zbl 0127.11203
[4] Beichelt, F.E.; Fatti, L.P., Stochastic processes and their applications, (2002), CRC Press · Zbl 1064.60001
[5] Blum, M.G.B.; François, O.; Janson, S., The Mean, variance and limiting distribution of two statistics sensitive to phylogenetic tree balance, Ann. appl. prob., 16, 2195-2214, (2006) · Zbl 1124.05025
[6] Felsenstein, J., Inferring phylogenies, (2004), Sinauer Associates Sunderland, MA
[7] Gernhard, T., The conditioned reconstructed process, J. theoret. biol., 253, 769-778, (2008) · Zbl 1398.92171
[8] Hwang, H.-K., Asymptotic expansions for the Stirling numbers of the first kind, J. combinatorial theory A, 71, 343-351, (1995) · Zbl 0833.05005
[9] Kirkpatrick, M.; Slatkin, M., Searching for evolutionary patterns in the shape of a phylogenetic tree, Evolution, 47, 1171-1181, (1993)
[10] Lynch, W.C., More combinatorial properties of certain trees, Comput. J., 71, 299-302, (1965) · Zbl 0136.38801
[11] McKenzie, A.; Steel, M., Distributions of cherries for two models of trees, Math. biosci., 164, 81-92, (2000) · Zbl 0947.92021
[12] Sackin, M.J., “good“ and “bad” phenograms, Syst. zool., 21, 225-226, (1972)
[13] Semple, C.; Steel, M., Phylogenetics, (2003), Oxford University Press · Zbl 1043.92026
[14] Steel, M.; McKenzie, A., Properties of phylogenetic trees generated by Yule-type speciation models, Math. biosci., 170, 91-112, (2001) · Zbl 0977.92017
[15] Steel, M.; Mooers, A., Expected length of pendant and interior edges of a Yule tree, Appl. math. lett., 23, 1315-1319, (2010) · Zbl 1195.92048
[16] Venditti, C.; Meade, A.; Pagel, M., Phylogenies reveal new interpretation of speciation and the red queen, Nature, 463, 349-352, (2010)
[17] Yule, G.U., A mathematical theory of evolution, based on the conclusions of Dr. J.C. willis, FRS, Phil. trans. R. soc. London B, 213, 21-87, (1924)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.