National Academies Press: OpenBook
« Previous: Approximations for the Ewens Sampling Formula
Suggested Citation:"Combinatorial Assemblies." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.
×
Page 139
Suggested Citation:"Combinatorial Assemblies." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.
×
Page 140
Suggested Citation:"Combinatorial Assemblies." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.
×
Page 141

Below is the uncorrected machine-read text of this chapter, intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text of each book. Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.

CALIBRATING THE CLOCK: USING STOCHASTIC PROCESSES TO MEASURE THE RATE OF EVOLUTION 139 (5.27) so that indeed Cb and Zb can be coupled closely if (and, it turns out, only if) b = o(n). Combinatorial Assemblies The spirit of the approximations in the preceding subsection—replacing a dependent process with an independent one—carries over to other combinatorial structures. The first of these is the class of assemblies. Assemblies are labeled structures built as follows. The set {1,2,. . .,n} is partitioned into ak subsets of size k, for k = 1,2,. . .,n, and each subset of size k is marked as one of mk indecomposable components of size k. For example, in the case of permutations, mk = (k − 1)!, and the components of size k are the cycles on k elements. The number of structures N(a) of weight n having ai components of size i, i = 1,2,. . .,n, is therefore given by (5.28) and the total numberp(n) of structures of weight n is given by (5.29) A random structure of weight n is obtained by choosing one of the p(n) possibilities with equal probability. If Cj − Cj(n) denotes the number of components of size j, then (5.30)

CALIBRATING THE CLOCK: USING STOCHASTIC PROCESSES TO MEASURE THE RATE OF EVOLUTION 140 In the case of permutations, this reduces to (5.22), because then mj / j! = 1 / j. Note that for any x > 0, the probability above is proportional to so that by comparison with (5.22) we see that (C1, C2,. . ., Cn) = (Zl,Z2,. . .,Zn| T = n), where the Zi are independent Poisson random variables with means In particular this implies that where Rb = ∑1≤i≤b iZi. This observation reduces the calculation of a total variation distance between two processes to the calculation of a total variation distance between two random variables. We focus our attention on the class of assemblies that satisfies the logarithmic condition (5.31) for some κ,y > 0. Among these are random permutations (for which (5.31) holds identically in i with κ = y = 1), and random mappings of [n] to itself, for which . The study of random mappings has a long and venerable history in the combinatorics literature and is reviewed in Mutafciev (1984), Kolchin (1986), and Flajolet and Odlyzko (1990), for example.

CALIBRATING THE CLOCK: USING STOCHASTIC PROCESSES TO MEASURE THE RATE OF EVOLUTION 141 For the logarithmic class we may choose x = y−1, and then it is known (under an additional mild rate of convergence assumption in (5.31)) that (5.32) just as for the ESF. But more detailed information is available. For example, Arratia et al. (1994b) show that for fixed b, (5.33) The term |κ − 1| reflects the similarity of the structure to an ESF with parameter κ, whereas the term E[|Rb − E [Rb]|] reflects the local behavior of the structure. The θ-biased structures, those with probability proportional to q to the number of components, can also be studied in this way. In particular (5.30) holds, the Poisson-distributed Z, now having mean The accuracy of the approximation of Cb by Zb for the logarithmic class is still measured by (5.32) and (5.33), with κ replaced by θκ. A rather weak consequence of the bounds typified by (5.32) and (5.33) is the fact that for each fixed b, (C1 (n),C2(n),. . .,Cb(n)) (Z1 ,Z2,. . .,Zb), meaning that the component counting process C converges in distribution (in R∞ ) to the independent process Z. For each n, we are comparing the combinatorial process to a single limiting process. This recovers the classical result of Goncharov (1944) showing that the cycle counts of a random permutation are asymptotically independent Poisson random variables with means 1/i. The analog for random mappings is due to Kolchin (1976).

Next: Other Combinatorial Structures »
Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology Get This Book
×
 Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology
Buy Paperback | $80.00
MyNAP members save 10% online.
Login or Register to save!
Download Free PDF

As researchers have pursued biology's secrets to the molecular level, mathematical and computer sciences have played an increasingly important role—in genome mapping, population genetics, and even the controversial search for "Eve," hypothetical mother of the human race.

In this first-ever survey of the partnership between the two fields, leading experts look at how mathematical research and methods have made possible important discoveries in biology.

The volume explores how differential geometry, topology, and differential mechanics have allowed researchers to "wind" and "unwind" DNA's double helix to understand the phenomenon of supercoiling. It explains how mathematical tools are revealing the workings of enzymes and proteins. And it describes how mathematicians are detecting echoes from the origin of life by applying stochastic and statistical theory to the study of DNA sequences.

This informative and motivational book will be of interest to researchers, research administrators, and educators and students in mathematics, computer sciences, and biology.

READ FREE ONLINE

  1. ×

    Welcome to OpenBook!

    You're looking at OpenBook, NAP.edu's online reading room since 1999. Based on feedback from you, our users, we've made some improvements that make it easier than ever to read thousands of publications on our website.

    Do you want to take a quick tour of the OpenBook's features?

    No Thanks Take a Tour »
  2. ×

    Show this book's table of contents, where you can jump to any chapter by name.

    « Back Next »
  3. ×

    ...or use these buttons to go back to the previous chapter or skip to the next one.

    « Back Next »
  4. ×

    Jump up to the previous page or down to the next one. Also, you can type in a page number and press Enter to go directly to that page in the book.

    « Back Next »
  5. ×

    To search the entire text of this book, type in your search term here and press Enter.

    « Back Next »
  6. ×

    Share a link to this book page on your preferred social network or via email.

    « Back Next »
  7. ×

    View our suggested citation for this chapter.

    « Back Next »
  8. ×

    Ready to take your reading offline? Click here to buy this book in print or download it as a free PDF, if available.

    « Back Next »
Stay Connected!