National Academies Press: OpenBook
« Previous: RNA EVOLUTION REVISITED
Suggested Citation:"REFERENCES." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.
×
Page 112
Suggested Citation:"REFERENCES." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.
×
Page 113

Below is the uncorrected machine-read text of this chapter, intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text of each book. Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.

HEARING DISTANT ECHOES: USING EXTREMAL STATISTICS TO PROBE EVOLUTIONARY ORIGINS 112 while the standard deviation is estimated to be = 1.49. In contrast to Table 4.1, there is one tRNA, that for cystine, that scores exceptionally high. The tails of the extremal distributions in the logarithmic region probably behave like exp(−λt), where t is the test value as in the section "Local Sequence Comparisons" and λ is a constant, but this has not yet been proven rigorously. The usual intuition informed by the tail of a normal distribution has the probabilities behaving like exp(−t2)/2, which converges to 0 much more rapidly than the Poisson or exponential tails. Thus except for the cystine score, the remaining scores look very much like scores from random sequences. Simulations were performed, and the score 21.0 has an approximate p-value of 10−3, so that it is not possible to dismiss this matching for statistical reasons alone. As far as we know, no one has offered a biological explanation of this interesting match. As to the hypothesis of Bloch et al. (1983), while their work concluded that "matches are too frequent and extensive to be attributed to coincidence," it is not supported by the data but is instead the result of incorrect estimation of p-values. This data set received their most extensive analysis, and they concluded that over 30 percent of the matchings between E. coli 16S rRNA and tRNAs were significant at the level α = 0.10. Correct estimates show about 10 percent of the matching at the level α = 0.10. While the origin of life may be hiding in tRNA and 16S rRNA, it remains elusive. REFERENCES Aldous, D.J., 1989, Probability Approximations via the Poisson Clumping Heuristic, New York: Springer-Verlag. Arratia, R.A., L. Goldstein, and L. Gordon, 1989, "Two moments suffice for Poisson approximation: The Chen-Stein method," Annals of Probability 17, 9-25. Arratia, R.A., L. Goldstein, and L. Gordon, 1990, "Poisson approximation and the Chen-Stein method," Statistical Science 5, 403-434. Arratia, R.A., and M.S. Waterman, 1994, "A phase transition for the score in matching random sequences allowing deletions," Annals of Applied Probability 4, 200-225. Bloch, D.P., B. McArthur, R. Widdowson, D. Spector, R.C. Guimaraes, and J. Smith, 1983, "tRNA-rRNA sequence homologies: Evidence for a common evolutionary origin?," Journal of Molecular Evolution 19, 420-428. Chung, K.L., 1974, A Course in Probability Theory, 2nd ed, San Diego, CA: Academic Press.

HEARING DISTANT ECHOES: USING EXTREMAL STATISTICS TO PROBE EVOLUTIONARY ORIGINS 113 Chvátal, V, and D. Sankoff, 1975, "Longest common subsequences of two random sequences," Journal of Applied Probability 12, 306-315. Deken, J., 1979, "Some limit results for longest common subsequences," Discrete Mathematics 26, 17-31. Erdös, P., and A. Rényi, 1970, "On a new law of large numbers," Journal d' Analyse Mathematique 22, 103-111. Reprinted in 1976 in Selected Papers of Alfred Rényi, Vol. 3, 1962-1970s, Budapest: Akadémiai Kiadó. Feller, W., 1968, An Introduction to Probability Theory and Its Applications, Vol. 1, 3rd ed., New York: Wiley and Sons. Goldstein, L., 1990, "Poisson approximation and DNA sequence matching," Communications in Statistics. Part A: Theory and Methods 19, 4167-4179. Kingman, J.F.C., 1973, "Subadditive ergodic theory," Annals of Probability 1, 883-909. Smith, T.F., and M.S. Waterman, 1981, "Identification of common molecular subsequences," Journal of Molecular Biology 147, 195-197. Steele, J.M., 1986, "An Efron-Stein inequality for nonsymmetric statistics," Annals of Statistics 14, 753-758. Waterman, M.S., and M. Eggert, 1987, "A new algorithm for best subsequence alignments with application to tRNA-rRNA comparisons," Journal of Molecular Biology 197, 723-728. Waterman, M.S., L. Gordon, and R. Arratia, 1987, "Phase transitions in sequence matches and nucleic acid structure," Proceedings of the National Academy of Sciences USA 84, 1239-1243. Waterman, M.S., and M. Vingron, 1994, "Rapid and accurate estimates of statistical significance for data base searches," Proceedings of the National Academy of Sciences USA 91, 4625-4628.

Next: Chapter 5 Calibrating the Clock: Using Stochastic Processes to Measure the Rate of Evolution »
Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology Get This Book
×
 Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology
Buy Paperback | $80.00
MyNAP members save 10% online.
Login or Register to save!
Download Free PDF

As researchers have pursued biology's secrets to the molecular level, mathematical and computer sciences have played an increasingly important role—in genome mapping, population genetics, and even the controversial search for "Eve," hypothetical mother of the human race.

In this first-ever survey of the partnership between the two fields, leading experts look at how mathematical research and methods have made possible important discoveries in biology.

The volume explores how differential geometry, topology, and differential mechanics have allowed researchers to "wind" and "unwind" DNA's double helix to understand the phenomenon of supercoiling. It explains how mathematical tools are revealing the workings of enzymes and proteins. And it describes how mathematicians are detecting echoes from the origin of life by applying stochastic and statistical theory to the study of DNA sequences.

This informative and motivational book will be of interest to researchers, research administrators, and educators and students in mathematics, computer sciences, and biology.

READ FREE ONLINE

  1. ×

    Welcome to OpenBook!

    You're looking at OpenBook, NAP.edu's online reading room since 1999. Based on feedback from you, our users, we've made some improvements that make it easier than ever to read thousands of publications on our website.

    Do you want to take a quick tour of the OpenBook's features?

    No Thanks Take a Tour »
  2. ×

    Show this book's table of contents, where you can jump to any chapter by name.

    « Back Next »
  3. ×

    ...or use these buttons to go back to the previous chapter or skip to the next one.

    « Back Next »
  4. ×

    Jump up to the previous page or down to the next one. Also, you can type in a page number and press Enter to go directly to that page in the book.

    « Back Next »
  5. ×

    To search the entire text of this book, type in your search term here and press Enter.

    « Back Next »
  6. ×

    Share a link to this book page on your preferred social network or via email.

    « Back Next »
  7. ×

    View our suggested citation for this chapter.

    « Back Next »
  8. ×

    Ready to take your reading offline? Click here to buy this book in print or download it as a free PDF, if available.

    « Back Next »
Stay Connected!