Read "Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology" at NAP.edu


Suggested Citation:"Visualizing Alignments: Edit Graphs." National Research Council. 1995. Calculating the Secrets of Life: Contributions of the Mathematical Sciences to Molecular Biology. Washington, DC: The National Academies Press. doi: 10.17226/2121.

Below is the uncorrected machine-read text of this chapter, intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text of each book. Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.

SEEING CONSERVED SIGNALS: USING ALGORITHMS TO DETECT SIMILARITIES BETWEEN BIOSEQUENCES

The unit-cost scoring scheme of Figure 3.1 is not the only possible scheme. Later in this chapter, we will see a much more complex scoring scheme used in the comparison of proteins (20-letter alphabet). In that scheme and other scoring schemes, the scores in the table are real numbers assigned on the basis of various interpretations of empirical evidence. Let us introduce here a formal framework to assist our thinking.

Figure 3.1 Unit-cost scoring scheme.

Consider comparing sequence A = a_1a_2···a_M and sequence B = b_1b_2···b_N, whose symbols range over some alphabet Σ, for example, Σ = {A,C,G,T} for DNA sequences. Let δ(a,b) be the score for aligning a with b, let δ(a,-) be the score of leaving symbol a unaligned in sequence A, and let δ(-,b) be the score of leaving b unaligned in B. Here a and b range over the symbols in Σ, and "-" is the gap symbol. The score of an alignment is simply the sum of the scores δ assigns to each pair of aligned symbols. For example, the score of

A T T A - C G
A - T A T C G

is δ(A,A) + δ(T,-) + δ(T,T) + δ(A,A) + δ(-,T) + δ(C,C) + δ(G,G), which for the scoring scheme of Figure 3.1 equals 5. An optimal alignment under a given scoring scheme is an alignment that yields the highest sum.

Visualizing Alignments: Edit Graphs

Many investigators have found it illuminating to convert the problem of finding similarities into one of finding certain paths in an edit graph.
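The scoring framework defined in this section can be sketched in a few lines of Python. This is only an illustration: Figure 3.1 is not reproduced in this text, so the sketch assumes the unit-cost scheme scores 1 for a pair of identical symbols and 0 for a mismatch or an unaligned symbol, which is consistent with the total of 5 stated for the example. The names GAP, delta, and alignment_score are hypothetical, and the two alignment rows are read off the terms of the score expression in the text.

```python
GAP = "-"  # gap symbol; the ASCII hyphen is an assumption of this sketch


def delta(a, b):
    """Assumed unit-cost score: 1 for identical symbols, 0 otherwise.

    A column containing a gap (an unaligned symbol) scores 0.
    """
    if a == GAP or b == GAP:
        return 0
    return 1 if a == b else 0


def alignment_score(row_a, row_b, score=delta):
    """Sum the score of each column of an alignment.

    row_a and row_b are the two rows of the alignment, already padded
    with GAP so that they have the same length.
    """
    if len(row_a) != len(row_b):
        raise ValueError("alignment rows must have equal length")
    return sum(score(a, b) for a, b in zip(row_a, row_b))


# Columns: (A,A) (T,-) (T,T) (A,A) (-,T) (C,C) (G,G)
example_score = alignment_score("ATTA-CG", "A-TATCG")
print(example_score)  # 5, matching the value given in the text
```

Passing a different function as the score argument would model other schemes, such as the real-valued protein scoring tables the text mentions, without changing the summation itself.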
