Below are the first 10 and last 10 pages of uncorrected machine-read text (when available) of this chapter, followed by the top 30 algorithmically extracted key phrases from the chapter as a whole.
Intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text on the opening pages of each chapter.
Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.
Do not use for reproduction, copying, pasting, or reading; exclusively for search engines.
OCR for page 214
OCR for page 215
FREDERICK JELINEK
1932–2010
elected in 2006
“For contributions to statistical language processing with
applications to automatic speech recognition.”
By BiiNg-HWaNg (fred) JUaNg aNd laWreNce raBiNer
f REDERICK JELINEK, an information-theoretic linguist,
died september 14, 2010, of a heart attack, at the age of 77.
Jelinek was a czech american researcher in information theory
and natural language processing. Born just before the war,
on November 18, 1932, in czechoslovakia to a Jewish father
(who was a dentist and died of disease in the Theresienstadt
concentration camp in 1945) and a catholic mother (who
converted to Judaism), fred Jelinek and his family managed
to escape being sent to the concentration camps, due to his
mother’s background, and to eventually emigrate to the United
states in the early years of the communist regime.
Jelinek studied the newly developing field of information
theory at the Massachusetts institute of Technology under
Professor robert fano. Jelinek had begun to develop an
interest in linguistics after the immigration of his wife, who
initially enrolled in the linguistics program at MiT; he often
accompanied her to linguistics classes. after his graduate
degree, he joined cornell University in anticipation of the
opportunity to work with charles f. Hockett, a prominent
linguist there. although the intended research program in
linguistics fell through, Jelinek went on to teach and develop
information theory at cornell in the subsequent years before
joining IBM Research in 1972. Jelinek’s affiliation with Cornell
was not a long one, but he deserved credit for the creation and
215
OCR for page 216
216 MeMorial TriBUTes
recognized prominence of cornell’s program in information
theory.
interestingly, at iBM, Jelinek got the opportunity to pursue
research in the two fields of most interest to him—linguistics
and information theory—through the challenge of designing an
automatic typewriter that would respond to the human voice.
He saw speech recognition as an information theory problem,
rather than the traditional view that speech recognition could
(or to some, should) be solved using fundamental principles in
acoustics and/or linguistics (or more likely a combination of
the two technologies). as an admirer of claude shannon, the
“Father of Information Theory,” Jelinek’s approach followed
the fundamental teaching of the “Shannon game,” in which
a person competes with a computer, which keeps an array of
conditional probabilities that define the odds of a given word
following a specified sequence of words, in completing a
partial sentence.
information theorists believe that a machine that chooses the
most probable word to succeed the three (or more) preceding
(already seen) words would be more statistically likely than
the guess made by a human, thereby winning the game. The
machine need not actually understand or guess the meaning
the sentence really intends to express. The computer-selected
word is thus based on the so-called n-gram statistical grammar
commonly used in speech recognition systems today. With
such types of grammar as the discrete source models, Jelinek
treated the speech recognition problem, or rather the natural
language processing problem, as a noisy-channel discrete
decoding problem and advocated the method of maximum
likelihood decoding. This statistical approach to linguistics,
built on the framework of information theory, stayed in
the course of the axiomatic probability theory, avoided the
mathematically intractable problem of human intelligence,
but clashed with the conventional discipline of linguistics.
Jelinek was well known for his wry humor, most especially
this oft-quoted quip: “Every time I fire a linguist, my payroll
goes down and the performance of my speech recognizer goes
up.” Jelinek’s engineering results, as manifested in systems
OCR for page 217
217
FREDERICK JELINEK
such as iBM’s Tangora, eventually won the respect of many
practically minded linguists or linguistic engineers and
earned him many distinctions, including induction into the
National academy of engineering. Jelinek can be considered
a pioneering figure in the field of information-theoretic
linguistics and its most prominent advocate.
in 1993 Jelinek retired from iBM and joined John Hopkins
University as director of the center for language and speech
Processing and Julian sinclair smith Professor of electrical
and computer engineering. soon after joining Johns Hopkins,
Jelinek started to organize the annual summer workshop on
spoken-language research, which has benefitted many young-
generation research engineers and students, even to this date.
Jelinek won numerous distinctions. He received the signal
Processing society award of the institute of electrical and
Electronics Engineers in 1997, the ESCA Medal for Scientific
achievement in 1999, the european language resource
Association’s first Antonio Zampolli Prize in 2004, the James
l. flanagan speech and audio Processing award in 2005, and
the 2009 lifetime achievement award from the association
for computational linguistics. He received a honoris causa
Ph.d. from charles University in 2001 and was inducted into
the National academy of engineering in 2006.
fred Jelinek is survived by his wife, czech screenwriter
Milena Jelinek; a daughter and son; a sister and stepsister; and
three grandchildren.