National Academy of Sciences | 150 Year Anniversary

Questions? Call 800-624-6242

| Items in cart [0]

The National Academies Press

HARDBACK
price:$89.95
add to cart

Rights & Permissions

topleft topright

Voice Communication Between Humans and Machines (1994)
National Academy of Sciences (NAS)

Citation Manager

. "Voice Communication Between Humans and Machines--An Introduction." Voice Communication Between Humans and Machines. Washington, DC: The National Academies Press, 1994.

Please select a format:

BibTeX EndNote RefMan


Page
12
bottomleft bottomright

The following HTML text is provided to enhance online readability. Many aspects of typography translate only awkwardly to HTML. Please use the page image as the authoritative form to ensure accuracy.


Page 12

overview of current speech recognition techniques by John Makhoul (BBN) and a talk on training and search methods by Fred Jelinek (IBM). Makhoul's talk emphasized the rate of progress in improving performance (as measured in terms of word accuracy) in continuous speech recognition over the past several years and the factors that led to these performance improvements. Jelinek concentrated on the mathematical procedures used to train and decode speech recognizers.

The final session of the first day, chaired by Lynette Hirschman (Massachusetts Institute of Technology), dealt with natural language understanding. Madeleine Bates (BBN) discussed models of natural language understanding and reviewed our current understanding in the areas of syntax, semantics, pragmatics, and discourse. Bob Moore (SRI) discussed the way in which speech can be integrated with natural language as the basis for a speech understanding system.

The two morning sessions on the second day were devoted to applications of the technology and were chaired by Chris Seelbach (Seelbach Associates) and John Oberteuffer (ASR News). Excellent overviews of key applications in the areas of telecommunications (Jay Wilpon, AT&T Bell Laboratories), aids for the handicapped (Harry Levitt, CUNY), the military (Cliff Weinstein, MIT Lincoln Laboratory), and consumer electronics (George Doddington, SISTO/DARPA) were given and stimulated lively discussion.

The next session, chaired by David Roe (AT&T Bell Laboratories), concentrated on technical and human requirements for successful technology deployment. Ryohei Nakatsu (NTT) discussed the hardware/ software issues, and Candace Kamm (Bellcore) discussed the user interface issues that needed to be addressed and understood.

The final session, titled "Technology 2001," was chaired by Frank Fallside (University of Cambridge) and consisted of three views of where the technology is headed over the next decade and how each speaker thought it would get there. Bishnu Atal (AT&T Bell Laboratories) looked at fundamentally new research directions. Sadaoki Furui (NTT) predicted the directions of research in synthesis and recognition systems. Finally, Mitch Marcus (University of Pennsylvania) discussed new developments in statistical modeling of semantic concepts.

A highlight of the colloquium was an after-dinner talk by Yasuo Kato (NEC) on the future of voice processing technology in the world of computers and communications. Kato, who has contributed to the field for close to 40 years, looked back at how far we have come and gave glimpses of how far we might go in the next decade.

Page
12
Front Matter (R1-R10)
Dedication (1-4)
Voice Communication Between Humans and Machines--An Introduction (5-12)
Scientific Bases of Human-Machine Communication by Voice (13-14)
Scientific Bases of Human-Machine Communication by Voice (15-33)
The Role of Voice in Human-Machine Communication (34-75)
Speech Communication -- An Overview (76-104)
Speech Synthesis Technology (105-106)
Computer Speech Synthesis: Its Status and Prospects (107-115)
Models of Speech Synthesis (116-134)
Linguistic Aspects of Speech Synthesis (135-156)
Speech Recognition Technology (157-158)
Speech Recognition Technology: A Critique (159-164)
State of the Art in Continuous Speech Recognition (165-198)
Training and Search Methods for Speech Recognition (199-214)
Natural Language Understanding Technology (215-216)
The Roles of Language Processing in a Spoken Language Interface (217-237)
Models of Natural Language Understanding (238-253)
Integration of Speech with Natural Language Understanding (254-272)
Applications of Voice-Processing Technology I (273-274)
A Perspective on Early Commercial Applications of Voice-Processing Technology for Telecommunications and Aids for the Handicapped (275-279)
Applications of Voice-Processing Technology in Telecommunications (280-310)
Speech Processing for Physical and Sensory Disabilities (311-344)
Applications of Voice-Processing Technology II (345-346)
Commercial Applications of Speech Interface Technology: An Industry at the Threshold (347-356)
Military and Government Applications of Human-Machine Communication by Voice (357-370)
Technology Deployment (371-372)
Deployment of Human-Machine Dialogue Systems (373-389)
What Does Voice-Processing Technology Support Today? (390-421)
User Interfaces for Voice Applications (422-442)
Technology in 2001 (443-444)
Speech Technology in the Year 2001 (445-449)
Toward the Ultimate Synthesis/Recognition System (450-466)
Speech Technology in 2001: New Research Directions (467-481)
New Trends in Natural Language Processing: Statistical Natural Language Processing (482-504)
The Future of Voice-Processing Technology in the World of Computers and Communications (505-514)
Author Biographies (515-524)
Index (525-548)