National Academy of Sciences | 150 Year Anniversary

Questions? Call 800-624-6242

| Items in cart [0]

The National Academies Press

HARDBACK
price:$89.95
add to cart

Rights & Permissions

topleft topright

Voice Communication Between Humans and Machines (1994)
National Academy of Sciences (NAS)

Citation Manager

. "Speech Technology in the Year 2001." Voice Communication Between Humans and Machines. Washington, DC: The National Academies Press, 1994.

Please select a format:

BibTeX EndNote RefMan


Page
446
bottomleft bottomright

The following HTML text is provided to enhance online readability. Many aspects of typography translate only awkwardly to HTML. Please use the page image as the authoritative form to ensure accuracy.


Page 446

technologies, recognition, synthesis, and natural language understanding; and, finally, the applications of this technology.

When the blueprint for this session was fitted together this session was called "Future Technology." The organizers felt that we should think really about it in a very "blue sky" sort of way. I was alarmed by the project altogether at that stage, rushed back home, and started reading about Leonardo da Vinci, H. G. Wells, and dreamed up a few impossible applications for speech recognition. During these ruminations, I thought, there are many interesting things we could discover—how to navigate the oceans of the world safely or, possibly, information about the location of treasure ships lost by the Spanish many years ago. I am sure that squids and other marine animals could tell us a great deal about that. There is also the question of HAL or Blade Runner, Ed Newbard, and old Napoleon Solo who used to ask for channel D. However, after some discussion with the speakers today, they indicated they did not want this sort of stuff at all.

It was decided that we should talk about evolutionary technology—rather than revolutionary technology. So we are talking about what is likely to be possible in the year 2001. In passing, we might note that the ideas of some of our predictions are not all that far away. We have rough models of HAL right now; of Blade Runner, I'm less certain.

However, we have put together a very interesting program for this last session. Certainly, the three speakers are eminently suited to this. They have all made significant contributions to the state of the art in several areas. One of the things we decided to do was to change the order slightly so that Sadaoki Furui will talk first about ultimate synthesis/recognition systems to give us a flavor of his view of the systems that are likely to be available. And then our two other experts will discuss research directions—B. Atal, in the area of speech, and M. Marcus in the area of natural language.

The paragraphs above are a slightly edited version of an audio recording of Frank Fallside's introduction of this session of the colloquium. They are included here for two reasons. First, they capture rather well Frank's persona. As I read them, I can hear his enunciation of the words in his marvelous accent and diction, which ever so slightly betrayed the intended intellectual mischief. Second, of course, is the intellectual mischief itself. What Frank was saying was that predicting the future of technology is fraught with danger and is thus best approached with a bit of self-deprecating humor.

Before exploring that idea further, it is worthwhile to make a few observations about the views of the speakers in this session. There is no need to summarize the material as the papers are presented in

Page
446
Front Matter (R1-R10)
Dedication (1-4)
Voice Communication Between Humans and Machines--An Introduction (5-12)
Scientific Bases of Human-Machine Communication by Voice (13-14)
Scientific Bases of Human-Machine Communication by Voice (15-33)
The Role of Voice in Human-Machine Communication (34-75)
Speech Communication -- An Overview (76-104)
Speech Synthesis Technology (105-106)
Computer Speech Synthesis: Its Status and Prospects (107-115)
Models of Speech Synthesis (116-134)
Linguistic Aspects of Speech Synthesis (135-156)
Speech Recognition Technology (157-158)
Speech Recognition Technology: A Critique (159-164)
State of the Art in Continuous Speech Recognition (165-198)
Training and Search Methods for Speech Recognition (199-214)
Natural Language Understanding Technology (215-216)
The Roles of Language Processing in a Spoken Language Interface (217-237)
Models of Natural Language Understanding (238-253)
Integration of Speech with Natural Language Understanding (254-272)
Applications of Voice-Processing Technology I (273-274)
A Perspective on Early Commercial Applications of Voice-Processing Technology for Telecommunications and Aids for the Handicapped (275-279)
Applications of Voice-Processing Technology in Telecommunications (280-310)
Speech Processing for Physical and Sensory Disabilities (311-344)
Applications of Voice-Processing Technology II (345-346)
Commercial Applications of Speech Interface Technology: An Industry at the Threshold (347-356)
Military and Government Applications of Human-Machine Communication by Voice (357-370)
Technology Deployment (371-372)
Deployment of Human-Machine Dialogue Systems (373-389)
What Does Voice-Processing Technology Support Today? (390-421)
User Interfaces for Voice Applications (422-442)
Technology in 2001 (443-444)
Speech Technology in the Year 2001 (445-449)
Toward the Ultimate Synthesis/Recognition System (450-466)
Speech Technology in 2001: New Research Directions (467-481)
New Trends in Natural Language Processing: Statistical Natural Language Processing (482-504)
The Future of Voice-Processing Technology in the World of Computers and Communications (505-514)
Author Biographies (515-524)
Index (525-548)