National Academy of Sciences | 150 Year Anniversary

Questions? Call 800-624-6242

| Items in cart [0]

The National Academies Press

HARDBACK
price:$89.95
add to cart

Rights & Permissions

topleft topright

Voice Communication Between Humans and Machines (1994)
National Academy of Sciences (NAS)

Citation Manager

. "The Future of Voice-Processing Technology in the World of Computers and Communications." Voice Communication Between Humans and Machines. Washington, DC: The National Academies Press, 1994.

Please select a format:

BibTeX EndNote RefMan


Page
506
bottomleft bottomright

The following HTML text is provided to enhance online readability. Many aspects of typography translate only awkwardly to HTML. Please use the page image as the authoritative form to ensure accuracy.


Page 506

tial reasons for voice communication for humans, and this thought has driven me to pour energy into speech recognition and synthesis for a long time—for the past 35 years.

I think we are still only halfway to our goal of an advanced or smart interface. From here on the scientific path to our goal only gets steeper.

Today we live in the age of information. Five billion people can benefit from an economically efficient society supported by computers and communications. This will become truer as we become more information oriented.

The NEC Corporation recognized the importance of integrating computers and communication long ago and adopted ''C&C" (computer and communications) as its corporate identity in 1977. In the future, C&C will become an indispensable tool for everyone—for many it already has. In this kind of environment, C&C must be easy to use. It must provide a friendly, natural, smart interface for people. Thus, the voice interface is an important component for the C&C of the future.

Recently, we have seen significant progress in speech-processing technologies, especially in the United States. At NEC we have also made a little progress with large-vocabulary recognition (10,000 words), speaker-independent continuous speech recognition (1000 words), and naturally spoken language understanding. Some of this technology is close to being commercialized.

I will not spend a lot of time discussing speech synthesis, but I must make one important comment. Prosodic features such as pauses, stresses, and intonation are related to semantic structure and emotion and are difficult to convey in written messages. But the role of these features will become very important in sophisticated smart interfaces.

VOICE INTERFACE IN THE C&C INFORMATION SOCIETY

There are many applications for new C&C technologies. In the area of public information services there are applications in employment, securities, risk management, taxation, health, education, marriage, entertainment, shopping, and even funeral arrangements. For business and personal use, there are applications in document design, presentations, publications, information access, and inter/intraoffice communication. And personal applications are also important: text preparation (dictation), e-mail, telephone, scheduling, and personal database management.

Page
506
Front Matter (R1-R10)
Dedication (1-4)
Voice Communication Between Humans and Machines--An Introduction (5-12)
Scientific Bases of Human-Machine Communication by Voice (13-14)
Scientific Bases of Human-Machine Communication by Voice (15-33)
The Role of Voice in Human-Machine Communication (34-75)
Speech Communication -- An Overview (76-104)
Speech Synthesis Technology (105-106)
Computer Speech Synthesis: Its Status and Prospects (107-115)
Models of Speech Synthesis (116-134)
Linguistic Aspects of Speech Synthesis (135-156)
Speech Recognition Technology (157-158)
Speech Recognition Technology: A Critique (159-164)
State of the Art in Continuous Speech Recognition (165-198)
Training and Search Methods for Speech Recognition (199-214)
Natural Language Understanding Technology (215-216)
The Roles of Language Processing in a Spoken Language Interface (217-237)
Models of Natural Language Understanding (238-253)
Integration of Speech with Natural Language Understanding (254-272)
Applications of Voice-Processing Technology I (273-274)
A Perspective on Early Commercial Applications of Voice-Processing Technology for Telecommunications and Aids for the Handicapped (275-279)
Applications of Voice-Processing Technology in Telecommunications (280-310)
Speech Processing for Physical and Sensory Disabilities (311-344)
Applications of Voice-Processing Technology II (345-346)
Commercial Applications of Speech Interface Technology: An Industry at the Threshold (347-356)
Military and Government Applications of Human-Machine Communication by Voice (357-370)
Technology Deployment (371-372)
Deployment of Human-Machine Dialogue Systems (373-389)
What Does Voice-Processing Technology Support Today? (390-421)
User Interfaces for Voice Applications (422-442)
Technology in 2001 (443-444)
Speech Technology in the Year 2001 (445-449)
Toward the Ultimate Synthesis/Recognition System (450-466)
Speech Technology in 2001: New Research Directions (467-481)
New Trends in Natural Language Processing: Statistical Natural Language Processing (482-504)
The Future of Voice-Processing Technology in the World of Computers and Communications (505-514)
Author Biographies (515-524)
Index (525-548)