Questions? Call 888-624-8373

Rights & Permissions

topleft topright

Language and Machines: Computers in Translation and Linguistics (1966)

Page
25
bottomleft bottomright
Page
25
Front Matter (R1-R11)
Contents (R12-R14)
Human Translation (1-1)
Types of Translator Employment (2-3)
English as the Language of Science (4-4)
Time Required for Scientists to Learn Russian (5-5)
Translation in the United States Government (6-6)
Number of Government Translators (7-8)
Amount Spent for Translation (9-10)
Is there a Shortage of Translators or Translation? (11-12)
Regarding a Possible Excess of Translation (13-15)
The Crucial Problems of Translation (16-18)
The Present State of Machine Translation (19-24)
Machine-Aided Translation at Mannheim and Luxembourg (25-28)
Automatic Language Processing and Computational Linguistics (29-31)
Avenues to Improvement of Translation (32-33)
Recommendations (34-34)
Appendix 1. Experiments in Sight Translation and Full Translation (35-36)
Appendix 2. Defense Language Institute Course in Scientific Russian (37-38)
Appendix 3. The Joint Publications Research Service (39-40)
Appendix 4. Public Law 480 Translations (41-42)
Appendix 5. Machine Translations at the Foreign Technology Division, U.S. Air Force Systems Command (43-44)
Appendix 6. Journals Translated with Support by the National Science Foundation (45-49)
Appendix 7. Civil Service Commission Data on Federal Translators (50-53)
Appendix 8. Demand for and Availability of Translators (54-56)
Appendix 9. Cost Estimates of Various Types of Translation (57-66)
Appendix 10. An Experiment in Evaluating the Quality of Translations (67-75)
Appendix 11. Types of Errors Common in Machine Translation (76-78)
Appendix 12. Machine-Aided Translation at the Federal Armed Forces Translation Agency, Mannheim, Germany (79-86)
Appendix 13. Machine-Aided Translation at the European Coal and Steel Community, Luxembourg (87-90)
Appendix 14. Translation Versus Postediting of Machine Translation (91-101)
Appendix 15. Evaluation by Science Editors and Joint Publications Research Service and Foreign Technology Division Translations (102-106)
Appendix 16. Government Support of Machine-Translation Research (107-112)
Appendix 17. Computerized Publishing (113-117)
Appedix 18. Relation Between Programming Languages and Linguistics (118-120)
Appendix 19. Machine Translation and Linguistics (121-123)
Appendix 20. Persons Who Appeared Before the Committee (124-124)

Below are the first 10 and last 10 pages of uncorrected machine-read text (when available) of this chapter, followed by the top 30 algorithmically extracted key phrases from the chapter as a whole.
Intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text on the opening pages of each chapter. Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.

Do not use for reproduction, copying, pasting, or reading; exclusively for search engines.

OCR for page 25
Machine-Aided Translation at Mannheim ~ and Luxembourg As it becomes increasingly evident that fully automatic high-quality machine translation was not going to be realized for a long time, interest began to be shown in machine-aided translation. The Com- mittee has knowledge of two important machine-aided translation systems in operation: the Federal Armed Forces Translation Agency, Mannheim, Germany, and the Terminological Bureau of the European Coal and Steel Community, Luxembourg. At these centers the approach is conservative; a machine is used to produce specialized glossaries helpful in the translation of particular docu- ments. (Although the translation system in operation at the USAF Foreign Technology Division, Wright-Patterson Air Force Base, is being called, with increasing frequency, ',machine- aided translation," it is actually a system of human-aided machine translation, relying, as it must, on posteditors to make up for the deficiencies of the machine output.) MACHINE-AIDED TRANSLATION AT THE FEDERAL ARMED FORCES TRANSLATION AGENCY, MANNHEIM, GERMANY The Federal Armed Forces Translation Agency conducted an ex- periment designed to determine to what extent and in what areas machine output could aid the human translator. Two translators were given identical English texts to be translated into German. Neither translator was a specialist in the technical field treated in the text. Translator A had the conventional dictionaries and other reference works found in technical libraries and access to experienced experts. Translator B was given only a text-based or text-related glossary (TRG) that listed all and only the technical terms in the original text in the sequence in which they occurred plus their German equivalent or equivalents. To minimize any differences in the translators' abilities, a second text was 25

OCR for page 26
translated in which translator A used the TRG and translator B worked in the conventional way. The procedure above was repeated with two different translators and two different technical texts. Results of the test indicated that a translator working with conventional aids requires between 50- 86 percent (average, 66 percent) more time than a translator work- ing with a text-related glossary. In addition to increased speed, another advantage of the TRG type of translation was that using this method the translators made one third fewer errors. We quote below from a translation of a paper titled "Production of Text-Related Technical Glossaries by Digital Computer, A Pro- cedure to Provide an Automatic Translation Aid," by F. Krollmann, H. J. Schuck, and U. Winkler (the German original appeared in the January 1965 issue of Beitrage zur Sprachkunde und Informations- verarbeitung): These two experiments have shown that the speed (and thus the cost) of the translator's work as well as the quality of his product (and thus the output of the editor) can be considerably improved if it is possible to relieve the translator of the unproductive and tiresome search for the correct techni- cal term that frequently cannot possibly be included yet in any of the con- ventional dictionaries. These figures would suggest that, ideally, the error quota in translations of technical-scientific texts can be reduced by appro=- mately 40 percent—a figure which experience indicates can be improved by at least another 10-15 percent since better understanding of the text fre- quently results in improved linguistic rendition (unambiguity of style)—and that translator productivity can be increased by over 50 percent. The system works in the following way. The translator reads through the text to be translated and underlines the English words for which he desires to know the German equivalent. The text is then given to a keypunch operator who punches the cards for the underlined words and at the same time performs morphological reduction of the English words (in most cases this simply involves omitting the inflectional suffixes). The information on the cards is then put into the computer, which can produce three or four text- related glossaries in about 10 min. The TRG system became opera- tional in 1965 and in early 1966 was connected by a data-link with a Telefunken TR-4 computer in Trier. At present the Federal Air Force Translation Agency has a co- operative agreement for exchange of terminologies with the U.S. Defense Language Institute/West Coast Branch, the British Admiralty, the European Coal and Steel Community, and others. An analysis of a test run and some sample output is to be found in Appendix 12. This technique was developed by the Federal 26

OCR for page 27
Ministry of Defense of West Germany which very kindly made available for the Committee use of the material in Appendix 12. MACHI NE-AIDED TRANSLATION AT THE EUROPEAN COAL AND STEEL COMMUNITY, LUXEMBOURG The Terminological Bureau of the European Coal and Steel Com- munity (CECA) was established in 1950 to provide assistance to the Translation Bureau, which had the task of performing translations into and out of the four official languages of CECA—French, Dutch, Italian, and German. The Head of the Terminological Bureau, Mr. J. A. Bachrach, estimates that a minimum of 25 percent of the translator's time is spent on terminological questions and that, in difficult documents, up to 75 percent of the translator's time is spent on these problems. In collaboration with Mrs. Lydia Hirschberg of the Free University of Brussels and her group, various approaches to this problem were considered. Soon a system was devised by which the translator's time-consuming job of finding the answers to questions of termi- nology was made easier. The system utilized at CECA is one of automatic dictionary look-up with context included. The operation is similar to that used at Mannheim, but the output is somewhat different. It is simi- lar in that the translator indicates, by underlining, the words with which he desires help. The entire sentence is then keypunched and fed into a computer. The computer goes through a search routine and prints out the sentence or sentences that most nearly match (in lexical items) the sentences in question. The translator then re- ceives the desired items printed out with their context and in the order in which they occur in the source. The translation of the sentence is not done by the computer, but by a human translator. However, since the data produced by each query are added to the data base, the more the system is in use, the greater is the probability of finding sentences that have the desired term in the proper context. A sample of typical CECA French- English output in shown in Appendix 13. The information that has been built up by CECA not only is of value in answering the queries of translators but also enables CECA to publish specialized glossaries in a very short time. Appendix 13, a copy of one extract from a five-language glossary prepared for the Congress on Steel Utilization is attached. The Committee finds it difficult to assess the difficulty and cost of postediting. An initial reaction is apt to be like that of R. T. Beyer Whys. Today_ (1), 50 (1965~: 27

OCR for page 28
I must confess that the results were most unhappy. I found that I spent at least as much time In editing as if I had carried out the entire translation from the start. Even at that, I doubt if the edited translation reads as smoothly as one which I would have started from scratch. I drew the con- clusion that the machine today translates from a foreign language to a form of broken English somewhat comparable to pidgin English. But it then re- mains for the reader to learn this patois in order to understand what the Russian actually wrote. Learning Russian would not be much more difficult. Someday, perhaps, the machines will make it, but I as a translator do not yet believe that I must throw my monkey wrench into the machinery in order to prevent my technological unemployment. The Committee had some postediting done as an experiment (see Appendix 14~. Postediting took as long as translation, yet people said they were willing to do it for less per word! FTD figures indicate that in-house postediting is done faster than in-house translation. Studies of the FTD operation indicate that keyboard transcrip- tion of the cyrillic text is a very minor part of the total cost. Thus, automatic character recognition could cut the cost of the operation only a little. On the other hand, a large fraction of the cost is in putting the final translation together, with figures and equations, and reproducing it. If we compare the cost of human in-house translation ($40 per 1,000 Russian words) with the cost of machine-aided translation within FTD ($36 per 1,000 Russian words), machine-aided transla- tion appears to be somewhat less expensive. But FTD machine- aided translation is costlier than contract translation ($33 per 1,000) and far costlier than Joint Publications Research Service (JPRS) translation ($16 per 1,000 English words). Appendix 15 gives data on a comparison by experts of the quality of some recent JPRS translations and FTD machine-aided trans- lations. The text of the JPRS translations was judged to be better than that of the FTD translations. The quality of the reproduction of text and figures was judged to be poor in both cases, with JPRS superior to FTD. We wonder why the Air Force pays more for translations made by FTI) than superior and prompter JPRS translations would cost. 28

Representative terms from entire chapter:

translation agency