Machine Translation Today

A critical look at current desktop systems by Derek Lewis
The appearance of the first machine translation (MT) system for the PC in the early 1980s marked the beginning of a new phase in MT. From them on, and especially in the following decade, developers began to re-engineer systems designed originally on mainframe-type computers for the potentially much larger desktop market; prices were adjusted accordingly. As a result there are now several systems to choose from. Often at prices which compete favourably with standard business applications software. This article discusses some technical advances in MT, especially in relation to the quality of language output, and reviews the types of package currently available. Examples of output from three representative systems are briefly presented in order to give the reader a concrete idea of what to expect from MT. The article concludes with remarks on the difficult task of evaluation.
Improvements in interfaces
The most obvious advance in MT is the incorporation of windows-based interfaces for the user. These provide rapid and easy access to source and target text files and to system dictionaries. Compared with the facilities of the first MT system for PC, Weidner's DOS-based MicroCat, the latest system offer clearly laid out, on-screen menus for entering new terms, coupled with extensive on-line help: the ability to update and create dictionaries is particularly useful for users who are willing to customize the system for their own text types and subject domains. In addition, most MT systems are able to maintain original document formatting; some can even be incorporated directly into a wordprocessing package as an on-screen menu option. A far-reaching and very recent innovation is the interfacing of MT with other applications, such as e-mail packages, web browsers, and voice recognition software. In theory, and in the not too distant future, it should be possible to enter a text as direct speech input and listen to the automatically translated results.
Linguistic capabilities
The main question for professional translators, however, is whether the fundamental linguistic capability of operational systems has improved in line with their interfaces. That this may in fact not be so is suggested by the fact that suppliers on the whole still take care to stress the limitations of raw MT output and are keen not to generate unrealistic expectations on the part of the user. The manual may provide detailed guidelines on preparing text for translation. Examples are: 'use simple sentence structures and punctuation ';'avoid sentence embedding';' rewrite complex sentences into smaller units with a parallel structure ';'include redundant relative pronouns, prepositions, And other words that clarify sentence structure';'avoid strings of prepositional phrases' ; 'avoid a nominal style which reduces the role of verbs' ; 'introduce relative clauses with a pronoun'(e.g. write: 'the car which was heating the air..', not 'the car heating the air¡­.'). The supplier may even include examples of common error types. This is both to assist in post-editing and to prepare the user for less than perfect quality. Clearly it can be difficult for a supplier to strike a balance between the need to promote a product and the requirement to be honest about its limitations. One supplier illustrates this dilemma when it claims, on the one hand, that its system possesses a high degree of 'world knowledge' ( understood to be of extensive grammar rules and dictionary entries) and, on the other hand, includes several pages of advice on how to minimize the complexity of the input text in order to maximize the quality of the output.
It remains the case that MT delivers the best results within a well-defined and carefully managed environment. This generally means using MT for a large volume of texts drawn from limited subject domains in which syntax and lexis are restricted and/ or remain relatively stable over long periods. MT is most effectively employed as part of a professionally planned document production and updating process, ideally involving both pre-and post-editing. Indeed some companies already specialise in integrating translation tools within their publishing workflow. This process can involve, for instance, holding multilingual documentation in the form of a layered database in which text units are tagged and structured (e.g. in SGML). Within such a system, individual units are exported for translation and re-imported into the database as required; at the same time standard layouting formats (such as size and length of text, font, etc.) may be applied to particular text units. One of the purposes of managing documents in this way is to preserve consistency of content and presentation across different languages. Properly managed, the approach speeds up the period between the initial writing of a document and it final delivery to the customer (Brougham, 1997).
Unit recently it would have been unusual to apply MT to anything but formal, written language. However,the translation of less formal language in the form of on-line texts. Notably e-mail messages, is a growth application area that is being specifically targeted by MT suppliers. The potential market for MT here is enormous: the internet generates vast amounts of text across language frontiers, and there is certainly a demand for rapid draft translations where users are more interested in receiving texts for basic information extraction, even at the expense of a loss in stylistic quality. At the same time the linguistic challenges for MT in this context are considerable: like spoken or conversational language, on-line internet' chat ' contains a high percentage of incomplete (ill-formed) sentences, not to mention ellipses, rapid topic shifts, and spelling and grammar errors. Such language is likely to be more difficult to parse than formally correct text. In addition, end users will expect translations to be delivered at little or no cost, although they are likely to be tolerant of output that, while of poor quality by the standards of formal language, provides and adequate basis for mutual communication. CompuServe, who are pioneering MT in this area, have developed a chat translation prototype that translates messages in 5 to 8 seconds; translation of other forms of on-line communication (so called' forum messages') takes up to 3 minutes (Flanagan, 1997) CompuServe's aim is to develop an on-line message translation system that is embedded within the e-mail service itself; the MT component, which is based on existing proprietary software, is centrally provided and invisible to the user. Likewise, Altavista have recently made available an on-line MT service for the on-line translation of short text and web pages; the underlying MT system is Systran and it is available in several language pairs.
Given modern windows environments most MT systems can be interfaced in one way or another with internet applications, a feature which developers and suppliers of established MT systems are now promoting. At its simplest, text can be transferred to and from the MT program via the windows clipboard. MT suppliers who advertise integration with e-mail or the internet include Telegraph and Logos. Transcend's Easy translator is one system that claims to be designed specifically for translating e-mail and web text.
Translation Memory systems
Another important advance is the emergence of Translation Memory (TM) systems. The principle behind TM is simple: the computer memorises previous translations and suggests these to the translator as he works on the a new source test. The units of memorized translation can be word, phrase, or sentence and can be built up on-line while the user is translating a text or, with the aid of text alignment software, constructed form translations done earlier (so-called 'legacy text').Typically , a translator works within an already familiar wordprocessing environment, running the TM package alongside it. As the translator moves through the source test, the TM program matches the test with the contents of its database and offers possible translations which can be pasted in directly to the target text; 'fuzzy matches', which are partial correspondences between source and target text, are also detected and are offered for inclusion or editing as appropriate.
Although not strictly MT (there is no automatic translation in the sense of analysis and synthesis of source and target language structures) TM has obvious attractions. Firstly, it supports the translator within a familiar work environment. Secondly ,it ensures consistency and reliability of terminology, especially for repetitive texts containing small structural units that offer the best chance for successful matching .Thirdly, it allows the translator to draw directly and comprehensively on previous work. Fourthly, unlike MT , TM can be said to 'learn' from text that have been translated and revised .It can even be integrated with MT programs: in this case the TM system is used to trawl through the source text for matching items before the text is submitted for processing by MT. Finally, as well as exploiting the translator's own previous translations to function as the database memory, it should be possible to in corporate the contents of ready-made, commercially produced dictionaries or term banks: with portability of translation memory databases in mind , there have been recent moves to create standards for the various TM formats (TMX, or translation memory exchange). Suppliers are already making strong claims for TM. reminiscent of early MT. They state that, with TM support, translators can produce up to 12,000 words a day. They stress the time savings achieved and the consistency of translation output. It has even been suggested that users conceal the true benefits of TM in order to enhance their competitiveness (and not to be forced to reduce prices to customers who are unwilling to pay for recycled translations). Suppliers of MT systems include TRADOS, TRANSIT, and T1 ( an MT system with a TM module)

Evaluation
IN 1994 an EAGLES subgroup (the EU-supported Expert Advisory Group on Language Engineering Standards) published a draft report on the evaluation of natural language processing systems, ranging from TM and electronic dictionaries to MT proper. Apart from producing a comprehensive set of criteria for evaluation aids, the report discusses user profiles and summarises developments in the translation industry. According to the report the number of languages in which translations are carried out is increasing. At the same time certain languages are emerging as universal focal languages: this means that a company produces its initial documentation in, say, English, and then translates it directly into other languages. Moreover, while texts for translation may be becoming more standardized and repetitive in nature, translation requires more attention to revision, updating, and layouting; the volume of terminology has also increased. Finally , translations are increasingly contracted to outside organi-sations; even in-house translation certres are having to operate as independent business units. As for the place of MT in the translation market, the report concludes that MT is still associated with high technology and is to be found in larger, well-resourced organizations with a sophisticated infrastructure capable of providing good support for IT applications.
The question of how to evaluate MT and which system to buy remains a problem, especially for individual translators and small organisations. While laying the foundation of a benchmark-based approach for evaluating translation tools, the EAGLES draft report does not focus on MT; neither does it provide a league table of current systems and their performance. There are in fact a number of methods available for assessing MT and its output, depending on whether you are a systems developer or a user and whether you are applying strictly linguistic or economic criteria. But they must all be applied with care and generally require considerable time and resources to carry through; this usually places them beyond the reach of smaller businesses. For many organizations, the most realistic approach is to sample a system, preferably with the involvement of translations, and if possible, to consult with existing user. The existence of user groups for specific MT systems would be helpful here, but it is not clear if there are any.

Examples of output
Below are very small sample outputs from three MT systems. Although it is unwise to draw far-reaching conclusions about a system on the basis of limited input samples, it is possible to gain at least an initial idea of the basic characteristics of a system; indeed it may be the only feasible approach for an independent translator. In any case a potential user should always subject a system to some kind of test based on the type of text it will be expected to handle. The text samples here are from German to English. The first system is possibly the cheapest currently available and was the first to appear for windows-based PCs: it is the Globalink Power Translator (PT) which translates between English, French, German and Russian. The second system is langenscheidt's T1, a bi-directional English-German package based on METAL which originated at the linguistics Research Center at the University of Texas in the early 1960s and was acquired by Siemens in 1980. In 1996 T1 for PC was launched as a co-operative venture by Langenscheidt and the Munich-based company GMS (Gesellschaft fur Multilinguale System). T1 is a good example of an MT system that has been progressively re-engineered for smaller computers and that has been able to adapt its original model to different language-pairs. The third system is the Easy Translator, marketed by Transcend. A relatively new system ( it appeared in 1997),Easy Translator is designed to produce fast draft translations of small quantities of texts, in particular web pages and the contents of the windows clipboard.
The subject of the first source text ( Text 1 ) is medical care in Germany; Text 2 deals with the introduction of chess theory in German comprehensive schools; Text 3 is a business letter. So far none of the texts has been pre-edited for MT and makes no particular concessions to MT in terms of vocabulary and syntax. While Text 1 contains sentences whose syntax might be expected to give an MT system some difficulty, Text 2 is notable for its use of compound terms; Text 3 contains structures and lexis that are fairly typical of German business letters.
Assessing MT output
What is evident from this output is that MT quality varies quite markedly from system to system: the translations show significant differences in both lexis and syntactic structure. Before considering this issue in more detail, however, there are certain things that we should note about the comparative performance and facilities of our systems, not all of which are evident from just looking at the translations. Speed, for instance, is a case in point. By far the slowest system was T 1, which took several seconds longer than the other programs to complete the translations:We would expect the low speed to be offset by markedly superior output quality ( the reader may judge for himself whether this is indeed the case). Secondly, T 1 allows the user to specify a subject area for an input text: thus, if we tell the system that text 1 is in the domain of medical science, the output changes slightly: the German Behandlungen is rendered as ' cares' instead of 'processing'. A final feature of T 1 is that it marks words in the target output in various useful ways for possible post-editing. Thus the word 'fuzzy' is highlighted on screen to indicate that there are alternative translations available (viz.' indistinct',' unclear' , 'vague');these can be called up on screen and any item from the list pasted in. T 1 is also able to break down single-word German noun compounds into their constituents and offer word-for word translations for them; these are highlighted as potential multi-word terms for post-editing (or entry into the dictionary). In this respect T 1 reflects its origins in the METAL system, which was developed specifically for translation between English and German. In our example potential terms that have been recognized include: 'state secret' ( for Standesgeheimnis ), ' taboo topic' (Tabuthema) , 'school fold' (Schulfach) , and 'central stage' (Mittelstufe). Obviously the automatic translation is not always entirely successful, but the advantages over other systems are that a translation is at least attempted (helpful to a post-editor unfamiliar with the source language) and that problems are clearly flagged up for further attention. We should also note that, in addition to the dictionaries accessed by the translation programs, T 1 provides on-line bilingual dictionaries that the user can call up in order to review entries whilst post-editing.
Any potential user of MT will be concerned that the output does not fall below a minumum level of comprehensibility. The problem, however, lies in measuring or quantifying comprehensibility. One approach is to assign levels of comprehensibility to each output text, preferably by inviting other people to read the text and state what they have understood the content to be ( e.g. by writing a summary or answering multiple choice questions). There are at least two drawbacks to this approach. One is that evaluators tend to be subjective in their assessments/ Another is that systems often fail to return consistent levels of performance relative to each other. Below are translations of individual sentences (taken from the above texts) ranked in order of comprehensibility (the reader is, of course, at liberty to disagree with the suggested ranking):
German: Offiziell erkundigen kann er sich in Deutschland nicht.
ET : 'officially he cannot inquire in Germany.'
PT : Officially he/it inquire can not itself in Germany.'
T 1: ' Official can make he itself in Germany not.'

German: Auch wer unterrichten soll, steht nach Angaben des Kultus-ministeriums noch nicht fest.
PT: 'Also who should instruct, not yet is certain after statements of the Ministry of Education and the Arts.'
ET: Also who instruct should, stands according to the Kultusministeriums not yet firmly.'
T 1: Too who should inform is not yet clear according to information of the Ministry of Education,'
The point is that the rankings differ for the same systems, although evaluations of large volumes of output might reveal more consistent patterns in translation quality that do not emerge from such a tiny sample as the one here.
A vital factor in assessing any MT system is the degree to which quality of output, however measured, can be improved through dictionary updating and pre-editing of input test, All systems suppliers stress the need to simplify the syntax of input text and avoid ambiguities. Pre-edited versions of Text 1 and Text 2, for instance, might look as shown in the box opposite.
Pre-editing here has been limited to reducing subordination, splitting up sentences and clarifying compounds nouns (e.g. Eroffnungsspielvarianten und Endspielvarianten) . Effective preediting (i. e. of a kind that leads to the best MT output) needs skill and experience. IN Text 2 above, for instance, the editor must decide whether to rewrite the phrase laut Vorschrift (e.g. perhaps as Die Vorschriften verlangen, da¦Â..thereby introducing a subordinate clause and causing problem elsewhere in the output), to delete it altogether, or to enter it as a term or phrase in the dictionary. In this case we have decided to store the phrase in the dictionary because it is a standard adverbial phrase and could occur in a variety of text types. Below is the output from the Power Translator after two of the source texts have been pre-edited and the system's dictionary updated.
As a rule pre-editing that concentrates on simplifying sentence structures results in the most consistent improvement in MT output Customising a dictionary, for instance, can be a waste of time unless the subject domain is very restricted and the user is certain that alternative translations for certain items will not usually be required. It is also the case that in most MT packages the grammar rules are deeply embedded within the system. As a consequence,the user has little or no access to the syntactic processing rules; individual lexical items are the only things that can be added or modified. As mentioned earlier, many suppliers provide detailed information on the structures to avoid in the input text; such help goes a long way to training users in writing a canonical language that produces the best results for MT. At the same time it is wrong to assume that technical text operate with a vocabulary and structures that can entirely eliminate ambiguity and other features of non-technical language.
Derek Lewis is editor of the Machine Translation Review, published by the Natural Language Translation Specialist Subgroup of the British Company Society. He is also Director of the Foreign Language Centre in the School of Modern Language at the University of Exeter.
References
.Brougham, M.(1977)'Publishing Product Information in a Global Market Place', Translating and the Computer 19, Papers from the ASLIB Conference held on 13 and 14 November 1997,London.
.Flanagan, M. (1997)'Online Translation: MT's New Frontier', Translating and the Computer 19 papers from the ASLIB Conference held on 13 and 14 November 1997,London.