[lbo-talk] "Machine translation" (was: Protection of Chinese language urged)
Jon Johanning
jjohanning at igc.org
Sun Jun 6 09:55:10 PDT 2004
On Saturday, June 5, 2004, at 04:34 PM, dano wrote:
> Machine translation - getting better, still expensive.
> about: <http://www.languageweaver.com/how_it_works.php>
> Chinese:
> <http://www.languageweaver.com/language-translation-for-chinese.php>
>
> I have not seen their Chinese translation (subject of the thread) but
> have seen them translate Arabic (Al Jazeera) on the fly with expert
> Arabic speakers who confirmed that the translation is high quality.
> The Chinese may be as good or it may not.
This web site, of course, like many other similar ones, is an
advertisement for a commercial company, which puts the best possible
face on the products it has for sale.
The quality of "machine translation" software is difficult to evaluate
in a simple manner, since it depends on the purposes the user of the
software has in mind, and what the user is willing to accept. The
statistical method used by this company's software is one of the more
adaptable kinds of MT systems, but no current MT system can replace
human translators for all purposes, and it is very doubtful that MT
systems ever will, though this is a hotly debated question in the
translation and computational linguistics communities. Basically, in my
opinion, as well as that of many other translators, MT will only be
"high-quality" when robots which can essentially think and operate in
the world like human beings (e.g., Commander Data on Star Trek) are
made, which obviously will take a while.
Where communications of a political nature are concerned, especially,
it is very unlikely that "machine-translated" texts will be considered
satisfactory. The most satisfactory employment of MT currently is in
fields where the vocabulary is limited, the source texts are frequently
written on purpose in a very simplified and controlled manner (in
syntax, etc.) to facilitate the MT processing, and the resulting texts
in the target language are extensively edited to make them
comprehensible. Thus, MT is normally used where very large volumes of
very similar texts, which can be effectively analyzed statistically,
must be churned out: technical manuals, for example. Note that Language
Weaver states, concerning its Chinese module:
"Especially in cases of domain specific text where there is repetition
in the text and a limited universe of phrasing, such as technical
documentation handbooks, Language Weaver’s SMTS technology can save
customers considerable money and time through automation of the
translation process, as well as enable cost effective, real-time
translations of previously untranslated text, like Web based customer
and product support information."
These are very typical MT applications, and very different from
political contexts, in which the terms used vary rather wildly in
meaning, disagreements over their meanings are very frequent, and
misunderstandings consequently common (see for example this list, which
is conducted in only one language). The satisfactoriness of MT rises in
proportion as the language becomes more stereotyped and rigid, which is
not conducive to good political activity.
Jon Johanning // jjohanning at igc.org
__________________________________
When I was a little boy, I had but a little wit,
'Tis a long time ago, and I have no more yet;
Nor ever ever shall, until that I die,
For the longer I live the more fool am I.
-- Wit and Mirth, an Antidote against Melancholy (1684)
More information about the lbo-talk
mailing list