[Foundation-l] Free translation memory
Nikola Smolenski
smolensk at eunet.rs
Fri Jul 30 21:22:00 UTC 2010
Дана Thursday 29 July 2010 10:38:20 Samuel Klein написа:
> There is definitely a "free TM" project waiting to happen. It would
> be nice to see translatewiki [for instance] incorporate such a tool,
> but it may be a nontrivial amount of work.
At Project Rastko for years now there is the idea of building something called
Global Translation Project, where volunteers could collaboratively translate
texts in a manner somewhat similar to Distributed Proofreaders.
To give some detail: the idea is to first parse the original text with a
rule-based machine translation engine (of course this should be free software
with free dictionary). The basic problem that these engines have is that they
are unable to resolve ambiguities in the text (a classic example is
sentence "Time flies like an arrow": does it means that time is flying as
fast as an arrow or that there exist some insects called time flies (like
there are fruit flies) which like some arrow?). This often ends in a
mistranslation.
The crux of the idea is that it would be humans who resolve ambiguities in
this step. For example, these two possible meanings of the sentence would in
another language be translated to two completely different sentences. A human
could then simply pick the correct one. After several people have done this
for several independent languages, and their translations agree, the system
would know what is the correct parsing of the original text. Then this
parsing could be translated fully automatically to a large number of
languages, and it will be highly likely that the translations will be close
to correct.
An offshoot of this is a crowdsourced dictionary project in GalaxyZoo style.
Instead of doing battle with Wiktionary's or similar interface, volunteers
could build a dictionary by solving various simple tasks (say, pick a word's
gender, or verify that a word is correctly declined); if the supermajority of
the volunteers gives the same answer, the word enters the dictionary.
More information about the foundation-l
mailing list