Getting working inflection rules for even a single language is a major
task, and doing so for several hundred languages would be a
overwhelming task. I can't see how this can be implemented as part of
the Wikidata project within a reasonable time frame.
There is a few shortcuts that can be made, and it is possible to make
some generalized tools. For an open source alternative take a look at
Apertium (
http://en.wikipedia.org/wiki/Apertium). Usually it is only
the generation/disambiguation phase that is necessary, and this makes
the task somewhat simpler, but it is still a major undertaking.
Note that some of the basic tools already exist, we only need to
interface them to Mediawiki, but the tools needs definition files to
work (that is inflection rules for Northern Sami language for example,
or Norwegian bokmål and nynorsk, or Swedish) and it is those
definitions that is the major task.
John
On Thu, Oct 18, 2012 at 11:14 AM, Nikola Smolenski <smolensk(a)eunet.rs> wrote:
On 18/10/12 11:06, John Erling Blad wrote:
well-formed text automatically. One of the more common problems are
names that uses different inflection rules due to context and how they
are written. Such inflection rules are not part of the Wikidata
project and will probably be a major undertaking in itself.
Why do you think that inflection rules will not be a part of Wikidata? They
would be hugely needed on Wiktionary and there is no reason for Wikidata not
being able to contain them.
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l