On 2012-06-12 23:19, Steve Bennett wrote:
I've been tasked with setting up a local copy
of the English
Wikipedia for researchers - sort of like another Toolserver. I'm not
having much luck,
Have your researchers learn Icelandic. Importing the
small Icelandic Wikipedia is fast. They can test their
theories and see if their hypotheses make any sense.
When they've done their research on Icelandic, have
them learn Danish, then Norwegian, Swedish, Dutch,
before going to German and finally English. There's
a fine spiral of language sizes around the North Sea.
It's when they are frustrated waiting for an analysis
taking 15 minutes for Norwegian, that they will find
smarter algorithms that will enable them to take on
the larger languages.
--
Lars Aronsson (lars(a)aronsson.se)
Aronsson Datateknik -
http://aronsson.se