On 1/22/08, David Gerard <dgerard(a)gmail.com> wrote:
> On 22/01/2008, Steve Bennett <stevagewp(a)gmail.com> wrote:
>> Incidentally, does anyone have a readily available corpus of wikitext,
>> perhaps from Wikipedia? Something in the format of a bunch of text
>> files would be really convenient.
>
> The usual answer would be to get a dump for analysis in various
> languages. (Doesn't have to be the latest.)

AFAIK the dumps are in some format that has to be imported into MySQL
and then exported (or analysed in-DB), no? Hence my request for something
a little more convenient...
Steve
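
For what it's worth, the pages-articles dumps are distributed as
bzip2-compressed XML, so a MySQL import should not be strictly necessary
to get at the wikitext. Below is a minimal sketch, assuming Python and the
usual MediaWiki export layout (<page> elements holding a <title> and a
<revision>/<text>). The function names and the output file naming scheme
are made up for illustration, and for a full-history dump this keeps only
the last revision seen for each page.

```python
import bz2
import os
import re
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(stream):
    """Yield (title, wikitext) pairs from a MediaWiki XML export stream."""
    title, text = None, ""
    # iterparse reads incrementally, so the whole dump is never parsed at once.
    for _event, elem in ET.iterparse(stream, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            title = elem.text or ""
        elif name == "text":
            text = elem.text or ""
        elif name == "page":
            yield title, text
            title, text = None, ""
            elem.clear()  # drop the finished page's children to limit memory use


def dump_to_files(dump_path, out_dir):
    """Write each page's wikitext to its own UTF-8 text file; return the page count."""
    os.makedirs(out_dir, exist_ok=True)
    # Assumption: a '.bz2' suffix means bzip2-compressed, anything else is plain XML.
    opener = bz2.open if dump_path.endswith(".bz2") else open
    count = 0
    with opener(dump_path, "rb") as stream:
        for title, text in iter_pages(stream):
            # Hypothetical naming scheme: running number plus a sanitised title.
            safe = re.sub(r"[^\w.-]+", "_", title).strip("_") or "untitled"
            path = os.path.join(out_dir, f"{count:07d}_{safe[:80]}.txt")
            with open(path, "w", encoding="utf-8") as out:
                out.write(text)
            count += 1
    return count
```

Usage would be something like dump_to_files("pages-articles.xml.bz2",
"corpus"), where both the dump filename and the output directory are
placeholders for whatever you actually downloaded.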