On 1/22/08, David Gerard dgerard@gmail.com wrote:
> On 22/01/2008, Steve Bennett stevagewp@gmail.com wrote:
> > Incidentally, does anyone have a readily available corpus of wikitext, perhaps from Wikipedia? Something in the format of a bunch of text files would be really convenient.
> The usual answer would be to get a dump for analysis in various languages. (Doesn't have to be the latest.)
AFAIK the dumps are in some format that has to be imported into MySQL and then exported (or analysed in-DB), no? Hence my request for something a little more convenient...
Steve