Steve Bennett wrote:
On 1/23/08, Carl Beckhorn wrote:
The wikitext dumps are in XML format and can be parsed pretty easily as if they were plain text files.
Cool. Looks like the current dump is 3 Gb though, is there a subset available?
Steve
1) Choose the current flavout to avoid having all history. 2) Get the dump of a smaller wikipedia.