Hi.
I am a MSc. student in admistration in Montreal, Canada, and am doing my master's thesis on Wikipedia.
I am having troubles importing the table "old" of wikipedia.
I have first downloaded all the dump files (exported on April 21st) and then concatenated them (cat ...).The concatenation of the dump files (english version of Wikipedia) has ended up with a file of around 31 gigabytes Apparently, the compression format has changed for bzip2 does not recognize the resulting file as a bz2 one but gunzip is able to uncompress the file (by naming the compressed file old_table.sql.gz).
Has the compression format been officially changed ?
Moreover, the uncompressed file has a final size of only 34,201,462 KB which is not much bigger than the compressed file. Is that normal?
Nonetheless, the resulting sql file seems to be readable for it is possible to import the 'old table' from it. But I don't know whether the file is complete or not, and whether the old table that I got, will not miss any record.
I successfully downloaded & installed several times already the english version of the wikipedia database; but this time, there is something I don't get. Could you please indicate me what is wrong in what I am doing, or whether there has been any change in the procedures to follow?
Thank you.
Kevin Carillo