Hi.
I am an MSc student in administration in Montreal, Canada, and am doing my
master's thesis on Wikipedia.
I am having trouble importing the "old" table of Wikipedia.
I first downloaded all the dump files (exported on April 21st) and then
concatenated them (cat ...). The concatenation of the dump files (English
version of Wikipedia) produced a file of around 31 gigabytes.
Apparently the compression format has changed, because bzip2 does not
recognize the resulting file as a bz2 file, but gunzip is able to uncompress
it (once the compressed file is named old_table.sql.gz).
Has the compression format been officially changed?
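For reference, here is a minimal sketch of how the format of the
concatenated file could be checked from its first bytes, independently of
its file name. A gzip stream begins with the bytes 1f 8b and a bzip2 stream
begins with the characters "BZh". The function name detect_compression is
only illustrative.

```python
def detect_compression(path):
    """Guess the compression format of a file from its leading magic bytes."""
    with open(path, "rb") as f:
        head = f.read(3)
    if head[:2] == b"\x1f\x8b":   # gzip magic number
        return "gzip"
    if head == b"BZh":            # bzip2 magic number
        return "bzip2"
    return "unknown"
```

Calling detect_compression("old_table.sql.gz") on my file would be expected
to return "gzip", given that gunzip accepts it.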
Moreover, the uncompressed file has a final size of only 34,201,462 KB,
which is not much bigger than the compressed file. Is that normal?
Nonetheless, the resulting SQL file seems to be readable, since it is
possible to import the 'old' table from it. But I don't know whether the
file is complete, or whether the old table that I got is missing any
records.
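One check I could run, sketched below under the assumption that the
concatenated file really is gzip data, is to read the whole stream and see
whether decompression finishes without an error. Python's gzip module reads
concatenated gzip members one after another, so a file built with cat is
handled. The function name gzip_stream_is_complete is hypothetical. This
only shows that the stream is well formed: it would not notice a dump part
that was left out of the concatenation, or a cut that falls exactly between
two parts.

```python
import gzip
import zlib

def gzip_stream_is_complete(path, chunk_size=1 << 20):
    """Return True if every gzip member in the file decompresses cleanly."""
    try:
        with gzip.open(path, "rb") as f:
            # Read and discard the data; only errors matter here.
            while f.read(chunk_size):
                pass
    except (EOFError, OSError, zlib.error):
        # EOFError: stream truncated; OSError: bad header or CRC mismatch;
        # zlib.error: corrupted compressed data.
        return False
    return True
```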
I have already successfully downloaded and installed the English version of
the Wikipedia database several times, but this time there is something I
don't get.
Could you please tell me what I am doing wrong, or whether there has been
any change in the procedure to follow?
Thank you.
Kevin Carillo