Kevin Carillo wrote:
Hi.
I am an MSc student in administration in Montreal, Canada, and am doing my
master's thesis on Wikipedia.
I am having trouble importing the "old" table of Wikipedia.
I first downloaded all the dump files (exported on April 21st) and then
concatenated them (cat ...). The concatenation of the dump files (English
version of Wikipedia) produced a file of around 31 gigabytes.
Apparently the compression format has changed: bzip2 does not recognize
the resulting file as a bz2 file, but gunzip is able to uncompress it
(after naming the compressed file old_table.sql.gz).
Has the compression format been officially changed?
If you hover your mouse over the "old" link, you'll see that it is
indeed supposed to be a gz, not a bzip2.
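If you'd rather check the file itself than trust its name, the first few bytes identify the format: gzip streams start with the bytes 1f 8b, and bzip2 streams start with "BZh". Here is a minimal sketch in Python; the function name detect_compression is just an illustration, not part of any Wikipedia tooling.

```python
# Leading "magic" bytes that identify each compression format.
GZIP_MAGIC = b"\x1f\x8b"
BZIP2_MAGIC = b"BZh"


def detect_compression(path):
    """Return 'gzip', 'bzip2', or 'unknown' based on the file's first bytes."""
    with open(path, "rb") as f:
        head = f.read(3)
    if head.startswith(GZIP_MAGIC):
        return "gzip"
    if head.startswith(BZIP2_MAGIC):
        return "bzip2"
    return "unknown"
```

Calling detect_compression("old_table.sql.gz") (the name used above) should report "gzip" if the dump really is a gzip file. Only the start of the file is inspected, so this says nothing about whether the rest of the file is intact.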
If you didn't get *any* error message while uncompressing *and* while
importing into your database, then the result is very unlikely to have
any problems such as omissions or data corruption.
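The reason is that every gzip member ends with a CRC-32 and a length for its uncompressed data, so a damaged or truncated file normally produces an error when it is read. If you want to re-check without uncompressing to disk, here is a rough sketch in Python (the function name gzip_is_intact is my own invention). It reads the whole stream, including files made by concatenating several gzip files, and reports whether any error came up.

```python
import gzip
import zlib


def gzip_is_intact(path, chunk_size=1024 * 1024):
    """Read the whole gzip stream; return (ok, uncompressed_byte_count)."""
    total = 0
    try:
        with gzip.open(path, "rb") as f:
            while True:
                chunk = f.read(chunk_size)
                if not chunk:
                    break
                total += len(chunk)
    except (OSError, EOFError, zlib.error):
        # Bad header or CRC (OSError), truncated file (EOFError),
        # or corrupt compressed data (zlib.error).
        return False, total
    return True, total
```

This is roughly what "gzip -t" does on the command line. It only checks the compressed file; it does not tell you whether the database import itself went through cleanly.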
Timwi