On Thu, Apr 8, 2010 at 7:34 PM, Q <overlordq(a)gmail.com> wrote:
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256
On 4/8/2010 4:28 PM, Anthony wrote:
I'd like to add that the md5 of the
*uncompressed* file is
cd4eee6d3d745ce716db2931c160ee35 . That's what I got from both the
uncompressed 7z and the uncompressed bz2. They matched, whew.
Uncompressing and md5ing the bz2 took well over a week. Uncompressing
and
md5ing the 7z took less than a day.
Dumping and parsing large XML files came up at work today which made me
think of this, how big exactly is the uncompressed file?
5.34 terabytes was the figure I got.
"7z l enwiki-20100130-pages-meta-history.xml.7z" gives an uncompressed size
of 5873134833455. I assume that's bytes, and googling "5873134833455 bytes
to terabytes" gives me "5.34158501 terabytes".