<br><br><div class="gmail_quote">On Wed, Mar 10, 2010 at 10:43 PM, Tomasz Finc <span dir="ltr"><<a href="mailto:tfinc@wikimedia.org">tfinc@wikimedia.org</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
Brian J Mingus wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div><div></div><div class="h5">
<br>
On Wed, Mar 10, 2010 at 8:54 PM, Tomasz Finc &lt;<a href="mailto:tfinc@wikimedia.org" target="_blank">tfinc@wikimedia.org</a>&gt; wrote:<br>
<br>
Yup, that's the one. If you have a fast upload pipe then I'm more than<br>
happy to set up space for it. Otherwise it should be arriving in our<br>
snail mail after a couple of days.<br>
<br>
-tomasz<br>
<br>
<br>
Anyone may download the file from me here:<br>
<br>
<a href="http://grey.colorado.edu/enwiki-20080103-pages-meta-history.xml.7z" target="_blank">http://grey.colorado.edu/enwiki-20080103-pages-meta-history.xml.7z</a><br>
<br>
The md5sum is:<br>
<br>
20a201afc05a4e5f2f6c3b9b7afa225c enwiki-20080103-pages-meta-history.xml.7z<br>
<br>
The file size is:<br>
<br>
18522193111 (~18 gigabytes)<br>
<br>
I'm sure you will find my pipe fat enough. ;-)<br>
<br>
<br></div></div>
------------------------------------------------------------------------<div class="im"><br>
<br>
_______________________________________________<br>
Xmldatadumps-admin-l mailing list<br>
<a href="mailto:Xmldatadumps-admin-l@lists.wikimedia.org" target="_blank">Xmldatadumps-admin-l@lists.wikimedia.org</a><br>
<a href="https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-admin-l" target="_blank">https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-admin-l</a><br>
</div></blockquote>
<br>
That seems way too tiny to be the real thing.<br><font color="#888888">
<br>
--tomasz<br>
</font></blockquote></div><br><div>7zip has a very impressive compression ratio. From <a href="http://download.wikimedia.org">download.wikimedia.org</a>:</div><div><br></div><div><span class="Apple-style-span" style="font-family: Times; font-size: medium; "><ul style="margin-top: 4px; margin-bottom: 8px; ">
<li class="detail" style="background-color: white; list-style-type: none; font-weight: normal; font-style: italic; ">These dumps can be *very* large, uncompressing up to 100 times the archive download size. Suitable for archival and statistical use, most mirror sites won't want or need this.</li>
<div><br></div></ul></span></div><div>That notice has not changed since I downloaded this file, so the uncompressed size could be well over a terabyte. I'm not sure how long it will take to unpack, but I have just started it. I wonder what drives your intuition?</div>
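<div><br></div><div>For anyone who wants to check the arithmetic, here is a rough sketch in Python. It multiplies the archive size quoted above by the "up to 100 times" figure from the notice, so the result is only an upper bound, not a measurement; the real ratio for this dump is unknown until the unpack finishes. The function names and the chunk size are my own choices for illustration, and the checksum helper simply recomputes the MD5 so it can be compared with the sum posted above.</div>

```python
import hashlib

ARCHIVE_BYTES = 18_522_193_111  # archive size quoted in this thread
MAX_RATIO = 100                 # "up to 100 times", per the download.wikimedia.org notice


def estimate_uncompressed_bytes(archive_bytes: int, ratio: int) -> int:
    """Upper-bound estimate: archive size times an assumed expansion ratio."""
    return archive_bytes * ratio


def md5_of_file(path: str, chunk_size: int = 1024 * 1024) -> str:
    """Hex MD5 digest of a file, read in chunks so the whole archive never sits in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # read() returns b"" at end of file, which stops the iteration
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    upper = estimate_uncompressed_bytes(ARCHIVE_BYTES, MAX_RATIO)
    print(f"upper bound: {upper} bytes (~{upper / 1e12:.2f} TB)")
```

<div>To verify a download, call md5_of_file on the local copy of enwiki-20080103-pages-meta-history.xml.7z and compare the result with the md5sum posted above.</div>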