Just to confirm, the enwiki-20110901-pages-articles.xml.bz2 file is the concatenation of all those sub-files, right?<div><br></div><div>Would it be possible to restore the filename of this file to enwiki-latest-pages-articles.xml.bz2 for consistency with all the other wikipedias?</div>
<div><br></div><div>For example, the latest full dump in <a href="http://dumps.wikimedia.org/dewiki/latest/">http://dumps.wikimedia.org/dewiki/latest/</a> </div><div>is called dewiki-latest-pages-articles.xml.bz2 and it's the same in all other languages.</div>
<div><br></div><div>Thanks,</div><div>Eric</div><div><br><div class="gmail_quote">On Thu, Sep 8, 2011 at 12:22 AM, Ariel T. Glenn <span dir="ltr"><<a href="mailto:ariel@wikimedia.org">ariel@wikimedia.org</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">The September en wikipedia dumps are done. Folks who use them, note<br>
that this is the first run with the generation of a pile of smaller<br>
files. The naming scheme as you will have noticed has an additional<br>
string: -p<first-page-id-contained>p<last-pageid-contained> Expect the<br>
specific groupings to change from one run to the next; it's time-based,<br>
rather than based on the number of pages or revisions.<br>
<br>
You may notice a gap of a few numbers between files; this would indicate<br>
that those pages were deleted and not included in the dump at all.<br>
<br>
Since there were no issues with the network, database servers, broken MW<br>
deployments etc., the run finished without any need for restarts of a<br>
particular step; this is probably the fastest we'll ever see it run, in<br>
a little under 8 days.<br>
<br>
Any issues, please let me know. I expect people will need a script to<br>
download these files easily; didn't someone on this list have a tool in<br>
the works?<br>
<br>
Ariel<br>
<br>
<br>
_______________________________________________<br>
Xmldatadumps-l mailing list<br>
<a href="mailto:Xmldatadumps-l@lists.wikimedia.org">Xmldatadumps-l@lists.wikimedia.org</a><br>
<a href="https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l" target="_blank">https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l</a><br>
</blockquote></div><br></div>