[Xmldatadumps-l] another month, another dump. ho hum :-P
Jamie Morken
jmorken at shaw.ca
Thu Sep 8 20:49:29 UTC 2011
----- Original Message -----
From: "Ariel T. Glenn" <ariel at wikimedia.org>
Date: Thursday, September 8, 2011 12:22 am
Subject: [Xmldatadumps-l] another month, another dump. ho hum :-P
To: xmldatadumps-l at lists.wikimedia.org
> The September en wikipedia dumps are done. Folks who use
> them, note
> that this is the first run with the generation of a pile of smaller
> files. The naming scheme as you will have noticed has an
> additionalstring: -p<first-page-id-contained>p<last-pageid-
> contained> Expect the
> specific groupings to change from one run to the next; it's time-
> based,rather than based on the number of pages or revisions.
>
> You may notice a gap of a few numbers between files; this would
> indicatethat those pages were deleted and not included in the
> dump at all.
>
> Since there were no issues with the network, database servers,
> broken MW
> deployments etc., the run finished without any need for restarts
> of a
> particular step; this is probably the fastest we'll ever see it
> run, in
> a little under 8 days.
>
> Any issues, please let me know. I expect people will need
> a script to
> download these files easily; didn't someone on this list have a
> tool in
> the works?
Hi Ariel,
This download addon for firefox works quite well, and is cross-platform:
http://en.wikipedia.org/wiki/DownThemAll!
https://addons.mozilla.org/en-US/firefox/addon/downthemall/
http://www.downthemall.net/
cheers,
Jamie
>
> Ariel
>
>
> _______________________________________________
> Xmldatadumps-l mailing list
> Xmldatadumps-l at lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.wikimedia.org/pipermail/xmldatadumps-l/attachments/20110908/6e176d42/attachment.htm
More information about the Xmldatadumps-l
mailing list