The current enwiki database dump (http://download.wikimedia.org/enwiki/20081008/) has been crawling along since 10/15/2008.
I realize that dumps can appear stalled in their normal processing (http://meta.wikimedia.org/wiki/Data_dumps#Schedule), but in the recent past (as far as I know) they have not been stalled this long without there being something actually wrong. The completion date for "All pages with complete page edit history" (where it is currently) fluctuates within the latter half of 2009.
Is this purposeful? And is there anything I (or other community members) can do about it? I personally just need the pages-articles part. Would it be possible to dump up to that part on a different thread?
Thank you for your time.
Gabriel Weinberg
yegg@alum.mit.edu wrote in message news:1c624fe40901040620g1c69d070q9f830da33e84f725@mail.gmail.com...
The current enwiki database dump (http://download.wikimedia.org/enwiki/20081008/) has been crawling along since 10/15/2008.
...
Is this purposeful? And is there anything I (or other community members) can do about it? I personally just need the pages-articles part. Would it be possible to dump up to that part on a different thread?
That portion of the dump is already done, and available at http://download.wikimedia.org/enwiki/20081008/enwiki-20081008-pages-articles...
Russ
I realize that. I'm looking forward to the the next dump :)
I had been used to a dump of that part about every 2 months, and it's been about 3 now and the way it is headed it will be 12 before I see another!
On Mon, Jan 5, 2009 at 9:58 AM, Russell Blau russblau@hotmail.com wrote:
yegg@alum.mit.edu wrote in message news:1c624fe40901040620g1c69d070q9f830da33e84f725@mail.gmail.com...
The current enwiki database dump (http://download.wikimedia.org/enwiki/20081008/) has been crawling along since 10/15/2008.
...
Is this purposeful? And is there anything I (or other community members) can do about it? I personally just need the pages-articles part. Would it be possible to dump up to that part on a different thread?
That portion of the dump is already done, and available at http://download.wikimedia.org/enwiki/20081008/enwiki-20081008-pages-articles...
Russ
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
On 1/4/09 6:20 AM, yegg@alum.mit.edu wrote:
The current enwiki database dump (http://download.wikimedia.org/enwiki/20081008/) has been crawling along since 10/15/2008.
The current dump system is not sustainable on very large wikis and is being replaced. You'll hear about it when we have the new one in place. :)
-- brion
Understood--thank you. Any time-frame for when this might be launched?
On Mon, Jan 5, 2009 at 1:47 PM, Brion Vibber brion@wikimedia.org wrote:
On 1/4/09 6:20 AM, yegg@alum.mit.edu wrote:
The current enwiki database dump (http://download.wikimedia.org/enwiki/20081008/) has been crawling along since 10/15/2008.
The current dump system is not sustainable on very large wikis and is being replaced. You'll hear about it when we have the new one in place. :)
-- brion
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
wikitech-l@lists.wikimedia.org