...another dump. August is done, July 7z are done, the last of the May history and 7z are done. That brings us up to date.
I expect to test new code with production of many small files, as previously discussed on this list, starting within the next few days. This test will be for en wikipedia only, as that's the dump that's hardest to run to completion. The results might be a perfectly good dump, or not. Even if they are, I do not plan to try running en wikipedia dumps twice a month, so don't get your hopes up. (Who would process all that data every two weeks anyways?)
Ariel
On Tue, Aug 16, 2011 at 3:00 AM, Ariel T. Glenn ariel@wikimedia.org wrote:
...another dump. August is done, July 7z are done, the last of the May history and 7z are done. That brings us up to date.
\o/
Great to see we're back on track. :-)
We talked a while ago about doing more to promote mirroring of the dumps, and you said you'd been thinking about an approach to that. Would now be a good time to start making a push for more mirrors?
Thanks, Erik
Στις 17-08-2011, ημέρα Τετ, και ώρα 16:54 -0700, ο/η Erik Moeller έγραψε:
On Tue, Aug 16, 2011 at 3:00 AM, Ariel T. Glenn ariel@wikimedia.org wrote:
...another dump. August is done, July 7z are done, the last of the May history and 7z are done. That brings us up to date.
\o/
Great to see we're back on track. :-)
We talked a while ago about doing more to promote mirroring of the dumps, and you said you'd been thinking about an approach to that. Would now be a good time to start making a push for more mirrors?
Heh heh, you read my mind. The last few days I've been in touch with a possible site for mirroring. It's going to take a little time to get things set up, as we need to publish a list of which files to mirror, since no one wants to take on 25T or whatever it is now. Rsync is the tool of choice for sites wanting to do this, and it would walk through the whole tree :-D
Since it looks like we might be stable after the OS/software upgrades finally, I was planning to go back to fighting with google group accounts and buckets too,
So, anyone else reading this list, time to dig through your rolodexes again...
Ariel
What about the Library of Congress? Any news about that old contact attempt?
I heard about Internet Archive downloading the dumps several times every year, but not official confirmation.
2011/8/18 Erik Moeller erik@wikimedia.org
On Tue, Aug 16, 2011 at 3:00 AM, Ariel T. Glenn ariel@wikimedia.org wrote:
...another dump. August is done, July 7z are done, the last of the May history and 7z are done. That brings us up to date.
\o/
Great to see we're back on track. :-)
We talked a while ago about doing more to promote mirroring of the dumps, and you said you'd been thinking about an approach to that. Would now be a good time to start making a push for more mirrors?
Thanks, Erik
-- Erik Möller Deputy Director, Wikimedia Foundation
Support Free Knowledge: http://wikimediafoundation.org/wiki/Donate
Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
xmldatadumps-l@lists.wikimedia.org