As we look more closely at some of the XML dumps being generating, we are noticing bad revisions hanging around from various conversion bugs. In particular, nlwiktionary has around 1500 revisions from the early part of 2004 that are no longer recoverable.
Does anyone have (or know someone who might have) a full history dump of nl wiktionary from October 2004 or earlier (but no earlier than June)? If so we could pull the data from there.
Thanks,
Ariel Glenn Software Developer / Systems Engineer Wikimedia Foundation
I checked my old archives but seem to have dumped most of those after transferring them to space in the office / other spare servers. Tomasz, I think we stashed a bunch of those in one place last year, do you have them handy?
I'm not sure how much is available for the 2004 era, but if I had any they should be with that batch.
-- brion
On Wednesday, June 16, 2010, Ariel T. Glenn ariel@wikimedia.org wrote:
As we look more closely at some of the XML dumps being generating, we are noticing bad revisions hanging around from various conversion bugs. In particular, nlwiktionary has around 1500 revisions from the early part of 2004 that are no longer recoverable.
Does anyone have (or know someone who might have) a full history dump of nl wiktionary from October 2004 or earlier (but no earlier than June)? If so we could pull the data from there.
Thanks,
Ariel Glenn Software Developer / Systems Engineer Wikimedia Foundation
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
We had old old ones on a drive that just goes click click now due to head and it's copies left us when one of our data center storage boxes raid setup decided to die.
Thankfully these days we have active copies of our snapshots in multiple places but the old ones are going to be tough to track down.
--tomasz
On Jun 16, 2010, at 16:22, Brion Vibber brion@pobox.com wrote:
I checked my old archives but seem to have dumped most of those after transferring them to space in the office / other spare servers. Tomasz, I think we stashed a bunch of those in one place last year, do you have them handy?
I'm not sure how much is available for the 2004 era, but if I had any they should be with that batch.
-- brion
On Wednesday, June 16, 2010, Ariel T. Glenn ariel@wikimedia.org wrote:
As we look more closely at some of the XML dumps being generating, we are noticing bad revisions hanging around from various conversion bugs. In particular, nlwiktionary has around 1500 revisions from the early part of 2004 that are no longer recoverable.
Does anyone have (or know someone who might have) a full history dump of nl wiktionary from October 2004 or earlier (but no earlier than June)? If so we could pull the data from there.
Thanks,
Ariel Glenn Software Developer / Systems Engineer Wikimedia Foundation
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Дана Thursday 17 June 2010 02:04:40 Tomasz Finc написа:
We had old old ones on a drive that just goes click click now due to head and it's copies left us when one of our data center storage boxes raid setup decided to die.
If it just goes click click the heads have probably not fallen and the data could probably be salvaged relatively cheaply.
"Brion Vibber" brion@pobox.com wrote in message news:AANLkTinRKBw-9PLSCsW-QrQOnAiAa5TzUpZWlIxTrJtE@mail.gmail.com...
Tomasz, I think we stashed a bunch of those in one place last year, do you have them handy?
Am I the only one who interpreted this as them being "stashed" in the left-luggage rack at Grand Central station, or buried in a field in Armagh?? Very Jason Bourne... :-D
--HM
On 06/22/2010 02:10 AM, Happy-melon wrote:
"Brion Vibber"brion@pobox.com wrote in message news:AANLkTinRKBw-9PLSCsW-QrQOnAiAa5TzUpZWlIxTrJtE@mail.gmail.com...
Tomasz, I think we stashed a bunch of those in one place last year, do you have them handy?
Am I the only one who interpreted this as them being "stashed" in the left-luggage rack at Grand Central station, or buried in a field in Armagh?? Very Jason Bourne... :-D
Well, that's one way of doing off-site backups... :)
wikitech-l@lists.wikimedia.org