Platonides Platonides@gmail.com wrote:
I made a proposal along those lines last month: http://thread.gmane.org/gmane.science.linguistics.wikipedia.technical/34547 You're also welcome to comment on it ;) Although the main point seems to be whether the file compression is good enough... The acceptable compression level varies with things like the WMF disk space available for dumps and the need for a better dump system.
Well, actually, if you read the previous threads on this list, you will see that this has been a recurring topic over the last two months. AFAIK, it has also got the attention of the board of trustees, as it is no joke. It has now been more than a year since we last had a complete and valid stub-meta-history for enwiki.
Brion has also heard these complaints, so please don't bother him about this again. He currently has no time to fix it properly. He has also offered some solutions on his blog (read the previous threads, please).
Other big editions (dewiki, frwiki, plwiki...) also have serious problems with complete history dumps. I think the whole problem arose because the DB server was under too much load and the dump script lost its connection to the MySQL backend.
We all agree that:
1. We would all like this problem to be fixed soon. Many of us researchers are stalled right now, waiting for fresh data.
2. The admins do not have enough time to fix it, because they have more important issues to attend to, which is normal in a project as big as Wikipedia (let alone the rest of the Wikimedia Foundation projects).
In short: in my humble opinion we should think about setting up:
1. One or several mirrors to duplicate the stub-meta-history data and thus offer alternative data repositories for research on Wikipedia and related projects. We at the URJC offer our facilities to the Wikimedia Foundation (and I think other people in this thread could do the same).
2. An intermediate board of researchers that would serve as a central point of contact (though mirrored in practice) for requests for research data about Wikipedia, and that would centralize petitions to the Wikimedia Foundation tech-masters.
This way, everyone could focus on their own tasks, and we would not slow down interesting research on Wikipedia.
Regards.
Felipe
wikitech-l@lists.wikimedia.org