Platonides <Platonides(a)gmail.com> wrote:
I did a proposal on that line last month
http://thread.gmane.org/gmane.science.linguistics.wikipedia.technical/34547
You're also welcome to comment on it ;)
Although the main point seems to be whether the file compression is good
enough... The acceptable level of compression varies with things like
the WMF disk space available for dumps and the need for a better dump
system.
Well, actually, if you read the previous threads on this list, you will see that this has
been a recurring topic over the last two months. AFAIK, it has also got the attention of the
board of trustees, since it is no joke. It has now been more than a year since the last
time we had a complete and valid stub-meta-history for enwiki.
Brion has also heard these complaints, so please don't bother him about it again.
He currently has no time to fix it properly. He also offered some solutions on his blog
(please read the previous threads).
Other big editions (dewiki, frwiki, plwiki, ...) also have serious problems with
complete history dumps. I think the whole problem arose because the DB server was under
too much load and the dump script lost its connection to the MySQL backend.
We all agree that:
1. We would all like this problem to be fixed soon. Many of us researchers are stalled
right now, waiting for fresh data.
2. The admins do not have enough time to fix it, because they have more important
issues to attend to, which is normal in a project as big as Wikipedia (let alone the
rest of the Wikimedia Foundation projects).
In short: in my humble opinion, we should think about setting up:
1. One or more mirrors to duplicate the stub-meta-history data and thus offer
alternative data repositories for research on Wikipedia and related projects. We at the
URJC offer our facilities to the Wikimedia Foundation (and I think other people in this
thread could do the same).
2. An intermediate board of researchers that would serve as a central point of contact
(though mirrored in practice) for requesting research data about Wikipedia, and that would
channel requests to the Wikimedia Foundation tech staff.
This way, everyone could focus on their own tasks, and we would not slow down
interesting research on Wikipedia.
Regards.
Felipe