On 4/11/10 2:14 PM, Luca de Alfaro wrote:
Hi Mayo,
On Sun, Apr 11, 2010 at 9:06 AM, Fuster, Mayo <Mayo.Fuster@eui.eu mailto:Mayo.Fuster@eui.eu> wrote:
Hi everybody! How are you? I hope happy and fine. I am Mayo Fuster Morell doing a Phd research on Wikipedia governance at the European University Institute. I would appreciate if you could help me with three specific doubts that I have on Wikipedia data. * Is there data or research results on number of users per article? Plus, what is the more frequent number of users per article? Or, what is the distribution of number of editors/article?
I was about to say that this can be derived easily from an analysis of the revisions database table, but then I noticed that no dump of this table in isolation is available from download.wikimedia.org... You can however access all this data by using the Wikipedia API (http://www.mediawiki.org/wiki/API) but it requires some programming.
Actually, if you go to:
http://download.wikipedia.org/%7B%7Bwiki%7D%7D/latest/%7B%7Bwiki%7D%7D-lates...
and replace {{wiki}} with the wiki you're interested in (for instance, enwiki for english wikipedia) you'll basically get the revision table without the actual data of the text, it's a little bit smaller and more manageable that way. You'd have to do some analysis on it, but this could at least get you this information.