Hi all,
FYI: the solution proposed below worked just fine. Thanks Dario! :)
Cheers,
Jérôme.
2013/10/10 Dario Taraborelli <dtaraborelli(a)wikimedia.org>
Hi Jerôme,
most of the actions you refer to are not stored as edits by mediawiki.
They can be accessed via the logging table [1] (with log_type 'delete' or
'block'), which is replicated on tool labs (you can apply for a tool labs
account if you don't have one [2]).
HTH
Dario
[1]
https://www.mediawiki.org/wiki/Manual:Logging_table
[2]
https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Help
On Oct 10, 2013, at 10:02 AM, "Klein,Max" <kleinm(a)oclc.org> wrote:
Hello Jerome,
I'm not sure this is the best way, but pywikipediabot [1] has a library
called pagegenerators.py and there is a function *def
UserContributionsGenerator(username)* (around line 706). That would allow
you to iterate through theses user names, and I bet there will be a special
marking for deletions/undeletions. If not, worst comes to worse you can use
a regular expression for those words.
[1]
https://meta.wikimedia.org/wiki/pywikipediabot
When you use have a pywikibot-hammer everything looks like a
pywikibot-nail!
Maximilian Klein
Wikipedian in Residence, OCLC
+17074787023
------------------------------
*From:* wiki-research-l-bounces(a)lists.wikimedia.org <
wiki-research-l-bounces(a)lists.wikimedia.org> on behalf of Jérôme Hergueux
<jerome.hergueux(a)gmail.com>
*Sent:* Thursday, October 10, 2013 3:11 AM
*To:* wiki-research-l(a)lists.wikimedia.org
*Subject:* [Wiki-research-l] How to collect all the admin-specific edits
for a subset of Wp admins
Dear all,
I am starting this thread in the hope that some of the great Wiki
researchers on this list could advise me on a data collection problem.
Here is the question: for a each of 120 Wikipedia admins (for whom I have
the usernames and unique numeric ids), I would like to reliably count the
number of times they (i) deleted a page (ii) undeleted (i.e. restored) a
page (iii) protected a page (iv) blocked a user and (v) unblocked a user.
Those types of edits all correspond to a specific "action" in the
Wikipedia API documentation page (
http://en.wikipedia.org/w/api.php):
action=delete,action=undelete, action=protect, action=block and
action=unblock.
I don't know, however, what would be the best strategy to go about
collecting those edits. Does anyone have an idea about which data
collection strategy I should adopt in this case? Is there a way to query
the Wikipedia API directly, or should I look for some specific markers in
the edit summaries?
I would be very grateful for any advice of feedback!
Thanks much for your attention and time. :)
Best,
Jérôme.
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l