Dear All,
Many thanks for all your work with Wikipedia. We use it daily for various tasks and in our research on Wikipedia. ("we" = a subset of a research group of the Hungarian Academy of Sciences)
I just managed to wget the history dump enwiki-20100130-pages-meta-history.xml.bz2 a week ago, and I would like to combine this with status information about users. Do you happen to know if the status (admin, steward, bot, etc.) of all users has been logged for enwiki? In what way would this be available for download and for non-profit basic research?
Thanks, Illes
You need to download user_groups.sql.gz. The left value of each pair is the <id> entry in <contributor>; the right one is the group that user belongs to.
Note, however, that you can't have a complete enwiki-20100130-pages-meta-history.xml.bz2 from a week ago, since that dump hasn't finished yet. Most likely, your file is incomplete (you can resume the download).
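A minimal sketch of reading those pairs, assuming the file is a standard MySQL export whose INSERT lines carry tuples of the form (user_id,'group'). The function names and the sample values are hypothetical and only illustrate the mapping from a contributor <id> to that user's groups.

```python
import gzip
import re
from collections import defaultdict

# Captures the first two fields of each tuple: (user_id,'group'
# Any further columns in the tuple are ignored.
PAIR_RE = re.compile(r"\((\d+),'([^']*)'")


def parse_user_groups(lines):
    """Map user id -> set of group names from SQL INSERT lines."""
    groups = defaultdict(set)
    for line in lines:
        if not line.startswith("INSERT INTO"):
            continue  # skip schema definitions and comments
        for user_id, group in PAIR_RE.findall(line):
            groups[int(user_id)].add(group)
    return dict(groups)


def load_user_groups(path):
    """Read a gzip-compressed user_groups dump from a caller-supplied path."""
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as fh:
        return parse_user_groups(fh)


# Made-up sample line in the assumed layout (not real dump data).
sample = ["INSERT INTO `user_groups` VALUES (12,'sysop'),(12,'bureaucrat'),(34,'bot');"]
user_groups = parse_user_groups(sample)
print(user_groups)
```

With the real file you would call load_user_groups on the downloaded user_groups.sql.gz and look up each <contributor> <id> from the history dump in the resulting dictionary. This table records group membership at dump time only, not how it changed over time.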
Platonides wrote:
You need to download user_groups.sql.gz. The left value of each pair is the <id> entry in <contributor>; the right one is the group that user belongs to.
Note, however, that you can't have a complete enwiki-20100130-pages-meta-history.xml.bz2 from a week ago, since that dump hasn't finished yet. Most likely, your file is incomplete (you can resume the download).
Indeed. It's not scheduled to finish until 2010-03-13 12:03:56 (UTC), and it will likely take a bit longer, as I'm seeing the revs/sec drop by about 0.01 every 5 minutes.
The deviation isn't much, but when we're talking about millions of revisions, that starts adding up.
Almost there ...
--tomasz
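As a rough illustration of why a small, steady drop in throughput pushes the finish time back, here is a sketch that assumes the rate falls linearly. If the rate at time t is r0 - d*t, the revisions processed by time T are r0*T - d*T^2/2, and solving that for the remaining count gives T. Only the drop of 0.01 revs/sec per 5 minutes comes from the message above; the starting rate and the remaining revision count are hypothetical placeholders.

```python
import math


def seconds_to_finish(remaining_revs, rate, drop_per_sec):
    """Time to process remaining_revs when the rate decays linearly.

    Solves rate*T - drop_per_sec*T**2/2 = remaining_revs for the smaller root.
    Returns None if the rate would reach zero before the work is done.
    """
    if drop_per_sec == 0:
        return remaining_revs / rate
    disc = rate * rate - 2.0 * drop_per_sec * remaining_revs
    if disc < 0:
        return None
    return (rate - math.sqrt(disc)) / drop_per_sec


# From the thread: about 0.01 revs/sec lost every 5 minutes.
DROP_PER_SEC = 0.01 / 300.0

# Hypothetical values, chosen only to make the example concrete.
REMAINING_REVS = 1_000_000
START_RATE = 20.0  # revs/sec

constant_eta = seconds_to_finish(REMAINING_REVS, START_RATE, 0.0)
decayed_eta = seconds_to_finish(REMAINING_REVS, START_RATE, DROP_PER_SEC)
print(f"constant rate: {constant_eta / 3600:.2f} h")
print(f"decaying rate: {decayed_eta / 3600:.2f} h")
```

A linear decay is itself an assumption; the real slowdown may level off or get worse, so this only shows the direction and rough size of the effect.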
Tomasz Finc wrote:
Indeed. It's not scheduled to finish until 2010-03-13 12:03:56 (UTC), and it will likely take a bit longer, as I'm seeing the revs/sec drop by about 0.01 every 5 minutes.
The deviation isn't much, but when we're talking about millions of revisions, that starts adding up.
Almost there ...
--tomasz
Poor ms3 is doing all the work now. I can't blame it for all the slowdown, as I/O wait is just under 10%, but it's certainly making just enough of a difference to push this later.
--tomasz
http://download.wikimedia.org/enwiki/20100116/enwiki-20100116-user_groups.sq... But the latest dump seems broken and I don't know why.
On Tue, Mar 9, 2010 at 5:49 AM, Farkas, Illes fij@elte.hu wrote:
Dear All,
Many thanks for all your work with Wikipedia. We use it daily for various tasks and in our research on Wikipedia. ("we" = a subset of a research group of the Hungarian Academy of Sciences)
I just managed to wget the history dump enwiki-20100130-pages-meta-history.xml.bz2 a week ago, and I would like to combine this with status information about users. Do you happen to know if the status (admin, steward, bot, etc.) of all users has been logged for enwiki? In what way would this be available for download and for non-profit basic research?
Thanks, Illes -- http://hal.elte.hu/fij _______________________________________________ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
If you want a history, maybe you can try: http://download.wikimedia.org/enwiki/20100116/enwiki-20100116-pages-logging.... and http://download.wikimedia.org/metawiki/20100306/metawiki-20100306-pages-logg...
On Tue, Mar 9, 2010 at 12:53 PM, Liangent liangent@gmail.com wrote:
http://download.wikimedia.org/enwiki/20100116/enwiki-20100116-user_groups.sq... But the latest dump seems broken and I don't know why.
You might also get a history of user rights through the MediaWiki API. For documentation, see http://en.wikipedia.org/w/api.php .
For a list of stewards at a given time, you would have to query Meta-Wiki: http://meta.wikimedia.org/w/api.php?action=query&list=logevents&lety... (it might just be easier to look at the human-readable lists at http://meta.wikimedia.org/wiki/Stewards).
For sysops and other rights, you would have to query and filter the user rights log of the English Wikipedia: http://en.wikipedia.org/w/api.php?action=query&list=logevents&letype...
Best regards, Bence Damokos
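A hedged sketch of such a query in Python. The links above are truncated in the archive, so the parameters below are not recovered from them: letype=rights, lelimit, letitle and format=json are my assumption of a reasonable logevents query. The helper names are hypothetical, and the params/oldgroups/newgroups layout of each event is assumed from recent MediaWiki versions and may differ on other versions.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def rights_log_url(api_base, limit=50, title=None):
    """Build a logevents query URL for the user rights log."""
    params = {
        "action": "query",
        "list": "logevents",
        "letype": "rights",      # assumption: restrict to rights changes
        "lelimit": str(limit),
        "format": "json",
    }
    if title is not None:
        params["letitle"] = title  # e.g. "User:Example"
    return api_base + "?" + urlencode(params)


def fetch_events(url):
    """Fetch one batch of log events (defined here but not called below)."""
    req = Request(url, headers={"User-Agent": "rights-log-sketch/0.1"})
    with urlopen(req) as resp:
        return json.load(resp).get("query", {}).get("logevents", [])


def events_granting(events, group):
    """Keep events that add `group`, under the assumed event layout."""
    kept = []
    for event in events:
        params = event.get("params", {})
        if group in params.get("newgroups", []) and group not in params.get("oldgroups", []):
            kept.append(event)
    return kept


url = rights_log_url("http://en.wikipedia.org/w/api.php", limit=10, title="User:Example")
print(url)

# Made-up events in the assumed layout, used instead of a live request.
sample_events = [
    {"title": "User:Example", "timestamp": "2010-01-01T00:00:00Z",
     "params": {"oldgroups": [], "newgroups": ["sysop"]}},
    {"title": "User:Other", "timestamp": "2010-01-02T00:00:00Z",
     "params": {"oldgroups": ["sysop"], "newgroups": []}},
]
granted = events_granting(sample_events, "sysop")
print([e["title"] for e in granted])
```

To reconstruct who held a right at a given time, you would page through the whole log and replay the changes in timestamp order; the filter above only shows the basic selection step.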
On 9 March 2010 11:01, Liangent liangent@gmail.com wrote:
If you want a history, maybe you can try:
http://download.wikimedia.org/enwiki/20100116/enwiki-20100116-pages-logging.... and
http://download.wikimedia.org/metawiki/20100306/metawiki-20100306-pages-logg...
I don't know if this is the right place to ask. I heard there is a website where volunteers help with querying data from the Wikipedia dumps and with solving statistical problems.
May I ask whether it still exists? If so, how can I find it?
best,
Zeyi
Hi,
I think that is done on the databases on the Toolserver rather than on the dumps.
Huib
2010/3/9, zh509@york.ac.uk zh509@york.ac.uk:
I don't know if this is the right place to ask. I heard there is a website where volunteers help with querying data from the Wikipedia dumps and with solving statistical problems.
May I ask whether it still exists? If so, how can I find it?
best,
Zeyi
On Tue, Mar 9, 2010 at 9:31 AM, Huib Laurens sterkebak@gmail.com wrote:
I think that is done on the databases on the Toolserver rather than on the dumps.
Specifically, the Query service: https://wiki.toolserver.org/view/Query_service