Hi All
This is my first time posing on this list, I'm sorry if it is perhaps a little off topic. I'm currently Wikimedian in Residence at UNESCO and plan to run an online collaboration, a little bit like a short term Wikiproject with two main goals:
- Help organise reuse of UNESCO content on Wikimedia projects (UNESCO has released content under an open license and will do more shortly).
- Help improve content on Wikimedia of the subjects of UNESCO programmes e.g the World Heritage Sites.
I have been planning ways that I can use tools to:
- Organise work for contributors across all languages - Provide contributors feedback on their contributions (e.g page views for all contributions combined) - Measure success of the project.
I've been doing this on wiki here https://en.wikipedia.org/wiki/User:John_Cummings/Planning_UNESCO_metrics
In short I'm finding it very hard to find the tools needed, I have found less than a third of what I think would be helpful but found others that may be tangentially useful which I've added in.
Any help would be appreciate, please feel free to comment here, on the talk page or just add tools to the fields
Thanks
John
Hi John,
I added one suggestion re: pages-in-a-category to your work page. Not sure it was a particularly helpful one, though. Looking through your other possible metrics, I see ways of gathering most of them via database and/or API queries, but I don't know of many tools (other than the ones you've already listed) that provide a GUI interface that allows you to get those data without writing some queries or scripts yourself.
For queries related to Wikidata, you might try the new Wikidata Query Service (web interface[1], manual[2]). And if anyone you're working with knows any SQL or a scripting language like Python, you can gather most of your participation metrics (pages edited, etc) via Quarry[3].
If you need a particular query run on a one-off basis, like "tell me how many of the users in this list have created an article in this category", post a request to this list and someone here may be able to run it for you. If you want to learn how to run queries yourself, I can take an hour to get you started. If you're able to take some extra time now to learn the ropes of SQL, it will pay off in the future. We are blessed with a bunch of awesome research tools, but as you've discovered there will always be questions that you want to ask, that there isn't a tool for.
Hope that helps, J
1. https://query.wikidata.org/ 2. https://www.mediawiki.org/wiki/Wikidata_query_service/User_Manual 3. http://quarry.wmflabs.org/
On Fri, Oct 2, 2015 at 3:43 AM, john cummings mrjohncummings@gmail.com wrote:
Hi All
This is my first time posing on this list, I'm sorry if it is perhaps a little off topic. I'm currently Wikimedian in Residence at UNESCO and plan to run an online collaboration, a little bit like a short term Wikiproject with two main goals:
- Help organise reuse of UNESCO content on Wikimedia projects (UNESCO
has released content under an open license and will do more shortly).
- Help improve content on Wikimedia of the subjects of UNESCO
programmes e.g the World Heritage Sites.
I have been planning ways that I can use tools to:
- Organise work for contributors across all languages
- Provide contributors feedback on their contributions (e.g page views
for all contributions combined)
- Measure success of the project.
I've been doing this on wiki here https://en.wikipedia.org/wiki/User:John_Cummings/Planning_UNESCO_metrics
In short I'm finding it very hard to find the tools needed, I have found less than a third of what I think would be helpful but found others that may be tangentially useful which I've added in.
Any help would be appreciate, please feel free to comment here, on the talk page or just add tools to the fields
Thanks
John
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Thanks for this email.
This raises a wider question: What is the comfortable way to compare the coverage of a topic in different languages?
For example, I'd love to see a report that says:
Number of articles about UNESCO cultural heritage: English Wikipedia: 1000 French Wikipedia: 1200 Hebrew Wikipedia: 742 etc.
And also to track this over time, so if somebody would work hard on creating articles about UNESCO cultural heritage in Hebrew, I'd see a trend graph.
Of course, this can be relevant to any topic.
-- Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי http://aharoni.wordpress.com “We're living in pieces, I want to live in peace.” – T. Moore
2015-10-02 13:43 GMT+03:00 john cummings mrjohncummings@gmail.com:
Hi All
This is my first time posing on this list, I'm sorry if it is perhaps a little off topic. I'm currently Wikimedian in Residence at UNESCO and plan to run an online collaboration, a little bit like a short term Wikiproject with two main goals:
- Help organise reuse of UNESCO content on Wikimedia projects (UNESCO
has released content under an open license and will do more shortly).
- Help improve content on Wikimedia of the subjects of UNESCO
programmes e.g the World Heritage Sites.
I have been planning ways that I can use tools to:
- Organise work for contributors across all languages
- Provide contributors feedback on their contributions (e.g page views
for all contributions combined)
- Measure success of the project.
I've been doing this on wiki here https://en.wikipedia.org/wiki/User:John_Cummings/Planning_UNESCO_metrics
In short I'm finding it very hard to find the tools needed, I have found less than a third of what I think would be helpful but found others that may be tangentially useful which I've added in.
Any help would be appreciate, please feel free to comment here, on the talk page or just add tools to the fields
Thanks
John
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Amir E. Aharoni, 06/10/2015 15:12:
This raises a wider question: What is the comfortable way to compare the coverage of a topic in different languages?
https://tools.wmflabs.org/mix-n-match/ . Example: https://tools.wmflabs.org/mix-n-match/?mode=sitestats&catalog=17
Nemo
On 6 October 2015 at 14:12, Amir E. Aharoni amir.aharoni@mail.huji.ac.il wrote:
Thanks for this email.
This raises a wider question: What is the comfortable way to compare the coverage of a topic in different languages?
For example, I'd love to see a report that says:
Number of articles about UNESCO cultural heritage: English Wikipedia: 1000 French Wikipedia: 1200 Hebrew Wikipedia: 742 etc.
And also to track this over time, so if somebody would work hard on creating articles about UNESCO cultural heritage in Hebrew, I'd see a trend graph.
There's two general approaches to this:
a) On Wikidata b) On the individual wikis
Approach (a) would rely on having a defined set of things in Wikidata that we can identify. For example, "is a World Heritage Site" would be easy enough, since we have a property explicitly dealing with WHS identifiers (and we have 100% coverage in Wikidata). "Is of interest to UNESCO" is a trickier one - but if you can construct a suitable Wikidata query...
As Federico notes, for WHS records, we can generate a report like https://tools.wmflabs.org/mix-n-match/?mode=sitestats&catalog=93 (57.4% coverage on hewiki!). No graphs but if you were interested then you could probably set one up without much work.
b) is more useful for fuzzy groups like "of relevance to UNESCO", since this is more or less perfect for a category system. However, it would require examining the category tree for each WP you're interested in to figure out exactly which categories are relevant, and then running a script to count those daily.
A.
I happen to work on a tool (initially for Liam Wyatt) that might do some of what you want on Wikidata. Given a Wikidata Query (separate topic ;-) or a simple list of Wikidata items, it can record changes made to these items over time. It records the JSON for the Wikidata items, max of one revision/day.
A front-end (to be written) can then extract things like number of sitelinks (Wikipedia articles) for these items over time; Wikidata labels in different languages; number/type of statements added; etc. Ideally, this can be exported as a table, to make pretty stats in R (or the like).
As I said, it's work in progress, but if you have a (initial) list of items, I can start "recording".
On Tue, Oct 6, 2015 at 4:54 PM Andrew Gray andrew.gray@dunelm.org.uk wrote:
On 6 October 2015 at 14:12, Amir E. Aharoni amir.aharoni@mail.huji.ac.il wrote:
Thanks for this email.
This raises a wider question: What is the comfortable way to compare the coverage of a topic in different languages?
For example, I'd love to see a report that says:
Number of articles about UNESCO cultural heritage: English Wikipedia: 1000 French Wikipedia: 1200 Hebrew Wikipedia: 742 etc.
And also to track this over time, so if somebody would work hard on
creating
articles about UNESCO cultural heritage in Hebrew, I'd see a trend graph.
There's two general approaches to this:
a) On Wikidata b) On the individual wikis
Approach (a) would rely on having a defined set of things in Wikidata that we can identify. For example, "is a World Heritage Site" would be easy enough, since we have a property explicitly dealing with WHS identifiers (and we have 100% coverage in Wikidata). "Is of interest to UNESCO" is a trickier one - but if you can construct a suitable Wikidata query...
As Federico notes, for WHS records, we can generate a report like https://tools.wmflabs.org/mix-n-match/?mode=sitestats&catalog=93 (57.4% coverage on hewiki!). No graphs but if you were interested then you could probably set one up without much work.
b) is more useful for fuzzy groups like "of relevance to UNESCO", since this is more or less perfect for a category system. However, it would require examining the category tree for each WP you're interested in to figure out exactly which categories are relevant, and then running a script to count those daily.
A.
- Andrew Gray andrew.gray@dunelm.org.uk
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Hi all
The Mix n' Match lists are amazing, very surprising that 1/4 of World Heritage sites don't have an en.wiki article, I assume some of this is where the WHS status is for an area combining more than one structure.
It appears Wikidata is the way to go, I will work some more on "is of interest to UNESCO", the main list of topics initially would be:
- World Heritage Sites - Man and the Biosphere reserves - Global Geoparks Network - Memory of the World Programme - Masterpieces of the Oral and Intangible Heritage of Humanity - International days observed by UNESCO - UNESCO prizes - The UNESCO art collection
Would Mix n' Match be the best way to add Wikidata items for each of the things covered by these programmes? I can provide spreadsheets of items for each with properties.
Thanks
John
On 6 October 2015 at 18:45, Magnus Manske magnusmanske@googlemail.com wrote:
I happen to work on a tool (initially for Liam Wyatt) that might do some of what you want on Wikidata. Given a Wikidata Query (separate topic ;-) or a simple list of Wikidata items, it can record changes made to these items over time. It records the JSON for the Wikidata items, max of one revision/day.
A front-end (to be written) can then extract things like number of sitelinks (Wikipedia articles) for these items over time; Wikidata labels in different languages; number/type of statements added; etc. Ideally, this can be exported as a table, to make pretty stats in R (or the like).
As I said, it's work in progress, but if you have a (initial) list of items, I can start "recording".
On Tue, Oct 6, 2015 at 4:54 PM Andrew Gray andrew.gray@dunelm.org.uk wrote:
On 6 October 2015 at 14:12, Amir E. Aharoni amir.aharoni@mail.huji.ac.il wrote:
Thanks for this email.
This raises a wider question: What is the comfortable way to compare the coverage of a topic in different languages?
For example, I'd love to see a report that says:
Number of articles about UNESCO cultural heritage: English Wikipedia: 1000 French Wikipedia: 1200 Hebrew Wikipedia: 742 etc.
And also to track this over time, so if somebody would work hard on
creating
articles about UNESCO cultural heritage in Hebrew, I'd see a trend
graph.
There's two general approaches to this:
a) On Wikidata b) On the individual wikis
Approach (a) would rely on having a defined set of things in Wikidata that we can identify. For example, "is a World Heritage Site" would be easy enough, since we have a property explicitly dealing with WHS identifiers (and we have 100% coverage in Wikidata). "Is of interest to UNESCO" is a trickier one - but if you can construct a suitable Wikidata query...
As Federico notes, for WHS records, we can generate a report like https://tools.wmflabs.org/mix-n-match/?mode=sitestats&catalog=93 (57.4% coverage on hewiki!). No graphs but if you were interested then you could probably set one up without much work.
b) is more useful for fuzzy groups like "of relevance to UNESCO", since this is more or less perfect for a category system. However, it would require examining the category tree for each WP you're interested in to figure out exactly which categories are relevant, and then running a script to count those daily.
A.
- Andrew Gray andrew.gray@dunelm.org.uk
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
wiki-research-l@lists.wikimedia.org