Pursuant to prior discussions about the need for a research
policy on Wikipedia, WikiProject Research is drafting a
policy regarding the recruitment of Wikipedia users to
participate in studies.
At this time, we have a proposed policy, and an accompanying
group that would facilitate recruitment of subjects in much
the same way that the Bot Approvals Group approves bots.
The policy proposal can be found at:
http://en.wikipedia.org/wiki/Wikipedia:Research
The Subject Recruitment Approvals Group mentioned in the proposal
is being described at:
http://en.wikipedia.org/wiki/Wikipedia:Subject_Recruitment_Approvals_Group
Before we move forward with seeking approval from the Wikipedia
community, we would like additional input about the proposal,
and would welcome additional help improving it.
Also, please consider participating in WikiProject Research at:
http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Research
--
Bryan Song
GroupLens Research
University of Minnesota
We’re glad to announce the release of an aggregate clickstream dataset extracted from English Wikipedia
http://dx.doi.org/10.6084/m9.figshare.1305770 <http://dx.doi.org/10.6084/m9.figshare.1305770>
This dataset contains counts of (referer, article) pairs aggregated from the HTTP request logs of English Wikipedia. This snapshot captures 22 million (referer, article) pairs from a total of 4 billion requests collected during the month of January 2015.
This data can be used for various purposes:
• determining the most frequent links people click on for a given article
• determining the most common links people followed to an article
• determining how much of the total traffic to an article clicked on a link in that article
• generating a Markov chain over English Wikipedia
We created a page on Meta for feedback and discussion about this release: https://meta.wikimedia.org/wiki/Research_talk:Wikipedia_clickstream <https://meta.wikimedia.org/wiki/Research_talk:Wikipedia_clickstream>
Ellery and Dario
Cross-posting this request to wiki-research-l. Anyone have data on
frequently used section titles in articles (any language), or know of
datasets/publications that examined this?
I'm not aware of any off the top of my head, Amir.
- Jonathan
---------- Forwarded message ----------
From: Amir E. Aharoni <amir.aharoni(a)mail.huji.ac.il>
Date: Sat, Jul 11, 2015 at 3:29 AM
Subject: [Wikitech-l] statistics about frequent section titles
To: Wikimedia developers <wikitech-l(a)lists.wikimedia.org>
Hi,
Did anybody ever try to collect statistics about frequent section titles in
Wikimedia projects?
For Wikipedia, for example, titles such as "Biography", "Early life",
"Bibliography", "External links", "References", "History", etc., appear in
a lot of articles, and their counterparts appear in a lot of languages.
There are probably similar things in Wikivoyage, Wiktionary and possibly
other projects.
Did anybody ever try to collect statistics of the most frequent section
titles in each language and project?
--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
http://aharoni.wordpress.com
“We're living in pieces,
I want to live in peace.” – T. Moore
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
--
Jonathan T. Morgan
Senior Design Researcher
Wikimedia Foundation
User:Jmorgan (WMF) <https://meta.wikimedia.org/wiki/User:Jmorgan_(WMF)>
Hoi,
One of the area's where Wikipedia is not performing optimally is in the
area of mental health. I am working towards an editathon on the subject.
The aim is to consider information in multiple languages not only Dutch.
What I am looking for is how we can measure the impact it has on traffic.
After all I am interested in the whole of the subject not only in the
articles that are updated or written.
My problem is in two: what is the best way of identifying a subject as
broad as mental health and is it possible to get historic information on
traffic?
Thanks,
GerardM
PS I am working on Wikidata on the subject.
Yesterday I gave a presentation about community policing at the Cascadia
Wikimedians' end of year event with Seattle TA3M [1][2][3]. An issue that
came up for discussion is the extent to which, on English Wikipedia,
experienced Wikipedians conducting New Page Patrol create collateral damage
during their well-intentioned efforts to protect Wikipedia. Another subject
that came up is the need for more human resources for mentoring of newbies
who create articles using the Articles for Creation system [4]; one comment
I've heard previously is that the length of time between submission and
review may be long enough for the newbie to give up and disappear, and
another comment that I've heard is that newbies may not understand the
instructions that they're given when their article is reviewed. These
comments correlate with the community SWOT analysis that was done at
WikiConference USA this year, in which "biting the newbies", NPP, and
"onboarding/training" were identified as weaknesses [5]
Personally, I would like the interaction of experienced editors with the
newbies in places like NPP and AFC to look more like this
<https://en.wikipedia.org/wiki/File:Police_Week_May_15,_2010_on_Court_Avenue…>
and less like this
<https://en.wikipedia.org/wiki/File:Ferguson_Day_6,_Picture_44.png>.
Granted, it's hard for a relatively small number of experienced Wikipedians
to keep all the junk and vandals out while also mentoring the newbies and
avoiding collateral damage, so one strategy could be to increase the
quantity of skilled human resources that are devoted to these domains. Any
thoughts on how to make that happen?
I am currently especially interested in this topic because of my IEG
project which officially starts this week. [6] It would be very helpful to
retain the new editors that are trained through these videos, so improving
editor retention via improved newbie experiences at NPP and/or AFC would be
most welcome.
Pine
[1] https://en.wikipedia.org/wiki/Community_policing
[2] https://en.wikipedia.org/wiki/Police_reform_in_the_United_States
[3]
https://commons.wikimedia.org/wiki/File:Presentations_at_Cascadia_Wikimedia…
[4] https://en.wikipedia.org/wiki/Wikipedia:Articles_for_creation
[5]
https://commons.wikimedia.org/wiki/File:SWOT_analysis_of_Wikipedia_in_2015.…
[6]
https://meta.wikimedia.org/wiki/Grants:IEG/Motivational_and_educational_vid…
There might be something Wikipedia-related in this data set, so just in
case, I thought the group might find this Yahoo Labs announcement
interesting:
http://yahoolabs.tumblr.com/post/137281912191/yahoo-releases-the-largest-ev…
"Today, we are proud to announce the public release of the largest-ever
machine learning dataset to the research community. The dataset stands at a
massive ~110B events (13.5TB uncompressed) of anonymized user-news item
interaction data, collected by recording the user-news item interactions of
about 20M users from February 2015 to May 2015.”
Call for Applications: International Visiting Fellowships in Critical
Digital & Social Media Research
The Westminster Institute for Advanced Studies (WIAS)
www.westminster.ac.uk/wias is a newly created academic space at the
University of Westminster in London for independent critical thinking
beyond borders. Its inaugural research theme is Critical Digital &
Social Media Research.
One of the WIAS’ key features is the Research Fellowship Programme that
attracts and brings together current and future academic leaders. We
invite applications for international junior and senior research fellows
(from all academic backgrounds) who conduct fellowship research projects
in the realm of Critical Social & Digital Media Research for the
duration of 3 months in 2016. Several fellowships will be awarded as
result of this call. The fellowships cover airfare and a contribution to
accomodation and subsistence in London.
Funded scholarships are only awarded as a result of open calls. The WIAS
invites both junior and senior fellows. Junior fellows are researchers
who hold a PhD that has been awarded not more than 5 years before the
date of the call publication. Senior fellows are researchers who hold a
PhD that has been awarded more than 5 years before the call is published.
More details and application:
https://www.westminster.ac.uk/news/2016/call-for-applications-international…
Application deadline: February 29, 2016, 17:00 BST
Subscription to the WIAS newsletter in order to receive updates about
events, future fellowship calls, calls of the book series "Critical
Digital & Social Media Studies", publications, etc. is possible here:
https://www.westminster.ac.uk/newsletter
Hi everybody,
We’re preparing for the January 2016 research newsletter and looking for contributors. Please take a look at: https://etherpad.wikimedia.org/p/WRN201601 and add your name next to any paper you are interested in covering. Our target publication date is Wednesday January 27 UTC although actual publication might happen several days later. As usual, short notes and one-paragraph reviews are most welcome.
Highlights from this month:
50/50 Norm in Massive Online Public Good: The Case of Wikipedia
Advances in Network Science| chapter = Studying the Role of Diversity in Open Collaboration Network: Experiments on Wikipedia
An AI for the Wikipedia Game
Analyzing the Usage of Wikipedia on Twitter: Understanding Inter-Language Links
Anon productivity and productive efficiency in English Wikipedia (Showcase, Jan. 2016)
Common knowledge? An ethnography of Wikipedia
Das Ende klassischer Printmedien? Ein Vergleich soziologischer Einführungswerke und Lexika mit der Online-Enzyklopädie Wikipedia aus studentischer Perspektive
From Digital Library Citation Parsing to Wikipedia Reference Analysis
Hidden revolution of human priorities: An analysis of biographical data from Wikipedia
Identifying missing dictionary entries with frequency-conserving context models
Intellectual interchanges in the history of the massive online open-editing encyclopedia, Wikipedia
Not at Home on the Range: Peer Production and the Urban/Rural Divide
On the origin of burstiness in human behavior: The wikipedia edits case
Political Advertising on the Wikipedia Market Place of Information
Population automation: An interview with Wikipedia bot pioneer Ram-Man
Prediction of influenza outbreaks by integrating Wikipedia article access logs and Google flu trend data
Public relations interactions with Wikipedia
Quantifying the Relationship between Hit Count Estimates and Wikipedia Article Traffic
Relevance Analyses and Automatic Categorization of Wikipedia Articles
The Detection of Emerging Trends Using Wikipedia Traffic Data and Context Networks
The Evolution of Wikipedia's Norm Network
The Impact of Sentiment-driven Feedback on Knowledge Reuse in Online Communities
The Impact of Sentiment-driven Feedback on Knowledge Reuse in Online Communities
Verifying social network models of Wikipedia knowledge community
Vier von fünf Internetnutzern recherchieren bei Wikipedia
Where are the Women in Wikipedia? Understanding the Different Psychological Experiences of Men and Women in Wikipedia
Wikiometrics: A Wikipedia Based Ranking System
Wikipedia: Access and participation in an open encyclopaedia
Women Through the Glass-Ceiling: Gender Asymmetries in Wikipedia
If you have any question about the format or process feel free to get in touch off-list.
Masssly, Tilman Bayer and Dario Taraborelli
[1] http://meta.wikimedia.org/wiki/Research:Newsletter
Cross-posting.
---------- Forwarded message ----------
From: Pine W <wiki.pine(a)gmail.com>
Date: Tue, Jan 19, 2016 at 11:03 PM
Subject: Office hour regarding "Motivational and educational video to
introduce Wikimedia"
Hi all,
I will have an office hour regarding the development of the Motivational
and educational video to introduce Wikimedia
<https://meta.wikimedia.org/wiki/Grants:IEG/Motivational_and_educational_vid…>
project. I would appreciate your comments and questions about:
* The outline of the subjects that will be included in the videos
* Good practices for onboarding newcomers that may be leveraged in the
video series
* What to name the video series when it is released to the general public
* Translation of the public communications related to the videos, and
translation of the script
* Anything else related to the series
Please participate in a Doodle poll to help with scheduling the office
hour: http://doodle.com/poll/eu4m62ccdq4senuv
I also welcome questions and comments on the project talk page
<https://meta.wikimedia.org/wiki/Grants_talk:IEG/Motivational_and_educationa…>
.
Thank you!
Pine