Pursuant to prior discussions about the need for a research
policy on Wikipedia, WikiProject Research is drafting a
policy regarding the recruitment of Wikipedia users to
participate in studies.
At this time, we have a proposed policy, and an accompanying
group that would facilitate recruitment of subjects in much
the same way that the Bot Approvals Group approves bots.
The policy proposal can be found at:
The Subject Recruitment Approvals Group mentioned in the proposal
is being described at:
Before we move forward with seeking approval from the Wikipedia
community, we would like additional input about the proposal,
and would welcome additional help improving it.
Also, please consider participating in WikiProject Research at:
University of Minnesota
For all Hive users using stat1002/1004, you might have seen a deprecation
warning when you launch the hive client - that claims it's being replaced
with Beeline. The Beeline shell has always been available to use, but it
required supplying a database connection string every time, which was
pretty annoying. We now have a wrapper
setup to make this easier. The old Hive CLI will continue to exist, but we
encourage moving over to Beeline. You can use it by logging into the
stat1002/1004 boxes as usual, and launching `beeline`.
There is some documentation on this here:
If you run into any issues using this interface, please ping us on the
Analytics list or #wikimedia-analytics or file a bug on Phabricator
(If you are wondering stat1004 whaaat - there should be an announcement
coming up about it soon!)
Curious, what percentage of digital assistants (Alexa, Siri, Cortana,
Google) cite Wikipedia when a person asks a question?
Does the current Wikipedia mobile app support voice search?
Are there any reports on this? Thanks in advance!
Stella Yu | STELLARESULTS | 415 690 7827
"Chronicling heritage brands and legendary people."
(Apologies for cross-posting)
The organizing committee of the 9th Annual International Conference on
Social Media and Society (#SMSociety) invites you to visit wonderful
Copenhagen on July 18-20, 2018 to experience scholarly inspiration and
Nordic design excellence, architecture brilliance, gastronomic creativity,
and social inclusiveness. The 2018 conference will be held at the Copenhagen
Business School and hosted by the Centre for Business Data Analytics. This
year's theme is on "Networked Influence and Virality - REVISITED".
* Please register by May 1, 2018 to lock in the early-bird rate:
The Conference offers an intensive 3-day program including hands-on
workshops, full & work-in-progress papers, panels, and posters featuring the
latest in social media research. Our interdisciplinary community brings
researchers from a wide variety of disciplines including Communication,
Information Science, Education, Journalism, Management, Computer Science,
Political Science, Sociology, Health, and other fields. This year, we are
honoured to have a highly distinguished scholar and leader as our keynote
* Prof. Karine Nahon - the elected president of the Israel Internet
Association (ISOC-IL), an Associate Professor of Information Science in the
Lauder School of Government and Ofer School of Communications at the
Interdisciplinary Center at Herzliya (IDC), Israel, and an Affiliated
Associate Professor in the Information School at University of Washington
We hope you can join us for this exciting event and contribute to this
emerging research area! If you have any questions about the conference,
please email us at: <mailto:email@example.com>
ask(a)socialmediaandsociety.org or visit our website at
2018 #SMSociety Organizing Committee:
. Anatoliy Gruzd, Ryerson University - Conference Chair
. Ravi Vatrapu, Copenhagen Business School - Local Conference Chair
. Jenna Jacobson, Ryerson University - Conference Chair
. Philip Mai, Ryerson University - Conference Chair
. Anabel Quan-Haase, Western University - Panel Chair
. Luke Sloan, Cardiff University - Panel Chair
. K. Hazel Kwon, Arizona State University -WIP Papers Chair
. Jeff Hemsley, Syracuse University - WIP Papers Chair
. Jaigris Hodson, Royal Roads University - Poster Chair
[Crossposting to Research and Analytics lists]
Most Wikipedia articles with a weekly periodicity show more pageviews
on a typical weekday than a weekend. Some articles associated with
weekends (e.g. articles associated with a variety of hobbies) will
show relatively fewer pave views on weekdays.
Suppose I wanted to plot a heatmap with colors corresponding to the
strength of the weekly periodicity of the pageviews of articles shown
in different geographic locations.
(1) Has anyone done anything like this before?
(2) Is sufficient information available from the current logging regime?
Finally, I would also like to ask for review of this summarization, please:
[sharing because the URL was obscured in the Google Hangout on Air yesterday.]
---------- Forwarded message ----------
From: Nicholas Vincent <nickvincent(a)u.northwestern.edu>
Date: Fri, Apr 20, 2018 at 8:00 AM
Subject: Re: nice Wikimedia Research presentation
To: James Salsman <jsalsman(a)gmail.com>
Really glad you enjoyed the presentation!
Here's the link to my personal website:
and to our lab website (which I should probably add to the slides!)
On Fri, Apr 20, 2018 at 2:15 AM James Salsman <jsalsman(a)gmail.com> wrote:
> I really enjoyed your presentation.
> What is your web address, obscured in the attached?
> Thank you!
> Best regards,
We are excited to announce that the 5th annual Wiki Workshop  will
take place in Lyon on April 24, 2018 and as part of The Web Conference
2018 (a.k.a. WWW2018) .
You can access the call for papers at
http://wikiworkshop.org/2018/#call . Please submit your ongoing or
completed research related to Wikimedia projects to the workshop. Note
that 2018-01-28 is the submission deadline if you want your paper to
appear in the proceedings, and 2018-03-11 is for all other papers.
Following the past year's model, the workshop will have a set of
invited talks (Jon Kleinberg and Markus Kroetzsch have already
accepted our invitation  \o/), a poster session, and more.
Questions and comments are welcome. Otherwise, we're looking forward
to receiving your submissions and seeing you in Lyon in April. :)
Leila, on behalf of the organizers 
Senior Research Scientist
The next Research Showcase will be live-streamed this Wednesday, April 18,
2018 at 11:30 AM (PDT) 18:30 UTC.
YouTube stream: https://www.youtube.com/watch?v=Z1pa-pr6xis
As usual, you can join the conversation on IRC at #wikimedia-research. And,
you can watch our past research showcases here.
The Critical Relationship of Volunteer Created Wikipedia Content to
Large-Scale Online CommunitiesBy *Nate TeBlunthuis*The extensive Wikipedia
literature has largely considered Wikipedia in isolation, outside of the
context of its broader Internet ecosystem. Very recent research has
demonstrated the significance of this limitation, identifying critical
relationships between Google and Wikipedia that are highly relevant to many
areas of Wikipedia-based research and practice. In this talk, I will
present a study which extends this recent research beyond search engines to
examine Wikipedia’s relationships with large-scale online communities,
Stack Overflow and Reddit in particular. I will discuss evidence of
consequential, albeit unidirectional relationships. Wikipedia provides
substantial value to both communities, with Wikipedia content increasing
visitation, engagement, and revenue, but we find little evidence that these
websites contribute to Wikipedia in return. Overall, these findings
highlight important connections between Wikipedia and its broader ecosystem
that should be considered by researchers studying Wikipedia. Overall, this
talk will emphasize the key role that volunteer-created Wikipedia content
plays in improving other websites, even contributing to revenue generation.
The Rise and Decline of an Open Collaboration System, a Closer LookBy *Nate
TeBlunthuis*Do patterns of growth and stabilization found in large peer
production systems such as Wikipedia occur in other communities? This study
assesses the generalizability of Halfaker etal.’s influential 2013 paper on
“The Rise and Decline of an Open Collaboration System.” We replicate its
tests of several theories related to newcomer retention and norm
entrenchment using a dataset of hundreds of active peer production wikis
from Wikia. We reproduce the subset of the findings from Halfaker and
colleagues that we are able to test, comparing both the estimated signs and
magnitudes of our models. Our results support the external validity of
Halfaker et al.’s claims that quality control systems may limit the growth
of peer production communities by deterring new contributors and that norms
tend to become entrenched over time.
Sarah R. Rodlund
Senior Project Coordinator-Product & Technology, Wikimedia Foundation | Hic
>... Accepted papers
> Christoph Hube and Besnik Fetahu
> Detecting Biased Statements in Wikipedia
Hi Christoph and Besnik,
Having worked with several thousand of Amazon Mechanical Turkers over
the past year, I am not convinced that their opinions of bias, even in
aggregate, are not biased. Did you take any steps to measure the bias
against accuracy in your crowdworkers?
Here is an example of what I expect they would get wrong:
"Tax cuts allow consumers to increase their spending, which boosts
That statement, added by en:User:Bkwillwm in 2012, is still part of
the English Wikipedia's Economics article. However, the statement is
strictly inaccurate, and heavily biased in favor of trickle-down
economics and austerity policy. It and statements like it,
pervasive through many if not most of the popular language Wikipedias,
directly support increases in income inequality, which in turn is a
terrible scourge affecting both health and economic growth.
How can you measure whether your crowdworkers are truly unbiased
relative to accuracy, instead of just reflecting the
propaganda-influenced whims of the populist center?