For all Hive users using stat1002/1004, you might have seen a deprecation
warning when you launch the hive client - that claims it's being replaced
with Beeline. The Beeline shell has always been available to use, but it
required supplying a database connection string every time, which was
pretty annoying. We now have a wrapper
setup to make this easier. The old Hive CLI will continue to exist, but we
encourage moving over to Beeline. You can use it by logging into the
stat1002/1004 boxes as usual, and launching `beeline`.
There is some documentation on this here:
If you run into any issues using this interface, please ping us on the
Analytics list or #wikimedia-analytics or file a bug on Phabricator
(If you are wondering stat1004 whaaat - there should be an announcement
coming up about it soon!)
Curious, what percentage of digital assistants (Alexa, Siri, Cortana,
Google) cite Wikipedia when a person asks a question?
Does the current Wikipedia mobile app support voice search?
Are there any reports on this? Thanks in advance!
Stella Yu | STELLARESULTS | 415 690 7827
"Chronicling heritage brands and legendary people."
(Apologies for cross-posting)
The organizing committee of the 9th Annual International Conference on
Social Media and Society (#SMSociety) invites you to visit wonderful
Copenhagen on July 18-20, 2018 to experience scholarly inspiration and
Nordic design excellence, architecture brilliance, gastronomic creativity,
and social inclusiveness. The 2018 conference will be held at the Copenhagen
Business School and hosted by the Centre for Business Data Analytics. This
year's theme is on "Networked Influence and Virality - REVISITED".
* Please register by May 1, 2018 to lock in the early-bird rate:
The Conference offers an intensive 3-day program including hands-on
workshops, full & work-in-progress papers, panels, and posters featuring the
latest in social media research. Our interdisciplinary community brings
researchers from a wide variety of disciplines including Communication,
Information Science, Education, Journalism, Management, Computer Science,
Political Science, Sociology, Health, and other fields. This year, we are
honoured to have a highly distinguished scholar and leader as our keynote
* Prof. Karine Nahon - the elected president of the Israel Internet
Association (ISOC-IL), an Associate Professor of Information Science in the
Lauder School of Government and Ofer School of Communications at the
Interdisciplinary Center at Herzliya (IDC), Israel, and an Affiliated
Associate Professor in the Information School at University of Washington
We hope you can join us for this exciting event and contribute to this
emerging research area! If you have any questions about the conference,
please email us at: <mailto:email@example.com>
ask(a)socialmediaandsociety.org or visit our website at
2018 #SMSociety Organizing Committee:
. Anatoliy Gruzd, Ryerson University - Conference Chair
. Ravi Vatrapu, Copenhagen Business School - Local Conference Chair
. Jenna Jacobson, Ryerson University - Conference Chair
. Philip Mai, Ryerson University - Conference Chair
. Anabel Quan-Haase, Western University - Panel Chair
. Luke Sloan, Cardiff University - Panel Chair
. K. Hazel Kwon, Arizona State University -WIP Papers Chair
. Jeff Hemsley, Syracuse University - WIP Papers Chair
. Jaigris Hodson, Royal Roads University - Poster Chair
[Crossposting to Research and Analytics lists]
Most Wikipedia articles with a weekly periodicity show more pageviews
on a typical weekday than a weekend. Some articles associated with
weekends (e.g. articles associated with a variety of hobbies) will
show relatively fewer pave views on weekdays.
Suppose I wanted to plot a heatmap with colors corresponding to the
strength of the weekly periodicity of the pageviews of articles shown
in different geographic locations.
(1) Has anyone done anything like this before?
(2) Is sufficient information available from the current logging regime?
Finally, I would also like to ask for review of this summarization, please:
[sharing because the URL was obscured in the Google Hangout on Air yesterday.]
---------- Forwarded message ----------
From: Nicholas Vincent <nickvincent(a)u.northwestern.edu>
Date: Fri, Apr 20, 2018 at 8:00 AM
Subject: Re: nice Wikimedia Research presentation
To: James Salsman <jsalsman(a)gmail.com>
Really glad you enjoyed the presentation!
Here's the link to my personal website:
and to our lab website (which I should probably add to the slides!)
On Fri, Apr 20, 2018 at 2:15 AM James Salsman <jsalsman(a)gmail.com> wrote:
> I really enjoyed your presentation.
> What is your web address, obscured in the attached?
> Thank you!
> Best regards,
We are excited to announce that the 5th annual Wiki Workshop  will
take place in Lyon on April 24, 2018 and as part of The Web Conference
2018 (a.k.a. WWW2018) .
You can access the call for papers at
http://wikiworkshop.org/2018/#call . Please submit your ongoing or
completed research related to Wikimedia projects to the workshop. Note
that 2018-01-28 is the submission deadline if you want your paper to
appear in the proceedings, and 2018-03-11 is for all other papers.
Following the past year's model, the workshop will have a set of
invited talks (Jon Kleinberg and Markus Kroetzsch have already
accepted our invitation  \o/), a poster session, and more.
Questions and comments are welcome. Otherwise, we're looking forward
to receiving your submissions and seeing you in Lyon in April. :)
Leila, on behalf of the organizers 
Senior Research Scientist
The next Research Showcase will be live-streamed this Wednesday, April 18,
2018 at 11:30 AM (PDT) 18:30 UTC.
YouTube stream: https://www.youtube.com/watch?v=Z1pa-pr6xis
As usual, you can join the conversation on IRC at #wikimedia-research. And,
you can watch our past research showcases here.
The Critical Relationship of Volunteer Created Wikipedia Content to
Large-Scale Online CommunitiesBy *Nate TeBlunthuis*The extensive Wikipedia
literature has largely considered Wikipedia in isolation, outside of the
context of its broader Internet ecosystem. Very recent research has
demonstrated the significance of this limitation, identifying critical
relationships between Google and Wikipedia that are highly relevant to many
areas of Wikipedia-based research and practice. In this talk, I will
present a study which extends this recent research beyond search engines to
examine Wikipedia’s relationships with large-scale online communities,
Stack Overflow and Reddit in particular. I will discuss evidence of
consequential, albeit unidirectional relationships. Wikipedia provides
substantial value to both communities, with Wikipedia content increasing
visitation, engagement, and revenue, but we find little evidence that these
websites contribute to Wikipedia in return. Overall, these findings
highlight important connections between Wikipedia and its broader ecosystem
that should be considered by researchers studying Wikipedia. Overall, this
talk will emphasize the key role that volunteer-created Wikipedia content
plays in improving other websites, even contributing to revenue generation.
The Rise and Decline of an Open Collaboration System, a Closer LookBy *Nate
TeBlunthuis*Do patterns of growth and stabilization found in large peer
production systems such as Wikipedia occur in other communities? This study
assesses the generalizability of Halfaker etal.’s influential 2013 paper on
“The Rise and Decline of an Open Collaboration System.” We replicate its
tests of several theories related to newcomer retention and norm
entrenchment using a dataset of hundreds of active peer production wikis
from Wikia. We reproduce the subset of the findings from Halfaker and
colleagues that we are able to test, comparing both the estimated signs and
magnitudes of our models. Our results support the external validity of
Halfaker et al.’s claims that quality control systems may limit the growth
of peer production communities by deterring new contributors and that norms
tend to become entrenched over time.
Sarah R. Rodlund
Senior Project Coordinator-Product & Technology, Wikimedia Foundation | Hic
>... Accepted papers
> Christoph Hube and Besnik Fetahu
> Detecting Biased Statements in Wikipedia
Hi Christoph and Besnik,
Having worked with several thousand of Amazon Mechanical Turkers over
the past year, I am not convinced that their opinions of bias, even in
aggregate, are not biased. Did you take any steps to measure the bias
against accuracy in your crowdworkers?
Here is an example of what I expect they would get wrong:
"Tax cuts allow consumers to increase their spending, which boosts
That statement, added by en:User:Bkwillwm in 2012, is still part of
the English Wikipedia's Economics article. However, the statement is
strictly inaccurate, and heavily biased in favor of trickle-down
economics and austerity policy. It and statements like it,
pervasive through many if not most of the popular language Wikipedias,
directly support increases in income inequality, which in turn is a
terrible scourge affecting both health and economic growth.
How can you measure whether your crowdworkers are truly unbiased
relative to accuracy, instead of just reflecting the
propaganda-influenced whims of the populist center?
Call for Papers
formal papers - informal papers - doctoral programme
11th Conference on Intelligent Computer Mathematics
- CICM 2018 -
August 13-17, 2018
RISC, Hagenberg, Austria
Digital and computational solutions are becoming the prevalent means
for the generation, communication, processing, storage and curation of
CICM brings together the many separate communities that have developed
theoretical and practical solutions for mathematical applications such as
computation, deduction, knowledge management, and user interfaces.
It offers a venue for discussing problems and solutions in each of these
areas and their integration.
CICM 2018 will feature 3 invited speakers
* Akiko Aizawa, National Institute of Informatics, University of Tokyo
* Bruno Buchberger, Research Institute for Symbolic Computation, Johannes Kepler University
* Adri Olde Daalhuis, University of Edinburgh
and 5 affiliated workshops
* Computer Algebra in the age of Types
* Computer Mathematics in Education - Enlightenment or Incantation
* Formal Mathematics for Mathematicians
* Formal Verification of Physical Systems
* Mathematical Models and Mathematical Software as Research Data
We invite submissions in all topics relating to intelligent computer
mathematics, in particular but not limited to
* theorem proving and computer algebra
* mathematical knowledge management
* digital mathematical libraries
CICM appreciates the varying nature of the relevant research in this area and
invites submissions of very different forms:
1) Formal submissions will be reviewed rigorously and accepted papers will be
published in a volume of Springer LNAI:
* regular papers (up to 15 pages) present novel research results
* project and survey papers (up to 15 pages + bibliography) summarize
* system and dataset descriptions (up to 5 pages) present digital artifacts
2) Informal submissions will be reviewed with a positive bias and selected for
presentation based on their relevance for the community.
* informal papers may present work-in-progress, project announcements,
position statements, etc.
* posters, mini-tutorials, and system demos will be presented in special sessions
3) The doctoral programme provides PhD students a forum to present early results
receive constructive feedback and mentoring.
* Important Dates *
- Abstract deadline: April 22 (Extended)
- Full paper deadline: April 29 (Extended)
- Reviews sent to authors: May 23
- Rebuttals due: May 28
- Notification of acceptance: June 4
- Camera-ready copies due: June 8
- Conference: August 13-17
Informal submissions and doctoral programme
Two separate submission rounds are offered so that some authors can make early
travel plans while others submit spontaneously.
- First round submission deadline: April 29
- Second round submission deadline: July 15
All submissions should be made via easychair at https://easychair.org/conferences/?conf=cicm2018