For all Hive users using stat1002/1004, you might have seen a deprecation
warning when you launch the hive client - that claims it's being replaced
with Beeline. The Beeline shell has always been available to use, but it
required supplying a database connection string every time, which was
pretty annoying. We now have a wrapper
setup to make this easier. The old Hive CLI will continue to exist, but we
encourage moving over to Beeline. You can use it by logging into the
stat1002/1004 boxes as usual, and launching `beeline`.
There is some documentation on this here:
If you run into any issues using this interface, please ping us on the
Analytics list or #wikimedia-analytics or file a bug on Phabricator
(If you are wondering stat1004 whaaat - there should be an announcement
coming up about it soon!)
I'm doing some research on the use of image and video in Wikipedia and
would like to know if there is any way to track # of video views in
Wikipedia articles ?
Image view per page I use the GLAM tools, but for video, I'm not sure if
there is a tool or general Wikipedia stat on # of videos currently used in
all languages, # of Wikipedia articles containing video and # of views to
I understand use of video online is exploiting, and wondered if the wiki
had stats on this as well.
your feedback will be most appreciated !
thanks much in advance
m: +31 (0)6 244 84998 | s: trilcen | t: @trilcenavarrete
Apologies for any cross-posting
CompleNet 2017 - 8th Conference on Complex Networks
Where and When:
Dubrovnik, Croatia, March 21st-24th, 2016.
* Abstract/Paper submission deadline: November 27, 2016
* Notification of acceptance: December 23, 2016
* Submission of Camera-Ready (papers): January 8, 2017
* Early registration ends on: January 20, 2017
Submit your contribution through this easychair link
Johan Bollen <http://www.informatics.indiana.edu/jbollen/>, Indiana
University, IN, US
Guido Caldarelli <http://www.guidocaldarelli.com/>, IMT Lucca, Italy
Gourab Ghoshal <http://www.pas.rochester.edu/people/faculty/ghoshal_gourab/>,
Rochester University, NY, US
Ancsa Hannak <http://www.ccs.neu.edu/home/ancsaaa/>, Northeastern
University, MA, US
Agnes Emoke Horvat <http://www.agneshorvat.info/>, Northwestern University,
Vito Latora <http://www.maths.qmul.ac.uk/~latora/>, Queen Mary, University
of London, UK
Jörg Menche <https://sites.google.com/site/joergmenche/>, Austrian Academy
of Sciences, Austria
Staša Milojević <http://homes.soic.indiana.edu/smilojev/>, Indiana
University, IN, US
Lancaster University, UK
Giovanni Petri <https://lordgrilo.github.io/>, Fondazione ISI, Italy
Zoltan Toroczkai <http://obelix.phys.nd.edu/~toro/>, Notre Dame University,
Call for Contributions:
You are cordially invited to submit an abstract to CompleNet 2017, 8th
Conference on Complex Networks (www.complenet.org), to take place in
Dubrovnik, Croatia, on March 21st-24th, 2017.
The 8th International Conference on Complex Networks (CompleNet 2017) aims
at bringing together researchers working on areas related to complex
networks. In the past two decades scientific research has witnessed a
remarkable increase on studies in this field. Applied to problems that
span from biology to computer sciences, from economics to social sciences,
complex networks are becoming increasingly pervasive in many fields of
science. It is this interdisciplinary nature of the complex networks
approach that the workshop is addressing.
Authors are encouraged to submit their contributions on their research in
complex networks. Both theoretical and applied contributions are of
interest. Specific topics of interest are (but not limited to):
Applications of Network Science
Behavioral & Social Influence
Community Structure in Networks
Complex Networks and Epidemics
Complex Networks and Mobility
Complex Networks in Biological Systems
Complex Networks in Technology
Complex networks in Statistical Mechanics
Dynamics on Networks
Ecological Networks and Food Webs
Emergence in Complex Networks
Geometry in Complex Networks
Impact and Success Prediction
Information Spreading in Social Media
Interacting Social Networks
Link Analysis and Ranking
Modeling Human Behavior in Complex Networks
Models of Complex Networks
Network Controllability and Observability
Networks as Frameworks
Science of science
Search in Complex Networks
Shocks and Bursts
Structural Network Properties and Analysis
Synchronization in Networks
Language Variation and Language Use
Language and Information Diffusion
Kind of contribution
Complenet accepts contributions in the form of papers (not previously
published) and of original abstracts. Each submitted paper/abstract will be
fully refereed and undergo a blind review process by at least two referees.
Publication (for Papers only)
Accepted papers (not abstracts) of registered authors will be published in
the workshop proceedings to be published by Springer-Verlag on the Studies
in Computational Intelligence Series.
Accepted abstracts of registered authors will be presented at the
conference but the abstract will not be included in the Springer-Verlag book
Presentation at Complenet
Submitted contributions (papers and abstracts) can be accepted either for
an oral presentation or for a poster presentation.
Instructions for Submissions
Submit your paper/abstract electronically in PDF format. No hardcopy or fax
submissions will be accepted. Submissions should be printable on a standard
printer on common paper formats such as A4 or letter.
Papers: Submitted papers should follow the Springer format
should not exceed 12 pages in this format. Some works may be accepted as
short papers and shall be limited to 8 pages maximum.
Abstracts: the length of the abstract should be minimum one page and
maximum 2 pages. It should include one figure with caption within the two
Link for contribution:
All contributions should be submitted electronically through easychair
<https://easychair.org/conferences/?conf=complenet17>. You can find
detailed information about the submission process on the conference web site
Vinko Zlatic, Institute Rudjer Boskovic, Croatia
Roberta Sinatra, CNS and Math Department, Central European University,
Bruno Gonçalves, NYU Center for Data Science, USA
Visit the Committee Page on http://complenet.weebly.com/
Alex Arenas, Universitat Rovira i Virgili, Spain
Giuseppe Mangioni, University of Catania, Italy
José Mendes, University of Aveiro, Portugal
Ronaldo Menezes, Florida Institute of Technology, USA
Stephen Uzzo, NY Hall of Science, USA
Main Contact for CompleNet 2017
More information are available at: http://complenet.weebly.com/
Bruno Miguel Tavares Gonçalves, PhD
My name is Eddie Avila and I’m with the organization Global Voices. I’ve been lurking for the past couple of months getting a feel of different experiences in research around Wikimedia-related topics.
We were recently awarded a Project Grant from the Wikimedia Foundation to conduct a research project examining the current state of active and incubator Wikipedias in indigenous languages of Latin America with a special focus on identifying ongoing challenges for native-speaking editors and the demand for this content in their communities.
I believe some members of this list provided some feedback on the proposal, which was greatly appreciated. The project is still a work in progress and the final, refined proposal will be added to the original proposal. We are looking forward to adding the Lead Researcher to the team to help us better define the parameters and design the methodology.
Anyway, we recently published a call for applications (it is in Spanish because of the language requirement associated with the position), but if you or someone you might know would be interested in applying, please send them this link. Deadline is next Friday, Nov. 4.
Global Voices Busca Investigador/a Principal Para Estudio Sobre Wikipedia en Lenguas Indígenas <http://bit.ly/wikilenguas>
Looking forward to providing updates along the way.
- - - - - - - - - - - -
Director, Rising Voices
I wanted to reach out to you to invite you to attend the Wikimedia
Developer Summit. It's an annual meeting to push the evolution of
MediaWiki and other technologies supporting the Wikimedia movement. The
next edition will be held in San Francisco on January 9-11, 2017.
Registration is free. Travel scholarships can be applied for through
Monday, Oct 24th.
I'm specifically reaching out to this list because there has been a lot of
Wiki Research interested in *algorithms and the governing roles they play
in open communities like Wikipedia*. I'll be facilitating a core topic on
Algorithms at the dev summit. So far, I've proposed a few different
sessions that I think researchers will be particularly interested in.
- Building an AI wishlist for Wikimedia Projects
- Algorithmic dangers and transparency -- Best practices
- Where to surface AI in Wikimedia Projects
If your participation and/or independent session proposals are welcome!
- Monday, October 24: This is the last day to request travel
sponsorship. Applying takes less than five minutes.
- Monday, October 31: This is the last day to propose an activity. Bring
the topics you care about!
On-wiki discussion here: https://www.mediawiki.org/wiki
Please feel free to forward this email to anyone who might be interested in
-Aaron, the Half Acre Science Maker
due to a severe kernel vulnerability (https://access.redhat.com/
security/vulnerabilities/2706661) I need to reboot the stat1002, stat1003
and stat1004 hosts to install the new kernel. The reboots are scheduled for
9 AM CEST tomorrow (Oct 21st), please follow up with me or anybody in the
Analytics team if you have ongoing work that can't be stopped.
The Analytics Hadoop and Kafka clusters will be rebooted too during the
next hours. Event if this maintenance shouldn't cause any major issue, you
might experience some service degradation. More up to date information on
IRC in the analytics and operations channels.
Thanks and apologies in advance for the trouble!
The next Research Showcase will be live-streamed this Wednesday, October
19, 2016 at 11:30 AM (PST) 18:30 (UTC).
Link for remote presenters to join the Hangout on Air:
As usual, you can join the conversation on IRC at #wikimedia-research. And,
you can watch our past research showcases here
YouTube stream: https://www.youtube.com/watch?v=cBImUZ_si5s
This month's showcase includes.
Human centered design for using and editing structured data in Wikipedia
infoboxesBy *Charlie Kritschmar
Intern, Wikimedia Deutschland
<https://meta.wikimedia.org/wiki/Wikimedia_Deutschland>*Wikidata is a
Wikimedia project which stores structured data to be used by other
Wikimedia projects like Wikipedia. Currently, integrating its data in
Wikipedia is difficult for users, since there’s no predefined way to do so
and requires some technical knowledge. To tackle these issues,
human-centered design methods were applied to find needs from which
solutions were generated and evaluated with the help of the community. The
concept may serve as a basis which may be implemented into various Wiki
projects in the future to make editing Wikidata from within another
Wikimedia project more user-friendly and improve the project’s acceptance
in the community.
Emergent Work in WikipediaBy *Ofer Arazy
<http://oferarazy.com/> (University of Haifa)*Online production communities
present an exciting opportunity for investigating novel organizational
forms. Extant theoretical accounts of knowledge co-production point to
organizational policies, norms, and communication as key mechanisms
enabling the coordination of work. Yet, in practice participants in
initiatives such as Wikipedia are often occasional contributors who are
unaware of community policies and do not communicate with other members.
How then is work coordinated and how does the organization maintain
stability in the face of dynamics in individuals’ task enactment? In this
study we develop a conceptualization of emergent roles - the prototypical
activity patterns that organically emerge from individuals’ spontaneous
actions – and investigate the temporal dynamics of emergent role behaviors.
Conducing a multi-level large-scale empirical study stretching over a
decade, we tracked co-production of a thousand Wikipedia articles, logging
two hundred thousand distinct participants and seven hundred thousand
co-production activities. Using a combination of manual tagging and machine
learning, we annotated each activity type, and then clustered participants’
activity profiles to arrive at seven prototypical emergent roles. Our
analysis shows that participants’ behavior is turbulent, with substantial
flow in and out of co-production work and across roles. Our findings at the
organizational level, however, show that work is organized around a highly
stable set of emergent roles, despite the absence of traditional
stabilizing mechanisms such as pre-defined work procedures or role
expectations. We conceptualize this dualism in emergent work as “Turbulent
Stability”. Further analyses suggest that co-production is
artifact-centric, where contributors mutually adjust according to the
artifact’s changing needs. Our study advances the theoretical
understandings of self-organizing knowledge co-production and particularly
the nature of emergent roles.
Hope to see you there!
Sarah R. Rodlund
Senior Project Coordinator-Engineering, Wikimedia Foundation