Dear Wiki Community,
My name is Mackenzie Lemieux and I am a neuroscience researcher at the Salk
Institute for Biological Studies and I am interested in exploring biases on
My research hypothesis is that gender or ethnicity mediate the rate of
flagging and deletion of pages for women in STEM. I hope to
retrospectively analyze Wikipedia's deletion history, harvest the
biographical articles about scientists that have been created over the past
n years and then confirm the gender and ethnicity of a large sample.
It appears that we can identify deleted pages with Wikipedia's deletion log
<https://en.wikipedia.org/wiki/Wikipedia:Deletion_log>, but to actually see
the page that was deleted we need to be members of one of these Wikipedia
user groups: Administrators
Does anyone have advice on how to obtain researcher status or is there
anyone willing to collaborate who has access to the data we need?
220 Gilmour Avenue
Join the Research Team at the Wikimedia Foundation  for their monthly
Office hours on 2020-09-01 at 16.00-17.00 (UTC).
Through these office hours, we aim to make ourselves more available to
answer some of the research related questions that you as Wikimedia
volunteer editors, organizers, affiliates, staff, and researchers face in
your projects and initiatives (*).
To participate, join the video-call via this Wikimedia-meet link . There
is no set agenda - feel free to add your item to the list of topics in the
etherpad  (You can do this after you join the meeting, too.), otherwise
you are welcome to also just hang out. More detailed information (e.g.
about how to attend) can be found here .
Started in the beginning of 2020 as an experiment , after the first 6
editions we have evaluated the scope and format of the Research office
hours. In order to decrease barriers of accessibility and to facilitate
more direct interaction, we have switched the format from IRC to video
call. We will re-evaluate the current format at the end of the year. We
would also be glad to hear your feedback and/or comments.
(*) Some example cases we hope to be able to support you in:
You have a specific research related question that you suspect you
should be able to answer with the publicly available data and you don’t
know how to find an answer for it, or you just need some more help with it.
For example, how can I compute the ratio of anonymous to registered editors
in my wiki?
You run into repetitive or very manual work as part of your Wikimedia
contributions and you wish to find out if there are ways to use machines to
improve your workflows. These types of conversations can sometimes be
harder to find an answer for during an office hour, however, discussing
them can help us understand your challenges better and we may find ways to
work with each other to support you in addressing it in the future.
You want to learn what the Research team at the Wikimedia Foundation
does and how we can potentially support you. Specifically for affiliates:
if you are interested in building relationships with the academic
institutions in your country, we would love to talk with you and learn
more. We have a series of programs that aim to expand the network of
Wikimedia researchers globally and we would love to collaborate with those
of you interested more closely in this space.
You want to talk with us about one of our existing programs .
Hope to see many of you,
Martin (WMF Research Team)
Hope all is well. Does anyone know of published statistics for the number
of registered editors per country? I need it for 2016, but any year close
to that would suffice.
Your help is greatly appreciated,
School of Geography and Development
University of Arizona
We’re preparing for the August 2020 research newsletter and looking for
contributors. Please take a look at
https://etherpad.wikimedia.org/p/WRN202008 and add your name next to any
paper you are interested in covering. Our target publication time is 30
August 15:59 UTC. If you can't make this deadline but would like to cover a
particular paper in the subsequent issue, leave a note next to the paper's
entry below. As usual, short notes and one-paragraph reviews are most
*Highlights from this month:*
- Broadening African Self-Representation on Wikipedia: A Field Experiment
- Characterizing Online Vandalism: A Rational Choice Perspective
- Commonsense Knowledge in Wikidata
- Impact of individual actions on the collective response of social
- Multiple Texts as a Limiting Factor in Online Learning: Quantifying
(Dis-)similarities of Knowledge Networks across Languages
- Notable Site Recognition using Deep Learning on Mobile and
- Protecting the Web from Misinformation
- Ripples on the web: Spreading lake information via Wikipedia
- Successful Online Socialization: Lessons from the Wikipedia Education
- Wikipedia, COVID-19, and readers' interests across languages
- Wikipedia, The Free Online Medical Encyclopedia Anyone Can Plagiarize:
Time to Address Wiki-Plagiarism
- WiTPy: A Toolkit to Parse and Analyse Wikipedia Talk Pages
Masssly and Tilman Bayer
 WikiResearch (@WikiResearch) | Twitter
Announcement: September 10 Zoom Webinar
Wiki Credibility Tech Demo Hour
The WikiCred grants initiative was launched earlier this year with a focus
on information credibility/reliability and new tools on open-knowledge
platforms like Wikipedia and Wikidata.
The Demo Hour will be held on September 10th at 12 pm EDT where 6 of our
grantees will demo their Wikipedia projects. You can register for the
online event here
WikiCred is a grants programme managed by the Credibility Coalition, Hacks
Hackers, members of Wikimedia DC, and other groups. We support research,
software projects, and Wikimedia events that explore information
reliability and credibility in the Wikimedia space and the overall online
So far, we have funded 12 projects over 3 rounds of funding. You can find a
full list of the projects here <http://wikicred.org/#projects> or our own
blog <https://misinfocon.com/search?q=wikicred>. Projects range from
strengthening credible vaccine content on Wikipedia to automating the
additions of references to Wikidata.
6 grantees will join us to present their work for 5 mins each followed by a
Q&A from the attendees.
Let me know if you have any questions about the project or the event.
Ahmed & WikiCred Team
Partners <https://www.wikicred.org/#who-we-are> / Projects
<http://wikicred.org/#projects> / Funders <http://wikicred.org/#funders>
The next Research Showcase will be live-streamed on Wednesday, August 19,
at 9:30 AM PDT/16:30 UTC, and will be on the theme of readership and
YouTube stream: https://www.youtube.com/watch?v=MeUl0zjHdF8
As usual, you can join the conversation on IRC at #wikimedia-research. You
can also watch our past research showcases here:
This month's presentations:
What matters to us most and why? Studying popularity and attention dynamics
via Wikipedia navigation data.
By Taha Yasseri (University College Dublin), Patrick Gildersleve (Oxford
While Wikipedia research was initially focused on editorial behaviour or
the content to a great extent, soon researchers realized the value of the
navigation data both as a reflection of readers interest and, more
generally, as a proxy for behaviour of online information seekers. In this
talk we will report on various projects in which we utilized pageview
statistics or readers navigation data to study: movies financial success
, electoral popularity , disaster triggered collective attention 
and collective memory , general navigation patterns and article typology
, and attention patterns in relation to news breakouts.
 Early Prediction of Movie Box Office Success Based on Wikipedia
Activity Big Data. PLoS One (2013).
 Wikipedia traffic data and electoral prediction: towards
theoretically informed models. EPJ Data Science (2016).
 Dynamics and biases of online attention: the case of aircraft
crashes. Royal Society Open Science (2016).
 The memory remains: Understanding collective memory in the digital
age. Science Advances (2018). https://doi.org/10.1126/sciadv.1602368
 Inspiration, captivation, and misdirection: Emergent properties in
networks of online navigation. Springer (2018).
Query for Architecture, Click through Military. Comparing the Roles of
Search and Navigation on Wikipedia
By Dimitar Dimitrov (GESIS - Leibniz Institute for the Social Sciences)
As one of the richest sources of encyclopedic information on the Web,
Wikipedia generates an enormous amount of traffic. In this paper, we study
large-scale article access data of the English Wikipedia in order to
compare articles with respect to the two main paradigms of information
seeking, i.e., search by formulating a query, and navigation by following
hyperlinks. To this end, we propose and employ two main metrics, namely (i)
searchshare -- the relative amount of views an article received by search
--, and (ii) resistance -- the ability of an article to relay traffic to
other Wikipedia articles -- to characterize articles. We demonstrate how
articles in distinct topical categories differ substantially in terms of
these properties. For example, architecture-related articles are often
accessed through search and are simultaneously a "dead end" for traffic,
whereas historical articles about military events are mainly navigated. We
further link traffic differences to varying network, content, and editing
activity features. Lastly, we measure the impact of the article properties
by modeling access behavior on articles with a gradient boosting approach.
The results of this paper constitute a step towards understanding human
information seeking behavior on the Web.
Different Topic, Different Traffic: How Search and Navigation Interplay
on Wikipedia. Journal of Web Science (2019).
Janna Layton (she/her)
Administrative Associate - Product & Technology
Wikimedia Foundation <https://wikimediafoundation.org/>
I'm hoping you & yours are all well, safe and healthy in these
unprecedented times we have all found ourselves in.
I'm writing you today with my 'educator & researcher' hat on, with a
special request to help Piotr Konieczny & I spread the word about a new
global research we are conducting.
While using Wikimedia-related assignments (Wikipedia, Commons, WikiBooks,
WikiSource, Wikidata, Wiktionary etc) in the classroom has been used all
over the world for over a decade, very little research was conducted about
what instructors who have tried it actually think about the experience.
We are hoping that answering the questions in the survey will help us
- Whether this teaching approach is effective (or not)
- What are some of the challenges experienced by instructors
- How the process could be improved
The questions are meant for any instructors running a wiki assignment,
whether it is in k-12 or higher education, formal or informal educational
setting. We are hoping the results will allow us to globally share
experiences and learn from one another, so we can make it smoother, easier
and more effective for educators joining these efforts.
It is important to note that this would be the first time (that we know
of!) that an academic research of this type has been conducted around the
world, so we really need your help in spreading the word about it in your
local communities. We're hoping that any of you, supporting such
initiatives around the world over the years, would forward it to your local
Education contacts and ask them to participate. The more instructors
participating, the better.
We realize that it would have been great to have the questionnaire in a
variety of languages, but in order for us to process the data properly and
not via third-party translations and keep the anonymity and privacy of
participants, it was decided to release the survey just in English.
Here is a link to the survey - https://tinyurl.com/yd6dfata
Thank you all in advance, and of course, if there are any questions, Piotr
& I are here.
Stay healthy & safe!
*Shani Evenstein Sigalov*
* Lecturer, Tel Aviv University.
* EdTech Innovation Strategist, NY/American Medical Program, Sackler School
of Medicine, Tel Aviv University.
* PhD Candidate, School of Education, Tel Aviv University.
* Azrieli Foundation Research Fellow.
* OER & Emerging Technologies Coordinator, UNESCO Chair
<https://education.tau.ac.il/node/3495> on Technology, Internationalization
and Education, School of Education, Tel Aviv University
* Member of the Board of Trustees
* Chairperson, The Hebrew Literature Digitization Society
* Chief Editor, Project Ben-Yehuda <http://benyehuda.org>.
Apologies for cross-posting
Over the last year, the DBpedia core team has consolidated great amount
of technology around DBpedia. This tutorial is targeted for developers
(in particular of DBpedia Chapters) that wish to learn how to replicate
local infrastructure such as loading and hosting an own SPARQL endpoint.
A core focus will also be the new DBpedia Stack, which contains several
dockerized applications that are automatically loading data from the
DBpedia databus. The second tutorial will be held on September 2nd, 2020
at 17:00 CEST and it will cover the following topics:
- Using Databus collections (Download)
- Creating customized Databus collections
- Uploading data to the Databus
- Using collections in Databus-ready Docker applications
- Creating dockerized applications for the DBpedia Stack
- Web URL: https://wiki.dbpedia.org/tutorials/2nd-dbpedia-stack-tutorial
- When: September 2nd, 2020 at 17:00-18:00 CEST
- Where: The tutorial will be organized online.Registration is required
- Databus: https://databus.dbpedia.org/
Attending the DBpedia Stack tutorial is free. Registration is required
though. After the registration for the event, you will receive an email
with more instructions. Please register here to be part of the meeting:
- Milan Dojchinovski, AKSW/KILT, DBpedia Association
- Jan Forberg, AKSW/KILT, DBpedia Association
- Julia Holze, InfAI, DBpedia Association
- Sebastian Hellmann, AKSW/KILT, DBpedia Association
We are looking forward to meeting you online!
With kind regards,
The DBpedia Team