Hi everyone,
We’re preparing for the November 2019 research newsletter and looking for contributors. Please take a look at https://etherpad.wikimedia.org/p/WRN201911 and add your name next to any paper you are interested in covering. Our writing deadline is 27 November 23:59 UTC. If you can't make this deadline but would like to cover a particular paper in the subsequent issue, leave a note next to the paper's entry below. As usual, short notes and one-paragraph reviews are most welcome.
Highlights from this month:
- A Forensic Qualitative Analysis of Contributions to Wikipedia from Anonymity Seeking Users
- All Talk: How Increasing Interpersonal Communication on Wikis May Not Enhance Productivity
- Analysis of Data Persistence in Collaborative Content Creation Systems: The Wikipedia Case
- Analyzing Wikipedia Deletion Debates with a Group Decision-Making Forecast Model
- Collaboration Drives Individual Productivity
- Does Sleep Deprivation Cause Online Incivility? Evidence from a Natural Experiment
- Extracting Literal Assertions for DBpedia from Wikipedia Abstracts
- How Does Editor Interaction Help Build the Spanish Wikipedia?
- Knowledge Graphs and Knowledge Networks: The Story in Brief
- Online Disinformation and the Role of Wikipedia
- Public Archaeology's Mammoth in the Room: Engaging Wikipedia as a Tool for Teaching and Outreach
- Revision Classification for Current Events in Dutch Wikipedia Using a Long Short-Term Memory Network
- The Dynamics of Peer-Produced Political Information During the 2016 U.S. Presidential Campaign
- The Roles Bots Play in Wikipedia
- Transforming Wikipedia into Augmented Data for Query-Focused Summarization
- Weakly Supervised Multilingual Causality Extraction from Wikipedia
- Wiktionary matcher
Masssly and Tilman Bayer
[1] http://meta.wikimedia.org/wiki/Research:Newsletter[2] WikiResearch (@WikiResearch) | Twitter
Hello,
As part of an AHRC research network, I conduct a survey about
Internet/media utopias.
In the time of the Cambridge Analytica scandal and fake news, we
experience a crisis of Internet platforms. Many people think we need
Internet and media utopias today. But how could they look like?
People interested in Wikipedia might have good ideas...
I want to invite you to participate:
https://psmutopias.limequery.net/879161
Answering will take about five minutes. A number of participants with
very visionary ideas will be invited to a workshop in 2020 in London,
where participants will work on co-writing/co-authoring an
Internet/Media Utopias Manifesto.
Kind regards, Christian Fuchs
--
Prof. Christian Fuchs
University of Westminster,
Director of the Communication and Media Research Institute
http://www.camri.ac.uk
@fuchschristian
Hi all,
The next Research Showcase will be live-streamed on Wednesday, November 20,
2019, at 9:30 AM PST/17:30 UTC. We’ll have a presentation from Martin
Potthast of Leipzig University on text reuse in Wikipedia and other
presentation from the Wikimedia Foundation’s Isaac Johnson on the
demographics and interests of Wikipedia’s readers.
YouTube stream: https://www.youtube.com/watch?v=tIko_V1k09s
As usual, you can join the conversation on IRC at #wikimedia-research. You
can also watch our past research showcases here:
https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
This month's presentations:
Wikipedia Text Reuse: Within and Without
By Martin Potthast, Leipzig University
We study text reuse related to Wikipedia at scale by compiling the first
corpus of text reuse cases within Wikipedia as well as without (i.e., reuse
of Wikipedia text in a sample of the Common Crawl). To discover reuse
beyond verbatim copy and paste, we employ state-of-the-art text reuse
detection technology, scaling it for the first time to process the entire
Wikipedia as part of a distributed retrieval pipeline. We further report on
a pilot analysis of the 100 million reuse cases inside, and the 1.6 million
reuse cases outside Wikipedia that we discovered. Text reuse inside
Wikipedia gives rise to new tasks such as article template induction,
fixing quality flaws, or complementing Wikipedia’s ontology. Text reuse
outside Wikipedia yields a tangible metric for the emerging field of
quantifying Wikipedia’s influence on the web. To foster future research
into these tasks, and for reproducibility’s sake, the Wikipedia text reuse
corpus and the retrieval pipeline are made freely available. Paper
<https://webis.de/publications.html#?q=wikipedia%20ecir%202019>, Demo
<https://demo.webis.de/wikipedia-text-reuse/>
Characterizing Wikipedia Reader Demographics and Interests
By Isaac Johnson, Wikimedia Foundation
Building on two past surveys on the motivation and needs of Wikipedia
readers (Why We Read Wikipedia
<https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase#November_2016>; Why
the World Reads Wikipedia
<https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase#December_2018>),
we examine the relationship between Wikipedia reader demographics and their
interests and needs. Specifically, we run surveys in thirteen different
languages that ask readers three questions about their motivation for
reading Wikipedia (motivation, needs, and familiarity) and five questions
about their demographics (age, gender, education, locale, and native
language). We link these survey results with the respondents' reading
sessions -- i.e. sequence of Wikipedia page views -- to gain a more
fine-grained understanding of how a reader's context relates to their
activity on Wikipedia. We find that readers have a diversity of backgrounds
but that the high-level needs of readers do not correlate strongly with
individual demographics. We also find, however, that there are
relationships between demographics and specific topic interests that are
consistent across many cultures and languages. This work provides insights
into the reach of various Wikipedia language editions and the relationship
between content or contributor gaps and reader gaps. See the meta page
<https://meta.wikimedia.org/wiki/Research:Characterizing_Wikipedia_Reader_Be…>
for more details.
--
Janna Layton (she, her)
Administrative Assistant - Product & Technology
Wikimedia Foundation <https://wikimediafoundation.org/>
Hi all,
Wikimedia Research Showcase [1] is almost six years old and we're
using the birthday opportunity to step back and reflect on the past,
celebrate the contributions by more than 70 speakers and many of you
who participated in the discussions, and plan for its future.
We would like to ask for your input as we're thinking about the future
of the Research Showcases. We want to hear from those of you who
participated in the showcases and/or watched them, as well as those of
you who decided this is not something for you. :) In order to gather
your input, we have put together a survey that we'd appreciate if you
participate in.
Link to survey (please note that the link will take you to Google
Forms [2]): https://docs.google.com/forms/d/e/1FAIpQLSecgn8cMu5IfTYRgn93bfOiJVEIL09RRf_…
Your contributions to this survey can help us in our thinking as we
move forward. Please submit your responses by 2019-11-22.
Sincerely,
Jonathan Morgan and Leila Zia
Research, Wikimedia Foundation
[1] https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
[2] If you want to participate but not through Google Forms, ping me
off-list and I'll send you a pdf file you can fill and send back to me
(it won't be anonymous though. sorry.). I'm not attaching it to this
email as some lists may put my email in the moderation queue with an
attachment. (And I don't /think/ I can upload it to Commons.)
The October 2019 issue of the Wikimedia Research Newsletter is out:
https://meta.wikimedia.org/wiki/Research:Newsletter/2019/October
In this issue:
1 Research presentations at Wikimania 20191.1 "All Talk: How Increasing Interpersonal Communication on Wikis May Not Enhance Productivity"1.2 "Despite the [Tor] ban: doing good work anonymously on Wikipedia"1.3 Discussion summarization tool to help with Requests for Comments (RfCs) going stale1.4 "Hidden Gems in the Wikipedia Discussions: The Wikipedians' Rationales"1.5 "Characterizing Reader Behavior on Wikipedia"1.6 Wikipedia citations (footnotes) are only clicked on one of every 200 pageviews1.7 "Dwelling on Wikipedia Investigating time spent by global encyclopedia readers"1.8 "Wikipedia graph mining dynamic structure of collective memory1.9 Harmful content rare on English Wikipedia1.10 "Sockpuppet detection in the English Wikipedia"1.11 "Wiki-Atlas: Rendering Wikipedia Content through Cartographic and Augmented Reality Mediums"1.12 "Evidence of Dark Matter: Assessing the Contribution of Subject-matter Experts to Wikipedia"1.13 Why Apple's Siri relies on data from Wikipedia infoboxes instead of (just) Wikidata1.14 "Discovering Implicational Knowledge in Wikidata"1.15 "Analyzing the evolution of wikis with WikiChron"1.16 "State of Wikimedia Research 2018-2019"2 Other events3 Other recent publications3.1 "Revealing the Role of User Moods in Struggling Search Tasks"3.2 Helping students find a research advisor, with Google Scholar and Wikipedia3.3 "Uncovering the Semantics of Wikipedia Categories"3.4 "Adapting NMT to caption translation in Wikimedia Commons for low-resource languages"3.5 "Automatic Detection of Online Abuse and Analysis of Problematic Users in Wikipedia"3.6 "Self Attentive Edit Quality Prediction in Wikipedia"3.7 "TableNet: An Approach for Determining Fine-grained Relations for Wikipedia Tables"3.8 "Training and hackathon on building biodiversity knowledge graphs" with Wikidata3.9 "Spectral Clustering Wikipedia Keyword-Based Search Results"3.10 "Indigenous Knowledge for Wikipedia: A Case Study with an OvaHerero Community in Eastern Namibia"3.11 "On Persuading an OvaHerero Community to Join the Wikipedia Community"
*** 16 recent publications were covered or listed in this issue ***
Masssly and Tilman Bayer
---
Wikimedia Research Newsletterhttps://meta.wikimedia.org/wiki/Research:Newsletter/* Follow us on Twitter: @WikiResearch
* Like us on Facebook: Facebook.com/WikiResearch/
* Receive this newsletter by mail: Research-newsletter Mailing List - Wikimedia
Today we are releasing a new dataset meant to help us understand the impact
of grants and programs on editing. This data was requested several years
ago, and we took a long time to bring in the privacy and security experts
whose help we needed to release it. With that work done, you can download
the data here: https://dumps.wikimedia.org/other/geoeditors/ and read about
it here:
https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Edits/Geoeditors/Pu…
You can send questions or comments on this thread or on the discussion page.
Hi all,
Summary: I have made changes to the introduction description of this
mailing list. This includes changes to what this mailing list is used
for (reflecting more closely the reality of how it's been used for the
past couple of years), more expanded information about other lists
that you may want to be aware of, the mailing list norms, and the
introduction of topics and tags. You can review the updated
description at https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
. As a participant of this mailing list, you are responsible to know
the description of the mailing list and I kindly ask you to review it.
Thanks! :)
Long version:
==Who am I?==
I'm one of the admins of this list (and head of Research at Wikimedia
Foundation).
==Why the change?==
On a personal level: I want to be able to communicate with the
research community around Wikimedia projects more often, I don't want
to create a new list for my communications, and I saw the note about
the limits on the frequency of posts in the old description of this
list as something that "kept me out". Inspired by this need, I had a
more comprehensive look at the full description.
The previous version of the description [1] had a few areas to improve
based on the current needs and realities of this list. I name a couple
of them below:
* We had a note about the frequency of emails to this list to be kept
low. With the stack of technologies available to us today, we can
relax this condition and ask for people to use tags/topics in the
subject of their emails to allow for more emails to come to the list.
* The note on who can post to this list could be improved. It read
"only people who are actively involved in research on Wikimedia
projects should post to this list" while my understanding is that on
this list we want to welcome research related questions from those who
are not actively involved in research, too. For example, community
organizers should feel welcome to ask research-related questions on
this list.
==What process did I follow for this change?==
I wrote my proposed changes in an etherpad and sent it to 10 people.
These folks include the two other admins of this list, a couple of
other Research folks from WMF, a few folks from the Wikimedia Research
community, and two editors from the community who are active in the
research space. I heard back from 3 of these folks with thumbs up and
areas for improvement. I incorporated all the suggestions I received.
==What if you have suggestions for improvements?==
Sure. Bring them up on this thread or privately. Please note that I
will be in Wikidata Conference [2] and after that in our annual
Research Offsite and may not be able to respond to you until
2019-11-01.
==Where can I see the updated description?==
Please go to https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
to review it.
Thanks,
Leila
[1] Read the previous version of the description below:
The purpose of this mailing list is to discuss scientific research
into the content and the communities of the Wikimedia projects:
Wikipedia, Wiktionary, Wikibooks, Wikisource, Wikiquote, Wikinews,
Wikispecies, and Wikimedia Commons, Meta-Wiki.
Research into the technology of Wikimedia, MediaWiki, should primarily
be discussed on <a
href="http://mail.wikipedia.org/mailman/listinfo/wikitech-l">wikitech-l</A>
instead. For content or community research projects with a strong
technological component, cross-posting to both lists may be advisable.
Please note that only people who are actively involved in research on
Wikimedia projects should post to this list. Typical on-topic posts
include:
<UL><LI>announcement of a new research project
<LI>discussions of methodology
<LI>questions and answers about related projects
</UL>
Mailing list traffic should be kept at a reasonably low level. The
list is softly moderated, and individuals posting off-topic material
repeatedly may be removed.
This list is not directly associated with the <a
href="http://meta.wikimedia.org/wiki/Wikimedia_Research_Network">Wikimedia
Research Network</A>, though members of the Network are welcome to
post here if they are involved in research projects relating to
content or community. Internal Wikimedia matters, discussions of new
projects and similar threads should be kept off the list.
[2] https://www.wikidata.org/wiki/Wikidata:WikidataCon_2019
--
Leila Zia
Principal Research Scientist, Head of Research
Wikimedia Foundation
Hi everyone,
We’re preparing for the October 2019 research newsletter and looking for contributors. Please take a look a thttps://etherpad.wikimedia.org/p/WRN201910 and add your name next to any paper you are interested in covering. Our target publication date is 30 October 11:59 UTC. As usual, short notes and one-paragraph reviews are most welcome.
Highlights from this month:
- Science Is Shaped by Wikipedia: Evidence From a Randomized Control Trial
- “On Altpedias: partisan epistemics in the encyclopaedias of alternative facts”. In “After the post-truth
- A Clockwork Wikipedia: From a Broad Perspective to a Case Study
- A deep learning-based quality assessment model of collaboratively edited documents: A case study of Wikipedia
- Finding Synonymous Attributes in Evolving Wikipedia Infoboxes
- GeneDB and Wikidata
- Reader Engagement with Wikipedia’s Medical Content
- Science Is Shaped by Wikipedia: Evidence From a Randomized Control Trial
- Sepsis information-seeking behaviors via Wikipedia between 2015 and 2018: A mixed methods retrospective observational study
- The Global Popularity of William Shakespeare in 303 Wikipedias
Masssly and Tilman Bayer
[1] http://meta.wikimedia.org/wiki/Research:Newsletter[2]https://twitter.com/WikiResearch
The September 2019 issue of the Wikimedia Research Newsletter is out:
https://meta.wikimedia.org/wiki/Research:Newsletter/2019/September
In this issue:
1 "Reducing Procrastination While Improving Performance: A Wiki-powered Experiment with Students"2 The Importance of Wikipedia in Assessing News Source Credibility3 Wikipedia Topic Assessment4 OpenSym 20194.1 First literature survey of Wikidata quality research4.2 "Article Quality Classification on Wikipedia: Introducing Document Embeddings and Content Features"4.3 "When Humans and Machines Collaborate: Cross-lingual Label Editing in Wikidata"4.4 "Approving automation: analyzing requests for permissions of bots in Wikidata"4.5 "Dwelling on Wikipedia: Investigating Time Spent by Global Encyclopedia Readers"4.6 "Visualization of the Evolution of Collaboration and Communication Networks in Wikis"4.7 Wikitribune navigating "challenges of collaborative evidence-based journalism"5 Conferences and events6 Other recent publications6.1 "Eliciting New Wikipedia Users' Interests via Automatically Mined Questionnaires: For a Warm Welcome, Not a Cold Start"6.2 Shocks make both newcomers and experienced editors contribute more6.3 "Crosslingual Document Embedding As Reduced-Rank Ridge Regression"6.4 "Framing the Holocaust Online: Memory of the Babi Yar Massacres on Wikipedia"6.5 "Framing the Holocaust in popular knowledge: 3 articles about the Holocaust in English, Hebrew and Polish Wikipedia"
*** 15 recent publications were covered or listed in this issue ***
Masssly and Tilman Bayer
---
Wikimedia Research Newsletterhttps://meta.wikimedia.org/wiki/Research:Newsletter/* Follow us on Twitter: @WikiResearch
* Like us on Facebook: Facebook.com/WikiResearch/
* Receive this newsletter by mail: Research-newsletter Mailing List - Wikimedia
CFP: International Conference on Social Media and Society (#SMSociety)2020
Theme – “Diverse Voices: Promises and Perils of Social Media For Diversity”
Join us on July 22–24, 2020 for the 11th annual International Conference on
Social Media and Society (#SMSociety)
<https://socialmediaandsociety.us3.list-manage.com/track/click?u=b32270f9582…>.
The conference is an interdisciplinary gathering of leading social media
researchers, practitioners, and analysts from around the world. The 2020
conference is hosted by the College of Communication
<https://socialmediaandsociety.us3.list-manage.com/track/click?u=b32270f9582…>
, Studio Chi
<https://socialmediaandsociety.us3.list-manage.com/track/click?u=b32270f9582…>
and
the College of Computing and Digital Media
<https://socialmediaandsociety.us3.list-manage.com/track/click?u=b32270f9582…>
at DePaul University
<https://socialmediaandsociety.us3.list-manage.com/track/click?u=b32270f9582…>
in
Chicago, USA.
Until a decade and a half ago, large media companies and governments lorded
over an oligopoly controlling the means for communicating with the masses.
That oligopoly was (temporarily?) broken by the introduction of social
media with its revolutionary promise to democratize civil discourse and
civil society. It has offered diverse groups the opportunity to connect to
one another and form communities of practice that ultimately can serve to
strengthen their voice. However, in recent years, we have also discovered
that connecting the world via social media leads to new challenges. When so
many diverse voices are brought together on a massive scale, conflict is
common. Interpretation of this conflict is itself diverse: do we see a rise
of incivility or freedom from moral policing? Extremism or idealism?
Distrust or critique? The very same digital tools that amplify voices of
the marginalized can also be used to silence diverse voices online through
online harassment, doxing, trolling, and other measures.
In this context, the *International Conference on Social Media & Society*
<https://socialmediaandsociety.us3.list-manage.com/track/click?u=b32270f9582…>
invites
scholarly and original submissions that explore key questions and central
issues related (but not limited) to the 2020 theme of “*Diverse Voices:
Promises and Perils of Social Media For Diversity**.*” We welcome research
from a wide range of methodological perspectives employing established
quantitative, qualitative, and mixed methods as well as innovative
approaches that cross disciplinary boundaries and expand our understanding
of the current and future trends in social media research, especially
research that seeks to explore questions such as:
- Can human society handle the connection of diverse and divergent
(often conflicting) voices on social media? Is empowering every voice via
social media a net good?
- Under what conditions can social media build bridges across
difference? When does social media reinforce division?
- What social media affordances are good or bad in terms of supporting
diversity, including but not limited to: race, class, gender, sexual,
cultural, political and linguistic diversity?
- What is the role of government regulation? Should social media be
limited to activities that help to build strong societies?
- Is it naive to think that giving everyone a voice will result in
increased democracy?
- And what is the role of social media in all of these? And how can
social media platforms be used to empower marginalized voices?
- Can AI and other automated tools help social media consumers and
producers to overcome these challenges and provide online spaces for
engagement with diverse voices? If so, how?
*IMPORTANT DATES*
- *Full papers** (6-10 pages) Due: *Jan. 27, 2020*
- *WIP papers* (1000-word extended abstract) Due: *Jan. 27, 2020*
- *Panels, Workshops, & Posters* Due: *Mar. 16, 2020*
*Full papers presented at the Conference will be published in the
conference proceedings by *ACM International Conference Proceeding Series*
(ICPS)
<https://socialmediaandsociety.us3.list-manage.com/track/click?u=b32270f9582…>
and
will be available in the ACM Digital Library.
*SUBMISSION DETAILS*:
https://socialmediaandsociety.org/submit/
<https://socialmediaandsociety.us3.list-manage.com/track/click?u=b32270f9582…>
*ABOUT THE CONFERENCE*
The International Conference on Social Media & Society
<https://socialmediaandsociety.us3.list-manage.com/track/click?u=b32270f9582…>
(#SMsociety)
is an annual gathering of leading social media researchers from around the
world. It is the premier venue for sharing and discovering new
peer-reviewed interdisciplinary research on how social media affects
society. Organized by the Social Media Lab
<https://socialmediaandsociety.us3.list-manage.com/track/click?u=b32270f9582…>
at Ted Rogers School of Management
<https://socialmediaandsociety.us3.list-manage.com/track/click?u=b32270f9582…>
at Ryerson University
<https://socialmediaandsociety.us3.list-manage.com/track/click?u=b32270f9582…>,
#SMSociety provides participants with opportunities to exchange ideas,
present original research, learn about recent and ongoing studies, and
network with peers.
The conference’s intensive three-day program features hands-on workshops,
full papers, work-in-progress papers, panels, and posters. The wide-ranging
topics in social media showcase research from scholars working in many
fields including Communication, Computer Science, Education, Journalism,
Information Science, Management, Political Science, Psychology and
Sociology.
*TOPICS OF INTEREST (Not Exhaustive)*
SOCIAL MEDIA IMPACT ON SOCIETY
- Privacy
- Trust & credibility
- Political mobilization & engagement
- Extremism & terrorism
- Dis/Mis/Mal-information, aka…”Fake news”
- Politics of hate and oppression
- Health and well-being
SOCIAL MEDIA & BUSINESS
- Brand communities
- Influencers and consumer engagement
- Consumer behavior & social media marketing
- Public & customer relations
- Cybervetting and HR
SOCIAL MEDIA & PUBLIC SECTOR
- Government regulations of social media
- Government social media management
- Adoption, use, strategies and policies
- Citizens’ engagement
- Citizens’ privacy & security concerns
- Public opinions on environmental issues
SOCIAL MEDIA & ACADEMIA
- Alternative metrics
- Learning analytics
- Teaching with social media
- University branding
- Knowledge Translation/management
ONLINE/OFFLINE COMMUNITIES
- Online community detection
- Influential user detection
- Identity and anonymity
- Case studies
THEORIES & METHODS
- Qualitative approaches
- Quantitative approaches
- Mixed methods
- Opinion mining & sentiment analysis
- Social network analysis
- Theoretical models
BIG & SMALL DATA
- Value of small data
- Data mining and analytics
- Sampling issues
- Visualization
- Scalability issues
- Ethics
- Data Access/Scraping
- Data Biases
*ORGANIZING COMMITTEE*
- Anatoliy Gruzd, Ryerson University, Canada – Conference Chair
- Philip Mai, Ryerson University, Canada – Conference Chair
- Raquel Recuero, Universidade Federal de Pelotas (UFPel), Brazil – Full
Paper Chair
- Ángel Hernández-García, Universidad Politécnica de Madrid, Spain –
Full Paper Chair
- Chei Sian Lee, Nanyang Technological University, Singapore – WIP Chair
- James Cook, University of Maine at Augusta, USA – WIP Chair
- Jaigris Hodson, Royal Roads University, Canada – Poster Chair
*HOST COMMITTEE at DePaul University*
- Bree McEwan, Communication Studies, Communication & Technology
- Jill Hopke, Journalism
- Hamed Qahri Saremi, Management Information Systems
- Juan Mundel, Advertising
- Paul Booth, Digital Communication & Media Arts
- Enid Montague, Human Computer Interaction
- Samantha Close, Digital Communication & Media Arts
- Nur Uysal, Public Relations