Wiki-research-l September 2020

wiki-research-l@lists.wikimedia.org

16 participants
13 discussions

Wikipedia Research policy
by song＠cs.umn.edu 14 Jul '23

Pursuant to prior discussions about the need for a research policy on Wikipedia, WikiProject Research is drafting a policy regarding the recruitment of Wikipedia users to participate in studies. At this time, we have a proposed policy, and an accompanying group that would facilitate recruitment of subjects in much the same way that the Bot Approvals Group approves bots.

The policy proposal can be found at: http://en.wikipedia.org/wiki/Wikipedia:Research

The Subject Recruitment Approvals Group mentioned in the proposal is being described at: http://en.wikipedia.org/wiki/Wikipedia:Subject_Recruitment_Approvals_Group

Before we move forward with seeking approval from the Wikipedia community, we would like additional input about the proposal, and would welcome additional help improving it. Also, please consider participating in WikiProject Research at: http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Research

--
Bryan Song
GroupLens Research
University of Minnesota

8 10

Requesting access to deleted pages for research purposes
by Mackenzie Lemieux 02 Feb '21

Dear Wiki Community, My name is Mackenzie Lemieux and I am a neuroscience researcher at the Salk Institute for Biological Studies and I am interested in exploring biases on Wikipedia. My research hypothesis is that gender or ethnicity mediate the rate of flagging and deletion of pages for women in STEM. I hope to retrospectively analyze Wikipedia's deletion history, harvest the biographical articles about scientists that have been created over the past n years and then confirm the gender and ethnicity of a large sample. It appears that we can identify deleted pages with Wikipedia's deletion log <https://en.wikipedia.org/wiki/Wikipedia:Deletion_log>, but to actually see the page that was deleted we need to be members of one of these Wikipedia user groups: Administrators <https://en.wikipedia.org/wiki/Wikipedia:Administrators>, Oversighters <https://en.wikipedia.org/wiki/Wikipedia:Oversight>, Researchers <https://en.wikipedia.org/wiki/Wikipedia:Researchers>, Checkusers <https://en.wikipedia.org/wiki/Wikipedia:CheckUser>. Does anyone have advice on how to obtain researcher status or is there anyone willing to collaborate who has access to the data we need? Warmly, Mackenzie Lemieux -- Mackenzie Lemieux mackenzie.lemieux(a)gmail.com cell: 416-806-0041 220 Gilmour Avenue Toronto, Ontario M6P 3B4
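The first step described above, identifying deleted pages from the deletion log, could be sketched as follows. This is only an illustration under stated assumptions: it builds parameters for the MediaWiki Action API's `list=logevents` query with `letype=delete`, and the helper name, defaults, and date range are hypothetical. It lists log entries only; viewing the content of a deleted page still requires the user rights mentioned in the message.

```python
from urllib.parse import urlencode

# English Wikipedia's Action API endpoint.
API_ENDPOINT = "https://en.wikipedia.org/w/api.php"


def build_deletion_log_params(start=None, end=None, limit=50):
    """Hypothetical helper: parameters for listing deletion-log entries.

    start/end are ISO 8601 timestamps; with ledir="newer" the API
    expects start to be earlier than end.
    """
    params = {
        "action": "query",
        "list": "logevents",
        "letype": "delete",          # only deletion-log entries
        "lenamespace": 0,            # article namespace only
        "leprop": "title|timestamp|comment",
        "ledir": "newer",            # oldest entries first
        "lelimit": limit,
        "format": "json",
    }
    if start is not None:
        params["lestart"] = start
    if end is not None:
        params["leend"] = end
    return params


def build_deletion_log_url(**kwargs):
    """Return the full request URL for the parameters above."""
    return API_ENDPOINT + "?" + urlencode(build_deletion_log_params(**kwargs))


if __name__ == "__main__":
    # Example date range chosen arbitrarily for illustration.
    print(build_deletion_log_url(start="2020-01-01T00:00:00Z",
                                 end="2020-02-01T00:00:00Z",
                                 limit=10))
```

The sketch only constructs the request; fetching and paging through results (and filtering for biographies of scientists) is left out, since those choices depend on the study design.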

6 5

Upcoming Research Newsletter: New Papers Open For Review (September 2020)
by Mohammed Sadat Abdulai 25 Sep '20

Hi everyone,

We’re preparing for the September 2020 research newsletter and looking for contributors. Please take a look at https://etherpad.wikimedia.org/p/WRN202009 and add your name next to any paper you are interested in covering. Our target publication time is 27 September 15:59 UTC. If you can't make this deadline but would like to cover a particular paper in the subsequent issue, leave a note next to the paper's entry. As usual, short notes and one-paragraph reviews are most welcome.

*Highlights from this month:*

- A decade of writing on Wikipedia: A comparative study of three articles
- A Taxonomy of Knowledge Gaps for Wikimedia Projects (First Draft)
- Biased Representation of Politicians in Google and Wikipedia Search? The Joint Effect of Party Identity, Gender Identity and Elections
- Covid-on-the-Web: Knowledge Graph and Services to Advance COVID-19 Research
- VideoCutTool - Online Video Editor Tool for Wikimedia Commons
- Mobile Recognition of Wikipedia Featured Sites using Deep Learning and Crowd-sourced Imagery
- PNEL: Pointer Network based End-To-End Entity Linking over Knowledge Graphs
- Using logical constraints to validate information in collaborative knowledge graphs: a study of COVID-19 on Wikidata
- What if we had no Wikipedia? Domain-independent Term Extraction from a Large News Corpus
- Wikidata on MARS

*Masssly and Tilman Bayer*

[1] http://meta.wikimedia.org/wiki/Research:Newsletter
[2] WikiResearch (@WikiResearch) | Twitter <https://twitter.com/WikiResearch>

1 0

[Wikimedia Research Showcase] September 23, 2020: Knowledge Gaps
by Janna Layton 23 Sep '20

Hi all,

The next Research Showcase will be live-streamed on Wednesday, September 23, at 9:30 AM PDT/16:30 UTC, and will be on the theme of knowledge gaps. Miriam Redi will give an overview of the first draft of the taxonomy of knowledge gaps in Wikimedia projects. The taxonomy is a first milestone towards developing a framework to understand and measure knowledge gaps, with the goal of capturing the multi-dimensional aspect of knowledge gaps and informing long-term decision making.

YouTube stream: https://www.youtube.com/watch?v=GJDsKPsz64o

As usual, you can join the conversation on IRC at #wikimedia-research. You can also watch our past research showcases here: https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase

This month's presentation:

A first draft of the knowledge gaps taxonomy for Wikimedia projects
By the Wikimedia Foundation Research Team <https://research.wikimedia.org/>

In response to Wikimedia Movement’s 2030 strategic direction <https://meta.wikimedia.org/wiki/Strategy/Wikimedia_movement/2018-20>, the Research team <https://research.wikimedia.org/team.html> at the Wikimedia Foundation is developing a framework to understand and measure knowledge gaps. The goal is to capture the multi-dimensional aspect of knowledge gaps and inform long-term decision making. The first milestone was to develop a taxonomy of knowledge gaps which offers a grouping and descriptions of the different Wikimedia knowledge gaps. The first draft of the taxonomy is now published <https://arxiv.org/abs/2008.12314> and we seek your feedback to improve it. In this talk, we will give an overview of the first draft of the taxonomy of knowledge gaps in Wikimedia projects. Following that, we will host an extended Q&A in which we would like to get your feedback and discuss the taxonomy and knowledge gaps more generally with you.

- More information: https://meta.wikimedia.org/wiki/Research:Knowledge_Gaps_Index/Taxonomy

--
Janna Layton (she/her)
Administrative Associate - Product & Technology
Wikimedia Foundation <https://wikimediafoundation.org/>

1 2

Re: [Wiki-research-l] Feedback about Wikipedia-related project.
by Garcia Duran Alberto 22 Sep '20

Hi Su,

Thanks for your questions. Imagine you are a fan of Mollywood (a Hollywood-inspired nickname for Malayalam cinema) and you want to improve the article about the following movie: https://en.wikipedia.org/wiki/Aarohanam_(1980_film)

You have just watched the movie and want to tell the world about its plot, so you create a new section named "Plot". You have an idea of which Wikipedia articles to link from the section, but to get the whole picture you ask the tool for recommendations. The format of a query is (article, section name, type of entities to suggest). You query the model to get the following information:

- Where is the plot of the movie set? This translates into the query
  Query(Aarohanam_(1980_film), "Plot", Location)
  Recommendations: (Kerala, Puducherry, India, Malabar Coast)
- Which topics are addressed in the plot of the movie?
  Query(Aarohanam_(1980_film), "Plot", TopicalConcept)
  Recommendations: (Bipolar disorder, Poverty)

Given these recommendations and your previous knowledge, you are ready to start editing the section!

You also know this movie had an impact on the Malayalam culture, so you decide to add a new section named "Impact on the Malayalam society". As before, you want recommendations from the model before editing. Now you query the tool with

  Query(Aarohanam_(1980_film), "Impact on the Malayalam society", Person)
  Query(Aarohanam_(1980_film), "Impact on the Malayalam society", Event)

The model provides suggestions for these queries, of type Person and Event, respectively. With these suggestions and your previous knowledge, you are ready to start editing.

To sum up, the tool suggests Wikipedia entities to insert in the text of a section you are about to edit for the first time. You can also use the tool when the section already exists and contains some text and links. In this case, you can check whether the section is missing an important entity that the tool recommends.

However, one of our concerns relates to the requirement of specifying the type of entity (Person, Event, TopicalConcept) for which the editor wants recommendations. We are wondering whether this requirement is limiting. Note that, as indicated in our original post, the total number of entity types is in a manageable range (~20) and can be presented visually to the editor (using a dropdown list).

Thanks!
dlab
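The query format described in this message, (article, section name, entity type), could be sketched as below. This is a toy illustration, not the dlab model or its interface: the `Query` class, the `recommend` function, and the lookup table are hypothetical, and the suggestions are simply the example recommendations quoted in the message rather than model output.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Query:
    """One request to the (hypothetical) recommender."""
    article: str
    section: str
    entity_type: str


# Canned suggestions copied from the example in the message.
# A real tool would call the trained model here instead.
EXAMPLE_SUGGESTIONS = {
    Query("Aarohanam_(1980_film)", "Plot", "Location"):
        ("Kerala", "Puducherry", "India", "Malabar Coast"),
    Query("Aarohanam_(1980_film)", "Plot", "TopicalConcept"):
        ("Bipolar disorder", "Poverty"),
}


def recommend(query, existing_links=()):
    """Return suggested entities that the section does not link yet."""
    already_linked = set(existing_links)
    suggestions = EXAMPLE_SUGGESTIONS.get(query, ())
    return [s for s in suggestions if s not in already_linked]


if __name__ == "__main__":
    q = Query("Aarohanam_(1980_film)", "Plot", "Location")
    # New, empty section: all suggestions are returned.
    print(recommend(q))
    # Existing section that already links to India: only the missing ones.
    print(recommend(q, existing_links=["India"]))
```

Passing `existing_links` mirrors the second use case in the message, where a section already has text and links and the editor only wants to see recommended entities that are still missing.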

1 0

Feedback about Wikipedia-related project.
by Garcia Duran Alberto 22 Sep '20

Hi all,

We are researchers from the dlab at EPFL working with Bob West. We plan to build a graph-based ML algorithm, which will support the development of a tool to assist Wikipedia editors by providing recommendations for two novel use cases.

One consists of suggesting hyperlinks (Wikipedia articles) to be inserted within a section of an article. Note that this is different from "classical link prediction". We feel the tool could be of great value, as it can work with newly created sections that do not have any content yet. What's more, the editor can type *any* section name (one that does not yet exist in that article, or even in the whole Wiki project) and the tool would be able to suggest hyperlinks that are likely to be of interest for that section of the article. We think that stub articles especially can benefit from this tool.

However, we make one assumption. In addition to the section name, the editor must provide the "entity type" (Place, People, Date, Organization...) of the Wikipedia articles she would like to insert in the section. The reason is that a single section can contain links to articles of diverse types.

The reason we are reaching out to you is twofold:

(1) To check whether such a tool would be of interest and likely to be used by editors.
(2) To ask how limiting the assumption is that the editor needs to specify the entity type of the Wikipedia articles for which she needs recommendations from the tool.

On the one hand, some of us think this is not a problem, as the number of entity types is relatively small (between 10 and 20) and they can easily be presented visually to the editor with a dropdown list. On the other hand, others think this requirement is limiting. We would like to know your opinion to decide whether we should move forward with this project.

Thanks!
dlab

3 2

Editor surveys on race/ethnicity/religion
by Su-Laine Brodsky 22 Sep '20

Hi everyone, I’m wondering if any large-scale surveys have been done that ask Wikipedia editors about their race, ethnicity, or religion? Also, have any researchers considered asking these questions in editor surveys, but chosen not to ask them for particular reasons? Best wishes, Su-Laine Su-Laine Brodsky Vancouver, BC

6 10

[feedback requested] Taxonomy of knowledge gaps
by Leila Zia 18 Sep '20

Hi all,

I hope this email finds you well. I'm reaching out to let you know that the Research team [1] at the Wikimedia Foundation has been working on developing a taxonomy of knowledge gaps for the Wikimedia projects. We now have the first draft of the taxonomy ready and we're seeking your input to improve it.

==Material==
* A summary of the taxonomy and motivation: https://commons.wikimedia.org/wiki/File:The_Knowledge_Gaps_Taxonomy_Summary…
* Full paper: https://arxiv.org/abs/2008.12314
* A video presentation: https://www.youtube.com/watch?v=pP3uXA9bfvU or https://commons.wikimedia.org/wiki/File:Knowledge_Gaps_Taxonomy.mp4.webm (same video on two platforms)

==Feedback==
Please provide your feedback by answering the 6 questions posted at https://meta.wikimedia.org/wiki/Research_talk:Knowledge_Gaps_Index/Taxonomy… . We're collecting feedback until 2020-09-30.

==Talk with us==
If you have questions about the taxonomy and you'd like to talk with us in a synchronous set-up, we invite you to join us at the upcoming Research Showcase https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase#September_2020 . We will give a very short presentation about it and will leave 15-20 min for any questions you may have. If you need more time to talk about it in sync, we're happy to open time for that as well. Let us know.

==What happens next?==
We have explained it at https://meta.wikimedia.org/wiki/Research:Knowledge_Gaps_Index/Taxonomy#What… . At a high level, we will iterate on the taxonomy based on the feedback and will start researching how to measure the different gap types identified in the taxonomy.

Thank you!
Isaac, Martin, Miriam, and Leila

[1] https://research.wikimedia.org/team.html

--
Leila Zia
Head of Research
Wikimedia Foundation

2 2

[Announcement] A new formal collaboration in Research
by Isaac Johnson 16 Sep '20

Hi all,

The Research team at the Wikimedia Foundation [1] has officially started a new Formal Collaboration [2] with Djellel Difallah (NYU Abu Dhabi) to work collaboratively on sockpuppet detection [3] as part of the Improve Knowledge Integrity program [4] and link recommendation [5] as part of the Address Knowledge Gaps program [6]. You may recognize Djellel as a former member of the Research team and we are glad to be able to continue to collaborate with him as he rejoins academia!

Here are a few pieces of information about this collaboration that we would like to share with you:

* We aim to keep the research documentation for these projects in the corresponding research page on meta (sockpuppet detection) [3] and phabricator ticket (link recommendation) [5].
* We are thankful to Djellel for agreeing to spend his time and expertise on these projects in the coming year, and to those of you who have worked with us to improve these models.
* I will act as the point of contact for the sockpuppet detection research and Martin Gerlach (cc'ed) will act as the point of contact for the link recommendation research in the Wikimedia Foundation. Please feel free to reach out to one of us (directly, if it cannot be shared publicly) if you have comments or questions about a specific project.

Best,
Isaac Johnson

[1] https://research.wikimedia.org/
[2] https://www.mediawiki.org/wiki/Wikimedia_Research/Formal_collaborations
[3] https://meta.wikimedia.org/wiki/Research:Sockpuppet_detection_in_Wikimedia_…
[4] https://research.wikimedia.org/knowledge-integrity.html
[5] https://phabricator.wikimedia.org/T252822
[6] https://research.wikimedia.org/knowledge-gaps.html

--
Isaac Johnson (he/him/his) -- Research Scientist -- Wikimedia Foundation

1 0

The August 2020 issue of the Wikimedia Research Newsletter is out
by Mohammed Sadat Abdulai 13 Sep '20

The August 2020 issue of the Wikimedia Research Newsletter is out: https://meta.wikimedia.org/wiki/Research:Newsletter/2020/August

In this issue:

1. "Protecting the Web from Misinformation" by detecting Wikipedia spammers and identifying pages to protect
2. Editors successfully signal their intelligence by writing high-quality articles - but only when contributing non-anonymously
3. Briefly

*** 11 recent publications were covered or listed in this issue ***

Masssly and Tilman Bayer

---
Wikimedia Research Newsletter https://meta.wikimedia.org/wiki/Research:Newsletter/
* Follow us on Twitter: @WikiResearch <http://twitter.com/Wikiresearch>
* Like us on Facebook: Facebook.com/WikiResearch/
* Receive this newsletter by mail: Research-newsletter Mailing List - Wikimedia <https://lists.wikimedia.org/mailman/listinfo/research-newsletter>

1 0
