Analytics December 2020

analytics@lists.wikimedia.org

4 participants
3 discussions

Seeking Information Regarding Pageview Traffic
by Ankan Ghosh Dastider 21 Dec '20

21 Dec '20

Hello everyone, I am Ankan, a Wikimedian from Bangladesh. Recently, I was searching for the Wikimedia stats website for research purposes. I got a bit curious regarding the Bengali Wikipedia total page view section <https://stats.wikimedia.org/#/bn.wikipedia.org/reading/total-page-views/nor…>, as the traffic didn't match the normal flow in January 2018 and faced a sudden surge of desktop access by users. It is unprecedented and highest till today. If you check the normal rate of desktop access, you will see that it is almost 450% than the second highest. The pageview result suggests that the top-visited pages are category-related and date-related pages (the highest visited one is 'Category:Stubs', see here <https://pageviews.toolforge.org/?project=bn.wikipedia.org&platform=desktop&…>) which is quite enigmatic as these pages are hardly viewed by the general readers. The result of certain dates in January 2018 is completely exceptional. Note that, I have checked some other languages and the rate is normal there. I am seeking your assistance to analyze the probable reason behind this surge. Thanks in advance! Best regards, Ankan -- Ankan Ghosh Dastider (he/him) User:ANKAN <https://meta.wikimedia.org/wiki/User:ANKAN> || All Wikimedia Foundation <https://meta.wikimedia.org/wiki/Wikimedia_Foundation>'s public Wiki Executive Member || Wikimedia Bangladesh <http://wikimedia.org.bd/> Twitter <https://twitter.com/Iagdastider> | LinkedIn <https://www.linkedin.com/in/ankan-ghosh-dastider/> | ResearchGate <https://www.researchgate.net/profile/Ankan_Ghosh_Dastider>

2 4

[Wikimedia Research Showcase] December 16, 2020: Disinformation and Reliability of Sources in Wikipedia
by Janna Layton 16 Dec '20

16 Dec '20

Hello, The next Research Showcase will be live-streamed on Wednesday, December 16, at 9:30 AM PST/17:30 UTC, and will be on the theme of disinformation and reliability of sources in Wikipedia. In the first talk, Włodzimierz Lewoniewski will present recent work around multilingual approaches for the assessment of content quality and reliability of sources in Wikipedia leveraging machine learning algorithms. In the second talk, Diego Saez-Trumper will give an overview of ongoing work on fighting disinformation in Wikipedia; specifically, the development of tools and datasets aimed at supporting the discovery of suspicious content and improving verifiability. Youtube stream: https://www.youtube.com/watch?v=v9Wcc-TeaEY As usual, you can join the conversation on IRC at #wikimedia-research. You can also watch our past research showcases here: https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase Talk 1 Speaker: Włodzimierz Lewoniewski (Poznań University of Economics and Business, Poland) Title: Quality assessment of Wikipedia and its sources Abstract: Information in Wikipedia can be edited in over 300 languages independently. Therefore often the same subject in Wikipedia can be described differently depending on language edition. In order to compare information between them one usually needs to understand each of considered languages. We work on solutions that can help to automate this process. They leverage machine learning and artificial intelligence algorithms. The crucial component, however, is assessment of article quality therefore we need to know how to define and extract different quality measures. This presentation briefly introduces some of the recent activities of Department of Information Systems at Poznań University of Economics and Business related to quality assessment of multilingual content in Wikipedia. In particular, we demonstrate some of the approaches for the reliability assessment of sources in Wikipedia articles. Such solutions can help to enrich various language editions of Wikipedia and other knowledge bases with information of better quality. Talk 2 Speaker: Diego Saez-Trumper (Research, Wikimedia Foundation) Title: Challenges on fighting Disinformation in Wikipedia: Who has the (ground-)truth? Abstract: Different from the major social media websites where the fight against disinformation mainly refers to preventing users to massively replicate fake content, fighting disinformation in Wikipedia requires tools that allows editors to apply the content policies of: verifiability, non-original research, and neutral point of view. Moreover, while other platforms try to apply automatic fact checking techniques to verify content, the ground-truth for such verification is done based on Wikipedia, for obvious reasons we can't follow the same pipeline for fact checking content on Wikipedia. In this talk we will explain the ML approach we are developing to build tools to efficiently support wikipedians to discover suspicious content and how we collaborate with external researchers on this task. We will also describe a group of datasets we are preparing to share with the research community in order to produce state-of-the-art algorithms to improve the verifiability of content on Wikipedia. -- Janna Layton (she/her) Administrative Associate - Product & Technology Wikimedia Foundation <https://wikimediafoundation.org/>

1 2

Invitation for Wikimedia Research Office hours December 1, 2020
by Martin Gerlach 01 Dec '20

01 Dec '20

Hi all, Join the Research Team at the Wikimedia Foundation [1] for their monthly Office hours on 2020-12-01 at 17:00-18:00 PM UTC (9am PT/6pm CET). To participate, join the video-call via this Wikimedia-meet link [2]. There is no set agenda - feel free to add your item to the list of topics in the etherpad [3] (You can do this after you join the meeting, too.), otherwise you are welcome to also just hang out. More detailed information (e.g. about how to attend) can be found here [4]. Through these office hours, we aim to make ourselves more available to answer some of the research related questions that you as Wikimedia volunteer editors, organizers, affiliates, staff, and researchers face in your projects and initiatives. Some example cases we hope to be able to support you in: - You have a specific research related question that you suspect you should be able to answer with the publicly available data and you don’t know how to find an answer for it, or you just need some more help with it. For example, how can I compute the ratio of anonymous to registered editors in my wiki? - You run into repetitive or very manual work as part of your Wikimedia contributions and you wish to find out if there are ways to use machines to improve your workflows. These types of conversations can sometimes be harder to find an answer for during an office hour, however, discussing them can help us understand your challenges better and we may find ways to work with each other to support you in addressing it in the future. - You want to learn what the Research team at the Wikimedia Foundation does and how we can potentially support you. Specifically for affiliates: if you are interested in building relationships with the academic institutions in your country, we would love to talk with you and learn more. We have a series of programs that aim to expand the network of Wikimedia researchers globally and we would love to collaborate with those of you interested more closely in this space. - You want to talk with us about one of our existing programs [5]. Hope to see many of you, Martin (WMF Research Team) [1] https://research.wikimedia.org/team.html [2] https://meet.wmcloud.org/ResearchOfficeHours [3] https://etherpad.wikimedia.org/p/Research-Analytics-Office-hours [4] https://www.mediawiki.org/wiki/Wikimedia_Research/Office_hours [5] https://research.wikimedia.org/projects.html -- Martin Gerlach Research Scientist Wikimedia Foundation

1 1

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

Analytics December 2020