Hi all,
I'd like to make sure you're aware of the following event, especially the second talk which is scheduled to start on 2019-11-20 at 10:00 PST - 18:00 UTC. The presentation will show the latest research on Wikipedia reader motivations and gaps. I'm bringing this research to your attention as for the first time we were able to sample the readership (traffic) from countries in Africa for French and English Wikipedias. While the presentation will not do a deep dive in Africa's readership (as we did the study in 14 languages in many regions of the world) it can still give you more insights about the readers from your continent. :)
If you cannot watch the meeting at the specified time, you can always watch it afterwards as it will be recorded.
Best, Leila
---------- Forwarded message --------- From: Janna Layton jlayton@wikimedia.org Date: Fri, Nov 15, 2019 at 12:23 PM Subject: [Wiki-research-l] [Wikimedia Research Showcase] November 20, 2019 at 9:30 AM PST, 17:30 UTC To: wikimedia-l@lists.wikimedia.org, analytics@lists.wikimedia.org, wiki-research-l@lists.wikimedia.org
Hi all,
The next Research Showcase will be live-streamed on Wednesday, November 20, 2019, at 9:30 AM PST/17:30 UTC. We’ll have a presentation from Martin Potthast of Leipzig University on text reuse in Wikipedia and other presentation from the Wikimedia Foundation’s Isaac Johnson on the demographics and interests of Wikipedia’s readers.
YouTube stream: https://www.youtube.com/watch?v=tIko_V1k09s
As usual, you can join the conversation on IRC at #wikimedia-research. You can also watch our past research showcases here: https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
This month's presentations:
Wikipedia Text Reuse: Within and Without
By Martin Potthast, Leipzig University
We study text reuse related to Wikipedia at scale by compiling the first corpus of text reuse cases within Wikipedia as well as without (i.e., reuse of Wikipedia text in a sample of the Common Crawl). To discover reuse beyond verbatim copy and paste, we employ state-of-the-art text reuse detection technology, scaling it for the first time to process the entire Wikipedia as part of a distributed retrieval pipeline. We further report on a pilot analysis of the 100 million reuse cases inside, and the 1.6 million reuse cases outside Wikipedia that we discovered. Text reuse inside Wikipedia gives rise to new tasks such as article template induction, fixing quality flaws, or complementing Wikipedia’s ontology. Text reuse outside Wikipedia yields a tangible metric for the emerging field of quantifying Wikipedia’s influence on the web. To foster future research into these tasks, and for reproducibility’s sake, the Wikipedia text reuse corpus and the retrieval pipeline are made freely available. Paper https://webis.de/publications.html#?q=wikipedia%20ecir%202019, Demo https://demo.webis.de/wikipedia-text-reuse/
Characterizing Wikipedia Reader Demographics and Interests
By Isaac Johnson, Wikimedia Foundation
Building on two past surveys on the motivation and needs of Wikipedia readers (Why We Read Wikipedia https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase#November_2016; Why the World Reads Wikipedia https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase#December_2018), we examine the relationship between Wikipedia reader demographics and their interests and needs. Specifically, we run surveys in thirteen different languages that ask readers three questions about their motivation for reading Wikipedia (motivation, needs, and familiarity) and five questions about their demographics (age, gender, education, locale, and native language). We link these survey results with the respondents' reading sessions -- i.e. sequence of Wikipedia page views -- to gain a more fine-grained understanding of how a reader's context relates to their activity on Wikipedia. We find that readers have a diversity of backgrounds but that the high-level needs of readers do not correlate strongly with individual demographics. We also find, however, that there are relationships between demographics and specific topic interests that are consistent across many cultures and languages. This work provides insights into the reach of various Wikipedia language editions and the relationship between content or contributor gaps and reader gaps. See the meta page https://meta.wikimedia.org/wiki/Research:Characterizing_Wikipedia_Reader_Behaviour/Demographics_and_Wikipedia_use_cases#Reader_Surveys for more details.
-- Janna Layton (she, her) Administrative Assistant - Product & Technology Wikimedia Foundation https://wikimediafoundation.org/ _______________________________________________ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Thanks. 12:30 Eastern standard time.
On Nov 19, 2019 2:31 PM, "Leila Zia" lzia@wikimedia.org wrote:
Hi all,
I'd like to make sure you're aware of the following event, especially the second talk which is scheduled to start on 2019-11-20 at 10:00 PST
- 18:00 UTC. The presentation will show the latest research on
Wikipedia reader motivations and gaps. I'm bringing this research to your attention as for the first time we were able to sample the readership (traffic) from countries in Africa for French and English Wikipedias. While the presentation will not do a deep dive in Africa's readership (as we did the study in 14 languages in many regions of the world) it can still give you more insights about the readers from your continent. :)
If you cannot watch the meeting at the specified time, you can always watch it afterwards as it will be recorded.
Best, Leila
---------- Forwarded message --------- From: Janna Layton jlayton@wikimedia.org Date: Fri, Nov 15, 2019 at 12:23 PM Subject: [Wiki-research-l] [Wikimedia Research Showcase] November 20, 2019 at 9:30 AM PST, 17:30 UTC To: wikimedia-l@lists.wikimedia.org, analytics@lists.wikimedia.org, wiki-research-l@lists.wikimedia.org
Hi all,
The next Research Showcase will be live-streamed on Wednesday, November 20, 2019, at 9:30 AM PST/17:30 UTC. We’ll have a presentation from Martin Potthast of Leipzig University on text reuse in Wikipedia and other presentation from the Wikimedia Foundation’s Isaac Johnson on the demographics and interests of Wikipedia’s readers.
YouTube stream: https://www.youtube.com/watch?v=tIko_V1k09s
As usual, you can join the conversation on IRC at #wikimedia-research. You can also watch our past research showcases here: https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
This month's presentations:
Wikipedia Text Reuse: Within and Without
By Martin Potthast, Leipzig University
We study text reuse related to Wikipedia at scale by compiling the first corpus of text reuse cases within Wikipedia as well as without (i.e., reuse of Wikipedia text in a sample of the Common Crawl). To discover reuse beyond verbatim copy and paste, we employ state-of-the-art text reuse detection technology, scaling it for the first time to process the entire Wikipedia as part of a distributed retrieval pipeline. We further report on a pilot analysis of the 100 million reuse cases inside, and the 1.6 million reuse cases outside Wikipedia that we discovered. Text reuse inside Wikipedia gives rise to new tasks such as article template induction, fixing quality flaws, or complementing Wikipedia’s ontology. Text reuse outside Wikipedia yields a tangible metric for the emerging field of quantifying Wikipedia’s influence on the web. To foster future research into these tasks, and for reproducibility’s sake, the Wikipedia text reuse corpus and the retrieval pipeline are made freely available. Paper https://webis.de/publications.html#?q=wikipedia%20ecir%202019, Demo https://demo.webis.de/wikipedia-text-reuse/
Characterizing Wikipedia Reader Demographics and Interests
By Isaac Johnson, Wikimedia Foundation
Building on two past surveys on the motivation and needs of Wikipedia readers (Why We Read Wikipedia https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase#November_2016; Why the World Reads Wikipedia <https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase#December_2018
),
we examine the relationship between Wikipedia reader demographics and their interests and needs. Specifically, we run surveys in thirteen different languages that ask readers three questions about their motivation for reading Wikipedia (motivation, needs, and familiarity) and five questions about their demographics (age, gender, education, locale, and native language). We link these survey results with the respondents' reading sessions -- i.e. sequence of Wikipedia page views -- to gain a more fine-grained understanding of how a reader's context relates to their activity on Wikipedia. We find that readers have a diversity of backgrounds but that the high-level needs of readers do not correlate strongly with individual demographics. We also find, however, that there are relationships between demographics and specific topic interests that are consistent across many cultures and languages. This work provides insights into the reach of various Wikipedia language editions and the relationship between content or contributor gaps and reader gaps. See the meta page https://meta.wikimedia.org/wiki/Research:Characterizing_ Wikipedia_Reader_Behaviour/Demographics_and_Wikipedia_ use_cases#Reader_Surveys for more details.
-- Janna Layton (she, her) Administrative Assistant - Product & Technology Wikimedia Foundation https://wikimediafoundation.org/ _______________________________________________ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
African-Wikimedians mailing list African-Wikimedians@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/african-wikimedians
african-wikimedians@lists.wikimedia.org