Hi all,
The next Research Showcase will be live-streamed on Wednesday, August 19, at 9:30 AM PDT/16:30 UTC, and will be on the theme of readership and navigation.
YouTube stream: https://www.youtube.com/watch?v=MeUl0zjHdF8
As usual, you can join the conversation on IRC at #wikimedia-research. You can also watch our past research showcases here: https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
This month's presentations:
What matters to us most and why? Studying popularity and attention dynamics via Wikipedia navigation data.
By Taha Yasseri (University College Dublin), Patrick Gildersleve (Oxford Internet Institute)
While Wikipedia research was initially focused on editorial behaviour or the content to a great extent, soon researchers realized the value of the navigation data both as a reflection of readers interest and, more generally, as a proxy for behaviour of online information seekers. In this talk we will report on various projects in which we utilized pageview statistics or readers navigation data to study: movies financial success [1], electoral popularity [2], disaster triggered collective attention [3] and collective memory [4], general navigation patterns and article typology [5], and attention patterns in relation to news breakouts.
-
[1] Early Prediction of Movie Box Office Success Based on Wikipedia Activity Big Data. PLoS One (2013). https://doi.org/10.1371/journal.pone.0071226 -
[2] Wikipedia traffic data and electoral prediction: towards theoretically informed models. EPJ Data Science (2016). https://doi.org/10.1140/epjds/s13688-016-0083-3 -
[3] Dynamics and biases of online attention: the case of aircraft crashes. Royal Society Open Science (2016). https://doi.org/10.1098/rsos.160460 -
[4] The memory remains: Understanding collective memory in the digital age. Science Advances (2018). https://doi.org/10.1126/sciadv.1602368 -
[5] Inspiration, captivation, and misdirection: Emergent properties in networks of online navigation. Springer (2018). https://ora.ox.ac.uk/objects/uuid:73baed3c-d3fe-4200-8e90-2d80b11f21cf
Query for Architecture, Click through Military. Comparing the Roles of Search and Navigation on Wikipedia
By Dimitar Dimitrov (GESIS - Leibniz Institute for the Social Sciences)
As one of the richest sources of encyclopedic information on the Web, Wikipedia generates an enormous amount of traffic. In this paper, we study large-scale article access data of the English Wikipedia in order to compare articles with respect to the two main paradigms of information seeking, i.e., search by formulating a query, and navigation by following hyperlinks. To this end, we propose and employ two main metrics, namely (i) searchshare -- the relative amount of views an article received by search --, and (ii) resistance -- the ability of an article to relay traffic to other Wikipedia articles -- to characterize articles. We demonstrate how articles in distinct topical categories differ substantially in terms of these properties. For example, architecture-related articles are often accessed through search and are simultaneously a "dead end" for traffic, whereas historical articles about military events are mainly navigated. We further link traffic differences to varying network, content, and editing activity features. Lastly, we measure the impact of the article properties by modeling access behavior on articles with a gradient boosting approach. The results of this paper constitute a step towards understanding human information seeking behavior on the Web.
-
Different Topic, Different Traffic: How Search and Navigation Interplay on Wikipedia. Journal of Web Science (2019). https://doi.org/10.34962/jws-71
Just a reminder that this will be happening on Wednesday!
On Thu, Aug 13, 2020 at 9:50 PM Janna Layton jlayton@wikimedia.org wrote:
Hi all,
The next Research Showcase will be live-streamed on Wednesday, August 19, at 9:30 AM PDT/16:30 UTC, and will be on the theme of readership and navigation.
YouTube stream: https://www.youtube.com/watch?v=MeUl0zjHdF8
As usual, you can join the conversation on IRC at #wikimedia-research. You can also watch our past research showcases here: https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
This month's presentations:
What matters to us most and why? Studying popularity and attention dynamics via Wikipedia navigation data.
By Taha Yasseri (University College Dublin), Patrick Gildersleve (Oxford Internet Institute)
While Wikipedia research was initially focused on editorial behaviour or the content to a great extent, soon researchers realized the value of the navigation data both as a reflection of readers interest and, more generally, as a proxy for behaviour of online information seekers. In this talk we will report on various projects in which we utilized pageview statistics or readers navigation data to study: movies financial success [1], electoral popularity [2], disaster triggered collective attention [3] and collective memory [4], general navigation patterns and article typology [5], and attention patterns in relation to news breakouts.
[1] Early Prediction of Movie Box Office Success Based on Wikipedia Activity Big Data. PLoS One (2013). https://doi.org/10.1371/journal.pone.0071226
[2] Wikipedia traffic data and electoral prediction: towards theoretically informed models. EPJ Data Science (2016). https://doi.org/10.1140/epjds/s13688-016-0083-3
[3] Dynamics and biases of online attention: the case of aircraft crashes. Royal Society Open Science (2016). https://doi.org/10.1098/rsos.160460
[4] The memory remains: Understanding collective memory in the digital age. Science Advances (2018). https://doi.org/10.1126/sciadv.1602368
[5] Inspiration, captivation, and misdirection: Emergent properties in networks of online navigation. Springer (2018). https://ora.ox.ac.uk/objects/uuid:73baed3c-d3fe-4200-8e90-2d80b11f21cf
Query for Architecture, Click through Military. Comparing the Roles of Search and Navigation on Wikipedia
By Dimitar Dimitrov (GESIS - Leibniz Institute for the Social Sciences)
As one of the richest sources of encyclopedic information on the Web, Wikipedia generates an enormous amount of traffic. In this paper, we study large-scale article access data of the English Wikipedia in order to compare articles with respect to the two main paradigms of information seeking, i.e., search by formulating a query, and navigation by following hyperlinks. To this end, we propose and employ two main metrics, namely (i) searchshare -- the relative amount of views an article received by search --, and (ii) resistance -- the ability of an article to relay traffic to other Wikipedia articles -- to characterize articles. We demonstrate how articles in distinct topical categories differ substantially in terms of these properties. For example, architecture-related articles are often accessed through search and are simultaneously a "dead end" for traffic, whereas historical articles about military events are mainly navigated. We further link traffic differences to varying network, content, and editing activity features. Lastly, we measure the impact of the article properties by modeling access behavior on articles with a gradient boosting approach. The results of this paper constitute a step towards understanding human information seeking behavior on the Web.
Different Topic, Different Traffic: How Search and Navigation Interplay on Wikipedia. Journal of Web Science (2019). https://doi.org/10.34962/jws-71
-- Janna Layton (she/her) Administrative Associate - Product & Technology Wikimedia Foundation https://wikimediafoundation.org/
The Research Showcase will start in about 30 minutes.
On Thu, Aug 13, 2020 at 9:50 PM Janna Layton jlayton@wikimedia.org wrote:
Hi all,
The next Research Showcase will be live-streamed on Wednesday, August 19, at 9:30 AM PDT/16:30 UTC, and will be on the theme of readership and navigation.
YouTube stream: https://www.youtube.com/watch?v=MeUl0zjHdF8
As usual, you can join the conversation on IRC at #wikimedia-research. You can also watch our past research showcases here: https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
This month's presentations:
What matters to us most and why? Studying popularity and attention dynamics via Wikipedia navigation data.
By Taha Yasseri (University College Dublin), Patrick Gildersleve (Oxford Internet Institute)
While Wikipedia research was initially focused on editorial behaviour or the content to a great extent, soon researchers realized the value of the navigation data both as a reflection of readers interest and, more generally, as a proxy for behaviour of online information seekers. In this talk we will report on various projects in which we utilized pageview statistics or readers navigation data to study: movies financial success [1], electoral popularity [2], disaster triggered collective attention [3] and collective memory [4], general navigation patterns and article typology [5], and attention patterns in relation to news breakouts.
[1] Early Prediction of Movie Box Office Success Based on Wikipedia Activity Big Data. PLoS One (2013). https://doi.org/10.1371/journal.pone.0071226
[2] Wikipedia traffic data and electoral prediction: towards theoretically informed models. EPJ Data Science (2016). https://doi.org/10.1140/epjds/s13688-016-0083-3
[3] Dynamics and biases of online attention: the case of aircraft crashes. Royal Society Open Science (2016). https://doi.org/10.1098/rsos.160460
[4] The memory remains: Understanding collective memory in the digital age. Science Advances (2018). https://doi.org/10.1126/sciadv.1602368
[5] Inspiration, captivation, and misdirection: Emergent properties in networks of online navigation. Springer (2018). https://ora.ox.ac.uk/objects/uuid:73baed3c-d3fe-4200-8e90-2d80b11f21cf
Query for Architecture, Click through Military. Comparing the Roles of Search and Navigation on Wikipedia
By Dimitar Dimitrov (GESIS - Leibniz Institute for the Social Sciences)
As one of the richest sources of encyclopedic information on the Web, Wikipedia generates an enormous amount of traffic. In this paper, we study large-scale article access data of the English Wikipedia in order to compare articles with respect to the two main paradigms of information seeking, i.e., search by formulating a query, and navigation by following hyperlinks. To this end, we propose and employ two main metrics, namely (i) searchshare -- the relative amount of views an article received by search --, and (ii) resistance -- the ability of an article to relay traffic to other Wikipedia articles -- to characterize articles. We demonstrate how articles in distinct topical categories differ substantially in terms of these properties. For example, architecture-related articles are often accessed through search and are simultaneously a "dead end" for traffic, whereas historical articles about military events are mainly navigated. We further link traffic differences to varying network, content, and editing activity features. Lastly, we measure the impact of the article properties by modeling access behavior on articles with a gradient boosting approach. The results of this paper constitute a step towards understanding human information seeking behavior on the Web.
Different Topic, Different Traffic: How Search and Navigation Interplay on Wikipedia. Journal of Web Science (2019). https://doi.org/10.34962/jws-71
-- Janna Layton (she/her) Administrative Associate - Product & Technology Wikimedia Foundation https://wikimediafoundation.org/
wiki-research-l@lists.wikimedia.org