Agree with everyone else, this is great!
I just have a question. Is this an evolving thing in a sense that more data sources will be used to define page views? Let me give an example. Reading Web team is working on a new web app prototype that caches pages which can be viewed without hitting the back end. Since no request is made, no page view will be recorded. We may add an event logging event to record a page view which would be another source of information for this API end point.
Baha
On Tue, 17 Nov 2015 02:50:20 +0500, Dan Andreescu dandreescu@wikimedia.org wrote:
Dear Data Enthusiasts,
In collaboration with the Services team, the analytics team wishes to announce a public Pageview API https://wikimedia.org/api/rest_v1/?doc#!/Pageviews_data/get_metrics_pageviews_per_article_project_access_agent_article_granularity_start_end. For an example of what kind of UIs someone could build with it, check out this excellent demo http://analytics.wmflabs.org/demo/pageview-api (code) https://gist.github.com/marcelrf/49738d14116fd547fe6d#file-article-comparison-html .
The API can tell you how many times a wiki article or project is viewed over a certain period. You can break that down by views from web crawlers or humans, and by desktop, mobile site, or mobile app. And you can find the 1000 most viewed articles https://wikimedia.org/api/rest_v1/metrics/pageviews/top/es.wikipedia/all-access/2015/11/11 on any project, on any given day or month that we have data for. We currently have data back through October and we will be able to go back to May 2015 when the loading jobs are all done. For more information, take a look at the user docs https://wikitech.wikimedia.org/wiki/Analytics/AQS/Pageview_API.
After many requests from the community, we were really happy to finally make this our top priority and get it done. Huge thanks to Gabriel, Marko, Petr, and Eric from Services, Alexandros and all of Ops really, Henrik for maintaining stats.grok, and, of course, the many community members who have been so patient with us all this time.
The Research team’s Article Recommender tool http://recommend.wmflabs.org/ already uses the API to rank pages and determine relative importance. Wiki Education Foundation’s dashboard https://dashboard.wikiedu.org/ is going to be using it to count how many times an article has been viewed since a student edited it. And there are other grand plans for this data like “article finder”, which will find low-rated articles with a lot of pageviews; this can be used by editors looking for high-impact work. Join the fun, we’re happy to help get you started and listen to your ideas. Also, if you find bugs or want to suggest improvements, please create a task in Phabricator and tag it with #Analytics-Backlog https://phabricator.wikimedia.org/tag/analytics-backlog/.
So what’s next? We can think of too many directions to go into, for pageview data and Wikimedia project data, in general. We need to work with you to make a great plan for the next few quarters. Please chime in here https://phabricator.wikimedia.org/T112956 with your needs.
Team Analytics
Hello Bahodir,
On Tue, Nov 17, 2015 at 2:15 PM, Bahodir Mansurov bmansurov@wikimedia.org wrote:
Agree with everyone else, this is great!
I just have a question. Is this an evolving thing in a sense that more data sources will be used to define page views? Let me give an example. Reading Web team is working on a new web app prototype that caches pages which can be viewed without hitting the back end. Since no request is made, no page view will be recorded.
This got me intrigued. I am wondering what exactly is meant by that. Got something (wikipage, doc, something...) a curious being like me could read ?
wikitech-l@lists.wikimedia.org