Hi team,
Can you help here? Is there any API to fetch the most viewed wiki pages by country? For instance, the most viewed pages in India, USA, UK, etc.
Thanks, Anand
Begin forwarded message:
From: Michael Holloway mholloway@wikimedia.org
Date: 7 April 2017 at 9:58:22 PM IST
To: Anandroid Inc anandrdinc@gmail.com
Cc: "MediaWiki API announcements & discussion" mediawiki-api@lists.wikimedia.org
Subject: Re: [Mediawiki-api] Wikipedia most viewed pages data
Anand,
I don't believe we track per-article pageview counts by country. However, it might be worth asking on analytics@lists.wikimedia.org to be sure, as they're the experts on our analytics infrastructure.
Best, Michael
On Fri, Apr 7, 2017 at 12:03 PM, Anandroid Inc anandrdinc@gmail.com wrote:
Thanks Michael, this clears it up. One last question: is there a way to get the most viewed pages by country? For instance, the most viewed pages in India, USA, UK, etc.
Thanks, Anand
On 07-Apr-2017, at 8:58 PM, Michael Holloway mholloway@wikimedia.org wrote:
Whoops! Meant to reply to list.
On Fri, Apr 7, 2017 at 10:20 AM, Michael Holloway mholloway@wikimedia.org wrote:
Hi Anand,
It looks like the results from those two queries are from different days. The first set is from April 5 and the second is from April 6. For the results from April 5 in the REST API, you'll want https://en.wikipedia.org/api/rest_v1/feed/featured/2017/04/06.
Also please note that the results from the Action API (api.php) aren't sorted in decreasing pageview order as the REST API's are. You'll need to sort them yourself. (Depending on the number of results you need, you should also familiarize yourself with query continuation as well as the result limits per request on the various modules you're using (for instance, 20 for TextExtracts and 50 for PageImages as indicated at the bottom of the results of your query)).
After ensuring you have the same date and sorting the Action API results, you should have the same or very similar results. For instance, looking at the results from April 5, it looks like "Lake Nyos Disaster" is on top with 497638 pageviews in both sets. You still might see slight discrepancies between the two sources due to implementation details, however; for instance, in the REST API endpoint we use a heuristic to attempt to filter out pages with pageview counts likely inflated by bot traffic, and so a few pages here and there that appear in the Action API results wouldn't appear in the REST API list.
Best, Michael
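The sorting step described above can be sketched in Python. This is a minimal sketch, not the API's own behaviour: it assumes the Action API response stores each page under `query.pages` with a `title` and a `pageviews` mapping of date string to count (possibly null), and the helper name `rank_by_pageviews` and the sample rows other than the 497638 figure are made up for illustration.

```python
from typing import Any, Dict, List, Tuple


def rank_by_pageviews(pages: Dict[str, Dict[str, Any]], day: str) -> List[Tuple[str, int]]:
    """Return (title, views) pairs for `day`, highest view count first."""
    ranked = []
    for page in pages.values():
        # Assumed shape: {"title": ..., "pageviews": {"YYYY-MM-DD": count or None}}
        views = (page.get("pageviews") or {}).get(day)
        if views is None:
            continue  # no count recorded for this day, so the page cannot be ranked
        ranked.append((page["title"], views))
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked


# Hypothetical sample standing in for response["query"]["pages"].
sample_pages = {
    "1": {"title": "Example A", "pageviews": {"2017-04-05": 1200}},
    "2": {"title": "Lake Nyos Disaster", "pageviews": {"2017-04-05": 497638}},
    "3": {"title": "Example B", "pageviews": {"2017-04-05": None}},
}
ranked = rank_by_pageviews(sample_pages, "2017-04-05")
print(ranked)
```

If the query is continued across several requests, the `pages` dictionaries from each batch would need to be merged before ranking.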
On Fri, Apr 7, 2017 at 6:02 AM, Anandroid Inc anandrdinc@gmail.com wrote:
Hi Team,
Thanks a lot for your support and quick responses. Following your suggestions, I found the two APIs below, which serve my purpose. Both queries ask for today's most viewed pages on Wikipedia, but they return different results. Could you please help me with this?
https://en.wikipedia.org/w/api.php?action=query&format=json&prop=ext...
On 07-Apr-2017, at 3:43 AM, Adam Baso abaso@wikimedia.org wrote:
For the action API I think you're looking for an extra property, pageviews. Maybe something like this:
https://en.wikipedia.org/wiki/Special:ApiSandbox#action=query&format=jso...
> On Thu, Apr 6, 2017 at 4:44 PM, Anandroid Inc anandrdinc@gmail.com wrote:
> ++Michael
>
>> On 07-Apr-2017, at 3:09 AM, Anandroid Inc anandrdinc@gmail.com wrote:
>>
>> Hi Michael,
>>
>> Thanks for your quick response.
>> This is very helpful, exactly what I was looking for.
>>
>> Thanks,
>> Anand
>>
>>> On 07-Apr-2017, at 12:08 AM, Michael Holloway mholloway@wikimedia.org wrote:
>>>
>>> Hi Anand,
>>>
>>> It sounds like the REST API's featured feed endpoint provides what you're looking for.
>>>
>>> For example: https://en.wikipedia.org/api/rest_v1/feed/featured/2017/04/06 (see the content under the "mostread" key).
>>>
>>> Under the hood, titles are obtained from the Pageview API and then supplementary information for our desired titles is obtained from the REST API's page summary endpoint. That would be the easiest way to go if you'd like to go in a slightly different direction from what's provided in our featured feed endpoint.
>>>
>>> If you'd like, you can view the implementation for the "mostread" section of our aggregated feed endpoint here:
>>>
>>> https://phabricator.wikimedia.org/diffusion/GMOA/browse/master/lib/feed/most...
>>>
>>> Please note that as of now the REST API endpoints are officially designated as unstable.
>>>
>>> Best,
>>> Michael
>>>
>>>> On Thu, Apr 6, 2017 at 2:19 PM, Anandroid Inc anandrdinc@gmail.com wrote:
>>>> Hi Team,
>>>>
>>>> Thanks for the great api provided for all the wiki info.
>>>> I am looking for a combination of right parameters which will return me most viewed pages with the page url, article snippet, thumbnail image url.
>>>> Hope you can help with this.
>>>>
>>>> Thanks,
>>>> Anand
>>>> _______________________________________________
>>>> Mediawiki-api mailing list
>>>> Mediawiki-api@lists.wikimedia.org
>>>> https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Hi Anandroid,
We don't publish pageviews per country via APIs yet. We have to take care to sanitize the data properly before doing so, and that's something we're tackling in the next fiscal year (starting in 3 months at the earliest). We will announce it to this list, so do subscribe if you're interested in updates.
On Fri, Apr 7, 2017 at 2:12 PM, Anandroid Inc anandrdinc@gmail.com wrote:
Hi team,
Can you help here? Is there any API to fetch the most viewed wiki pages by country? For instance, the most viewed pages in India, USA, UK, etc.
Thanks, Anand
Begin forwarded message:
From: Michael Holloway mholloway@wikimedia.org
Date: 7 April 2017 at 9:58:22 PM IST
To: Anandroid Inc anandrdinc@gmail.com
Cc: "MediaWiki API announcements & discussion" mediawiki-api@lists.wikimedia.org
Subject: Re: [Mediawiki-api] Wikipedia most viewed pages data
Anand,
I don't believe we track per-article pageview counts by country. However, it might be worth asking on analytics@lists.wikimedia.org to be sure, as they're the experts on our analytics infrastructure.
Best, Michael
On Fri, Apr 7, 2017 at 12:03 PM, Anandroid Inc anandrdinc@gmail.com wrote:
Thanks Michael, this clears it up. One last question: is there a way to get the most viewed pages by country? For instance, the most viewed pages in India, USA, UK, etc.
Thanks, Anand
On 07-Apr-2017, at 8:58 PM, Michael Holloway mholloway@wikimedia.org wrote:
Whoops! Meant to reply to list.
On Fri, Apr 7, 2017 at 10:20 AM, Michael Holloway mholloway@wikimedia.org wrote:
Hi Anand,
It looks like the results from those two queries are from different days. The first set is from April 5 and the second is from April 6. For the results from April 5 in the REST API, you'll want https://en.wikipedia.org/api/rest_v1/feed/featured/2017/04/06.
Also please note that the results from the Action API (api.php) aren't sorted in decreasing pageview order as the REST API's are. You'll need to sort them yourself. (Depending on the number of results you need, you should also familiarize yourself with query continuation https://www.mediawiki.org/wiki/API:Query#Continuing_queries as well as the result limits per request on the various modules you're using (for instance, 20 for TextExtracts and 50 for PageImages as indicated at the bottom of the results of your query)).
After ensuring you have the same date and sorting the Action API results, you should have the same or very similar results. For instance, looking at the results from April 5, it looks like "Lake Nyos Disaster" is on top with 497638 pageviews in both sets. You still might see slight discrepancies between the two sources due to implementation details, however; for instance, in the REST API endpoint we use a heuristic to attempt to filter out pages with pageview counts likely inflated by bot traffic, and so a few pages here and there that appear in the Action API results wouldn't appear in the REST API list.
Best, Michael
On Fri, Apr 7, 2017 at 6:02 AM, Anandroid Inc anandrdinc@gmail.com wrote:
Hi Team,
Thanks a lot for your support and quick responses. Following your suggestions, I found the two APIs below, which serve my purpose. Both queries ask for today's most viewed pages on Wikipedia, but they return different results. Could you please help me with this?
https://en.wikipedia.org/w/api.php?action=query&format=json&prop=extracts%7Cinfo%7Cpageimages%7Cpageviews%7Cpageterms%7Cpageprops%7Crevisions&generator=mostviewed&exchars=512&exlimit=max&exintro=1&explaintext=1&exsectionformat=plain&inprop=url&pithumbsize=640&pilimit=max&wbptterms=description%7Calias%7Clabel&gpvimlimit=50&pvipdays=1
https://en.wikipedia.org/w/api.php?action=query&format=json&prop=extracts%7Cinfo%7Cpageimages%7Cpageviews%7Cpageterms%7Cpageprops%7Crevisions&generator=mostviewed&exchars=512&exlimit=max&exintro=1&explaintext=1&exsectionformat=plain&inprop=url&pithumbsize=640&pilimit=max&wbptterms=description%7Calias%7Clabel&gpvimlimit=100&pvipdays=1
On 07-Apr-2017, at 3:43 AM, Adam Baso abaso@wikimedia.org wrote:
For the action API I think you're looking for an extra property, pageviews. Maybe something like this:
https://en.wikipedia.org/wiki/Special:ApiSandbox#action=query&format=json&prop=extracts%7Cinfo%7Cpageimages%7Cpageviews%7Cpageterms%7Cpageprops%7Crevisions&generator=mostviewed&exchars=512&exlimit=max&exintro=1&explaintext=1&exsectionformat=plain&inprop=url&pithumbsize=640&pilimit=max&pvipdays=8&wbptterms=description%7Calias%7Clabel&gpvimlimit=20
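A long query string like the one above is easier to read and adjust when built from a parameter table. The following is a rough sketch that reuses only the parameter names and values quoted in this thread and targets the `api.php` endpoint used elsewhere in it; the function name `most_viewed_query_url` and its two arguments are hypothetical conveniences, not part of the API.

```python
from urllib.parse import urlencode

API_URL = "https://en.wikipedia.org/w/api.php"

# Property modules named in the thread, joined with "|" as the Action API expects.
PROPS = ["extracts", "info", "pageimages", "pageviews", "pageterms", "pageprops", "revisions"]


def most_viewed_query_url(limit: int = 20, days: int = 8) -> str:
    """Build the mostviewed generator query quoted in the thread."""
    params = {
        "action": "query",
        "format": "json",
        "prop": "|".join(PROPS),
        "generator": "mostviewed",
        "exchars": 512,
        "exlimit": "max",
        "exintro": 1,
        "explaintext": 1,
        "exsectionformat": "plain",
        "inprop": "url",
        "pithumbsize": 640,
        "pilimit": "max",
        "pvipdays": days,
        "wbptterms": "description|alias|label",
        "gpvimlimit": limit,
    }
    # urlencode percent-encodes "|" as %7C, matching the URLs in the thread.
    return API_URL + "?" + urlencode(params)


url = most_viewed_query_url()
print(url)
```

Calling `most_viewed_query_url(limit=50, days=1)` would give the variant with `gpvimlimit=50` and `pvipdays=1`.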
On Thu, Apr 6, 2017 at 4:44 PM, Anandroid Inc anandrdinc@gmail.com wrote:
++Michael
On 07-Apr-2017, at 3:09 AM, Anandroid Inc anandrdinc@gmail.com wrote:
Hi Michael,
Thanks for your quick response. This is very helpful, exactly what I was looking for.
Thanks, Anand
On 07-Apr-2017, at 12:08 AM, Michael Holloway mholloway@wikimedia.org wrote:
Hi Anand,
It sounds like the REST API's featured feed endpoint https://en.wikipedia.org/api/rest_v1/#!/Feed/aggregatedFeed provides what you're looking for.
For example: https://en.wikipedia.org/api/rest_v1/feed/featured/2017/04/06 (see the content under the "mostread" key).
Under the hood, titles are obtained from the Pageview API https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI and then supplementary information for our desired titles is obtained from the REST API's page summary endpoint https://en.wikipedia.org/api/rest_v1/#!/Page_content/get_page_summary_title. That would be the easiest way to go if you'd like to go in a slightly different direction from what's provided in our featured feed endpoint.
If you'd like, you can view the implementation for the "mostread" section of our aggregated feed endpoint here:
https://phabricator.wikimedia.org/diffusion/GMOA/browse/master/lib/feed/most-read.js
Please note that as of now the REST API endpoints are officially designated as unstable.
Best, Michael
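For readers who want to work with the feed endpoint, here is a small Python sketch. It relies on two assumptions. First, the thread only confirms that the 2017/04/06 feed carries the April 5 results; treating the feed date as always one day after the pageview day is a generalisation. Second, only the "mostread" key is confirmed by the thread; the `articles`, `title` and `views` field names are assumed. Both helper names are hypothetical.

```python
from datetime import date, timedelta
from typing import Any, Dict, List, Tuple

FEED_BASE = "https://en.wikipedia.org/api/rest_v1/feed/featured"


def feed_url_for_pageview_day(day: date) -> str:
    """Return the featured feed URL assumed to hold the most-read list for `day`."""
    feed_day = day + timedelta(days=1)  # assumption: feed date is pageview day + 1
    return f"{FEED_BASE}/{feed_day:%Y/%m/%d}"


def most_read(feed: Dict[str, Any]) -> List[Tuple[str, int]]:
    """Extract (title, views) pairs from a decoded feed response."""
    # Assumed shape: {"mostread": {"articles": [{"title": ..., "views": ...}, ...]}}
    articles = feed.get("mostread", {}).get("articles", [])
    return [(article["title"], article["views"]) for article in articles]


feed_url = feed_url_for_pageview_day(date(2017, 4, 5))
print(feed_url)

# Hypothetical decoded response, trimmed to the fields used here.
sample_feed = {"mostread": {"articles": [{"title": "Lake Nyos Disaster", "views": 497638}]}}
print(most_read(sample_feed))
```

Because the endpoints were designated unstable at the time, the field names should be checked against a live response before depending on them.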
On Thu, Apr 6, 2017 at 2:19 PM, Anandroid Inc anandrdinc@gmail.com wrote:
Hi Team,
Thanks for the great API provided for all the wiki info. I am looking for the right combination of parameters to return the most viewed pages with the page URL, article snippet, and thumbnail image URL. Hope you can help with this.
Thanks, Anand
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics