Whoops! Meant to reply to list.
On Fri, Apr 7, 2017 at 10:20 AM, Michael Holloway mholloway@wikimedia.org wrote:
Hi Anand,
It looks like the results from those two queries are from different days. The first set is from April 5 and the second is from April 6. For the results from April 5 in the REST API, you'll want https://en.wikipedia.org/api/rest_v1/feed/featured/2017/04/06.
Also please note that the results from the Action API (api.php) aren't sorted in decreasing pageview order as the REST API's are. You'll need to sort them yourself. (Depending on the number of results you need, you should also familiarize yourself with query continuation https://www.mediawiki.org/wiki/API:Query#Continuing_queries as well as the result limits per request on the various modules you're using (for instance, 20 for TextExtracts and 50 for PageImages as indicated at the bottom of the results of your query)).
After ensuring you have the same date and sorting the Action API results, you should have the same or very similar results. For instance, looking at the results from April 5, it looks like "Lake Nyos Disaster" is on top with 497638 pageviews in both sets. You still might see slight discrepancies between the two sources due to implementation details, however; for instance, in the REST API endpoint we use a heuristic to attempt to filter out pages with pageview counts likely inflated by bot traffic, and so a few pages here and there that appear in the Action API results wouldn't appear in the REST API list.
Best, Michael
On Fri, Apr 7, 2017 at 6:02 AM, Anandroid Inc anandrdinc@gmail.com wrote:
Hi Team,
Thanks a lot for your support and quick responses. As per all the suggestions you gave for my query, I found below two apis which solves my purpose, But I see both queries ask for todays most viewed pages on wikipedia but they are returning different results. Request you to please help me on this.
https://en.wikipedia.org/w/api.php?action=query&format=json& prop=extracts%7Cinfo%7Cpageimages%7Cpageviews%7Cpageterms% 7Cpageprops%7Crevisions&generator=mostviewed&exchars= 512&exlimit=max&exintro=1&explaintext=1&exsectionformat=plai n&inprop=url&pithumbsize=640&pilimit=max&wbptterms=descript ion%7Calias%7Clabel&gpvimlimit=50&pvipdays=1 https://en.wikipedia.org/w/api.php?action=query&format=json&prop=extracts%7Cinfo%7Cpageimages%7Cpageviews%7Cpageterms%7Cpageprops%7Crevisions&generator=mostviewed&exchars=512&exlimit=max&exintro=1&explaintext=1&exsectionformat=plain&inprop=url&pithumbsize=640&pilimit=max&wbptterms=description%7Calias%7Clabel&gpvimlimit=100&pvipdays=1
On 07-Apr-2017, at 3:43 AM, Adam Baso abaso@wikimedia.org wrote:
For the action API I think you're looking for an extra property, *pageviews*. Maybe something like this:
https://en.wikipedia.org/wiki/Special:ApiSandbox#action=quer y&format=json&prop=extracts%7Cinfo%7Cpageimages%7Cpageview s%7Cpageterms%7Cpageprops%7Crevisions&generator=mostviewed&exchars=512& exlimit=max&exintro=1&explaintext=1&exsectionformat=plain& inprop=url&pithumbsize=640&pilimit=max&pvipdays=8&wbptter ms=description%7Calias%7Clabel&gpvimlimit=20
On Thu, Apr 6, 2017 at 4:44 PM, Anandroid Inc anandrdinc@gmail.com wrote:
++Michael
On 07-Apr-2017, at 3:09 AM, Anandroid Inc anandrdinc@gmail.com wrote:
Hi Michael,
Thanks for your quick response. This is very helpful, exactly what I was looking for.
Thanks, Anand
On 07-Apr-2017, at 12:08 AM, Michael Holloway mholloway@wikimedia.org wrote:
Hi Anand,
It sounds like the REST API's featured feed endpoint https://en.wikipedia.org/api/rest_v1/#!/Feed/aggregatedFeed provides what you're looking for.
For example: https://en.wikipedia.org/api/r est_v1/feed/featured/2017/04/06 (see the content under the "mostread" key).
Under the hood, titles are obtained from the Pageview API https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI and then supplementary information for our desired titles is obtained from the REST API's page summary endpoint https://en.wikipedia.org/api/rest_v1/#!/Page_content/get_page_summary_title. That would be the easiest way to go if you'd like to go in a slightly different direction from what's provided in our featured feed endpoint.
If you'd like, you can view the implementation for the "mostread" section of our aggregated feed endpoint here:
https://phabricator.wikimedia.org/diffusion/GMOA/browse/mast er/lib/feed/most-read.js
Please note that as of now the REST API endpoints are officially designated as unstable.
Best, Michael
On Thu, Apr 6, 2017 at 2:19 PM, Anandroid Inc anandrdinc@gmail.com wrote:
Hi Team,
Thanks for the great api provided for all the wiki info. I am looking for a combination of right parameters which will return me most viewed pages with the page url, article snippet, thumbnail image url. Hope you can help with this.
Thanks, Anand _______________________________________________ Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Thanks Michael, this clears it. One last question is there a way to get the most viewed pages countrywise. For instance, most viewed pages in India, USA, UK etc.
Thanks, Anand
On 07-Apr-2017, at 8:58 PM, Michael Holloway mholloway@wikimedia.org wrote:
Whoops! Meant to reply to list.
On Fri, Apr 7, 2017 at 10:20 AM, Michael Holloway <mholloway@wikimedia.org mailto:mholloway@wikimedia.org> wrote: Hi Anand,
It looks like the results from those two queries are from different days. The first set is from April 5 and the second is from April 6. For the results from April 5 in the REST API, you'll want https://en.wikipedia.org/api/rest_v1/feed/featured/2017/04/06 https://en.wikipedia.org/api/rest_v1/feed/featured/2017/04/06.
Also please note that the results from the Action API (api.php) aren't sorted in decreasing pageview order as the REST API's are. You'll need to sort them yourself. (Depending on the number of results you need, you should also familiarize yourself with query continuation https://www.mediawiki.org/wiki/API:Query#Continuing_queries as well as the result limits per request on the various modules you're using (for instance, 20 for TextExtracts and 50 for PageImages as indicated at the bottom of the results of your query)).
After ensuring you have the same date and sorting the Action API results, you should have the same or very similar results. For instance, looking at the results from April 5, it looks like "Lake Nyos Disaster" is on top with 497638 pageviews in both sets. You still might see slight discrepancies between the two sources due to implementation details, however; for instance, in the REST API endpoint we use a heuristic to attempt to filter out pages with pageview counts likely inflated by bot traffic, and so a few pages here and there that appear in the Action API results wouldn't appear in the REST API list.
Best, Michael
On Fri, Apr 7, 2017 at 6:02 AM, Anandroid Inc <anandrdinc@gmail.com mailto:anandrdinc@gmail.com> wrote: Hi Team,
Thanks a lot for your support and quick responses. As per all the suggestions you gave for my query, I found below two apis which solves my purpose, But I see both queries ask for todays most viewed pages on wikipedia but they are returning different results. Request you to please help me on this.
https://en.wikipedia.org/w/api.php?action=query&format=json&prop=ext... https://en.wikipedia.org/w/api.php?action=query&format=json&prop=extracts%7Cinfo%7Cpageimages%7Cpageviews%7Cpageterms%7Cpageprops%7Crevisions&generator=mostviewed&exchars=512&exlimit=max&exintro=1&explaintext=1&exsectionformat=plain&inprop=url&pithumbsize=640&pilimit=max&wbptterms=description%7Calias%7Clabel&gpvimlimit=100&pvipdays=1
- https://en.wikipedia.org/api/rest_v1/feed/featured/2017/04/07 https://en.wikipedia.org/api/rest_v1/feed/featured/2017/04/07
On 07-Apr-2017, at 3:43 AM, Adam Baso <abaso@wikimedia.org mailto:abaso@wikimedia.org> wrote:
For the action API I think you're looking for an extra property, pageviews. Maybe something like this:
https://en.wikipedia.org/wiki/Special:ApiSandbox#action=query&format=jso... https://en.wikipedia.org/wiki/Special:ApiSandbox#action=query&format=json&prop=extracts%7Cinfo%7Cpageimages%7Cpageviews%7Cpageterms%7Cpageprops%7Crevisions&generator=mostviewed&exchars=512&exlimit=max&exintro=1&explaintext=1&exsectionformat=plain&inprop=url&pithumbsize=640&pilimit=max&pvipdays=8&wbptterms=description%7Calias%7Clabel&gpvimlimit=20
On Thu, Apr 6, 2017 at 4:44 PM, Anandroid Inc <anandrdinc@gmail.com mailto:anandrdinc@gmail.com> wrote: ++Michael
On 07-Apr-2017, at 3:09 AM, Anandroid Inc <anandrdinc@gmail.com mailto:anandrdinc@gmail.com> wrote:
Hi Michael,
Thanks for your quick response. This is very helpful, exactly what I was looking for.
Thanks, Anand
On 07-Apr-2017, at 12:08 AM, Michael Holloway <mholloway@wikimedia.org mailto:mholloway@wikimedia.org> wrote:
Hi Anand,
It sounds like the REST API's featured feed endpoint https://en.wikipedia.org/api/rest_v1/#!/Feed/aggregatedFeed provides what you're looking for.
For example: https://en.wikipedia.org/api/rest_v1/feed/featured/2017/04/06 https://en.wikipedia.org/api/rest_v1/feed/featured/2017/04/06 (see the content under the "mostread" key).
Under the hood, titles are obtained from the Pageview API https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI and then supplementary information for our desired titles is obtained from the REST API's page summary endpoint https://en.wikipedia.org/api/rest_v1/#!/Page_content/get_page_summary_title. That would be the easiest way to go if you'd like to go in a slightly different direction from what's provided in our featured feed endpoint.
If you'd like, you can view the implementation for the "mostread" section of our aggregated feed endpoint here:
https://phabricator.wikimedia.org/diffusion/GMOA/browse/master/lib/feed/most... https://phabricator.wikimedia.org/diffusion/GMOA/browse/master/lib/feed/most-read.js
Please note that as of now the REST API endpoints are officially designated as unstable.
Best, Michael
On Thu, Apr 6, 2017 at 2:19 PM, Anandroid Inc <anandrdinc@gmail.com mailto:anandrdinc@gmail.com> wrote: Hi Team,
Thanks for the great api provided for all the wiki info. I am looking for a combination of right parameters which will return me most viewed pages with the page url, article snippet, thumbnail image url. Hope you can help with this.
Thanks, Anand _______________________________________________ Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org mailto:Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org mailto:Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org mailto:Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org mailto:Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Anand,
I don't believe we track per-article pageview counts by country. However, it might be worth asking on analytics@lists.wikimedia.org to be sure, as they're the experts on our analytics infrastructure.
Best, Michael
On Fri, Apr 7, 2017 at 12:03 PM, Anandroid Inc anandrdinc@gmail.com wrote:
Thanks Michael, this clears it. One last question is there a way to get the most viewed pages countrywise. For instance, most viewed pages in India, USA, UK etc.
Thanks, Anand
On 07-Apr-2017, at 8:58 PM, Michael Holloway mholloway@wikimedia.org wrote:
Whoops! Meant to reply to list.
On Fri, Apr 7, 2017 at 10:20 AM, Michael Holloway <mholloway@wikimedia.org
wrote:
Hi Anand,
It looks like the results from those two queries are from different days. The first set is from April 5 and the second is from April 6. For the results from April 5 in the REST API, you'll want https://en.wikipedia.org/api/rest_v1/feed/featured/2017/04/06.
Also please note that the results from the Action API (api.php) aren't sorted in decreasing pageview order as the REST API's are. You'll need to sort them yourself. (Depending on the number of results you need, you should also familiarize yourself with query continuation https://www.mediawiki.org/wiki/API:Query#Continuing_queries as well as the result limits per request on the various modules you're using (for instance, 20 for TextExtracts and 50 for PageImages as indicated at the bottom of the results of your query)).
After ensuring you have the same date and sorting the Action API results, you should have the same or very similar results. For instance, looking at the results from April 5, it looks like "Lake Nyos Disaster" is on top with 497638 pageviews in both sets. You still might see slight discrepancies between the two sources due to implementation details, however; for instance, in the REST API endpoint we use a heuristic to attempt to filter out pages with pageview counts likely inflated by bot traffic, and so a few pages here and there that appear in the Action API results wouldn't appear in the REST API list.
Best, Michael
On Fri, Apr 7, 2017 at 6:02 AM, Anandroid Inc anandrdinc@gmail.com wrote:
Hi Team,
Thanks a lot for your support and quick responses. As per all the suggestions you gave for my query, I found below two apis which solves my purpose, But I see both queries ask for todays most viewed pages on wikipedia but they are returning different results. Request you to please help me on this.
https://en.wikipedia.org/w/api.php?action=query&format=json& prop=extracts%7Cinfo%7Cpageimages%7Cpageviews%7Cpageterms%7C pageprops%7Crevisions&generator=mostviewed&exchars=512& exlimit=max&exintro=1&explaintext=1&exsectionformat=plain& inprop=url&pithumbsize=640&pilimit=max&wbptterms=description %7Calias%7Clabel&gpvimlimit=50&pvipdays=1 https://en.wikipedia.org/w/api.php?action=query&format=json&prop=extracts%7Cinfo%7Cpageimages%7Cpageviews%7Cpageterms%7Cpageprops%7Crevisions&generator=mostviewed&exchars=512&exlimit=max&exintro=1&explaintext=1&exsectionformat=plain&inprop=url&pithumbsize=640&pilimit=max&wbptterms=description%7Calias%7Clabel&gpvimlimit=100&pvipdays=1
On 07-Apr-2017, at 3:43 AM, Adam Baso abaso@wikimedia.org wrote:
For the action API I think you're looking for an extra property, *pageviews*. Maybe something like this:
https://en.wikipedia.org/wiki/Special:ApiSandbox#action=quer y&format=json&prop=extracts%7Cinfo%7Cpageimages%7Cpageviews% 7Cpageterms%7Cpageprops%7Crevisions&generator=mostviewed& exchars=512&exlimit=max&exintro=1&explaintext=1& exsectionformat=plain&inprop=url&pithumbsize=640&pilimit= max&pvipdays=8&wbptterms=description%7Calias%7Clabel&gpvimlimit=20
On Thu, Apr 6, 2017 at 4:44 PM, Anandroid Inc anandrdinc@gmail.com wrote:
++Michael
On 07-Apr-2017, at 3:09 AM, Anandroid Inc anandrdinc@gmail.com wrote:
Hi Michael,
Thanks for your quick response. This is very helpful, exactly what I was looking for.
Thanks, Anand
On 07-Apr-2017, at 12:08 AM, Michael Holloway mholloway@wikimedia.org wrote:
Hi Anand,
It sounds like the REST API's featured feed endpoint https://en.wikipedia.org/api/rest_v1/#!/Feed/aggregatedFeed provides what you're looking for.
For example: https://en.wikipedia.org/api/r est_v1/feed/featured/2017/04/06 (see the content under the "mostread" key).
Under the hood, titles are obtained from the Pageview API https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI and then supplementary information for our desired titles is obtained from the REST API's page summary endpoint https://en.wikipedia.org/api/rest_v1/#!/Page_content/get_page_summary_title. That would be the easiest way to go if you'd like to go in a slightly different direction from what's provided in our featured feed endpoint.
If you'd like, you can view the implementation for the "mostread" section of our aggregated feed endpoint here:
https://phabricator.wikimedia.org/diffusion/GMOA/browse/mast er/lib/feed/most-read.js
Please note that as of now the REST API endpoints are officially designated as unstable.
Best, Michael
On Thu, Apr 6, 2017 at 2:19 PM, Anandroid Inc anandrdinc@gmail.com wrote:
Hi Team,
Thanks for the great api provided for all the wiki info. I am looking for a combination of right parameters which will return me most viewed pages with the page url, article snippet, thumbnail image url. Hope you can help with this.
Thanks, Anand _______________________________________________ Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
Mediawiki-api mailing list Mediawiki-api@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
mediawiki-api@lists.wikimedia.org