Thanks Leon,

I agree that adding legacy pagecounts per article, and top articles by legacy pagecounts, to AQS would be awesome. We in Analytics already knew this should happen at some point. I created the task you suggested (https://phabricator.wikimedia.org/T173720). I think, though, that it will take some time (a couple of months) before we are able to work on it. We'll groom it this week and prioritize it. I added you as a subscriber.

Cheers!

On Fri, Aug 18, 2017 at 6:35 PM, Leon Ziemba <musikanimal@wikimedia.org> wrote:
For the record, this is what T149358 was originally about. I was under the impression we were going to have pagecounts for all endpoints (per-article, top and aggregate), and it was somewhat disappointing to find out we only added support for aggregate. From my experience per-article data is actually of greatest interest, and I've gotten requests to add it to Pageviews Analysis since its inception. This was also part of one of the top wishes in the German Technical Wishlist (I can dig up a link if need be). In addition, some things like the Did You Know project on enwiki rely on it, where tens of thousands of template transclusions link to stats.grok.se on article talk pages (see the template test cases for how this works). With stats.grok.se now gone, we have no public-facing web service to get this historical data. So I'd love to see it added to the awesome RESTBase API, but I understand it probably involves a lot of challenges. I can create another Phabricator task if Vipul has not already. At any rate, I have endless thanks to give to the Analytics team for everything you've done for us. It seems we're always asking more from you! :)

R.I.P. stats.grok.se! 10 years was a good run!

~MA

On Sun, Aug 13, 2017 at 1:15 PM, Dan Andreescu <dandreescu@wikimedia.org> wrote:
Ah, yes, for now we have no plans to add the per-article stats, but do open a task and explain how it would be useful, and we'll prioritize it accordingly. In the meantime, it looks like the pagecounts-ez dumps are your best bet (use those instead of pagecounts-raw because the compression is lossless and saves a lot of download time).

From: Vipul Naik
Sent: Sunday, August 13, 2017 11:12
To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics.
Reply To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics.
Subject: Re: [Analytics] Anybody know about stats.grok.se going down?

Hi Dan,

From the documentation of legacy metrics it looks like the legacy metrics are only available for sitewide pageviews for each site, rather than for individual pages. Is per-page data also part of your existing or planned legacy metrics?

Vipul

On Sat, Aug 12, 2017 at 6:17 PM, Dan Andreescu <dandreescu@wikimedia.org> wrote:
Hi Vipul, actually that's also available via the API now! https://wikitech.wikimedia.org/wiki/Analytics/AQS/Legacy_Pagecounts

It's a different path though, to highlight that pre-2015 numbers were counted slightly differently.
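
For anyone following along, here is a minimal sketch of building a request URL for the legacy aggregate endpoint. The path layout (project, access-site, granularity, start, end) and the `all-sites` / `monthly` values are written from memory of the AQS documentation linked above, so treat them as assumptions and check that page before relying on them. The helper name `legacy_pagecounts_url` is made up for illustration.

```python
from urllib.parse import quote

# Assumed base path for legacy (pre-2015) pagecounts; verify against the
# Legacy_Pagecounts page on wikitech before using.
BASE = "https://wikimedia.org/api/rest_v1/metrics/legacy/pagecounts/aggregate"


def legacy_pagecounts_url(project, start, end,
                          access_site="all-sites", granularity="monthly"):
    """Build a URL for sitewide legacy pagecounts.

    start and end are timestamps in YYYYMMDDHH form (an assumption based
    on the AQS documentation).
    """
    parts = [project, access_site, granularity, start, end]
    # Percent-encode each path segment so unusual characters cannot
    # break the path structure.
    return BASE + "/" + "/".join(quote(p, safe="") for p in parts)


if __name__ == "__main__":
    print(legacy_pagecounts_url("en.wikipedia.org", "2010010100", "2010020100"))
```

Note that this only covers sitewide totals; there is no per-article parameter in this path.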

On Sat, Aug 12, 2017 at 18:59 Vipul Naik <vipulnaik1@gmail.com> wrote:
Hi Dan and Dan,

Thanks for taking the time to respond. I appreciate it!

I'm aware of the APIs and the WMF Labs tool. I am specifically interested in stats.grok.se for accessing data before July 2015, for which the only way right now is to process rather large raw dumps. I have built-in integrations that get data from stats.grok.se; processing raw dumps to generate pageview counts is possible but a lot of extra work :).
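
To give a sense of what processing the raw dumps involves, here is a rough sketch that sums per-title counts from pagecounts-raw style lines. It assumes each line has four space-separated fields (project code, page title, request count, bytes transferred) and that `en` denotes English Wikipedia; both are assumptions about the dump format, and the function name `count_views` is hypothetical. A real run would also have to download and decompress one file per hour.

```python
from collections import Counter


def count_views(lines, project="en"):
    """Sum request counts per page title for one project code.

    Assumes lines of the form: "<project> <title> <count> <bytes>".
    Lines that do not match that shape are skipped.
    """
    totals = Counter()
    for line in lines:
        fields = line.rstrip("\n").split(" ")
        if len(fields) != 4:
            continue  # malformed line
        proj, title, count, _size = fields
        if proj != project:
            continue  # different project
        try:
            totals[title] += int(count)
        except ValueError:
            continue  # count field was not an integer
    return totals


if __name__ == "__main__":
    sample = [
        "en Main_Page 10 1000\n",
        "en Main_Page 5 500\n",
        "de Hauptseite 3 300\n",
    ]
    print(count_views(sample)["Main_Page"])
```

Since `count_views` accepts any iterable of lines, an open (decompressed) hourly file can be passed in directly.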

Cheers,

Vipul

On Mon, Aug 7, 2017 at 4:17 AM, Dan Andreescu <dandreescu@wikimedia.org> wrote:

On Mon, Aug 7, 2017 at 4:21 AM, Dan Garry <dgarry@wikimedia.org> wrote:
Hi Vipul,

stats.grok.se is pretty much deprecated now. You ran into one of the reasons why: it's not very reliable. You should use the Pageviews Analysis tool instead, which was put together by MusikAnimal and Community Tech. This tool was intended to replace stats.grok.se. There is documentation about the tool that you may wish to read.

Thanks,
Dan

On 7 August 2017 at 06:34, Vipul Naik <vipulnaik1@gmail.com> wrote:
stats.grok.se (a source of pageview stats for the time before the Wikimedia API became available) has been down for about a week. I tried emailing Henrik Abelsson, whom I've previously contacted when the site had issues, but haven't received a response this time.

Any ideas on why it's down and whom to reach out to to help resolve the issue?

Vipul

_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics




--
Dan Garry
Senior Product Manager, Editing
Wikimedia Foundation

--
Marcel Ruiz Forns
Analytics Developer
Wikimedia Foundation