Thanks Erik. This is exactly what I'm looking for. Thanks for the help for everybody else as well.

On Sat, Jul 9, 2016 at 1:42 PM, Erik Zachte <ezachte@wikimedia.org> wrote:

Manny,

 

If you're willing to download large files, monthly totals are here

https://dumps.wikimedia.org/other/pagecounts-ez/merged/

 

pagecounts-2016-06-views-ge-5.bz2  has all titles with 5 or more requests per month

It contains monthly totals, plus efficiently packed hourly data, all in one line.

 

pagecounts-2016-06-views-ge-5-totals.bz2 is derived from the previous file, with hourly data stripped.

 

But monthly totals in the API would also be great (and asked for by several people ; I count Magnus Manske as one, but his audience would rejoice as well ;-)

 

Cheers

Erik

 

 

From: Analytics [mailto:analytics-bounces@lists.wikimedia.org] On Behalf Of Lane Rasberry
Sent: Saturday, July 09, 2016 15:01
To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics.
Subject: Re: [Analytics] Wiki Page Views Project

 

I collected some non-technical tools at
<https://meta.wikimedia.org/wiki/Traffic_reporting>

 

On Sat, Jul 9, 2016 at 8:29 AM, Dan Andreescu <dandreescu@wikimedia.org> wrote:

If those two don't help there is some raw data available here: https://dumps.wikimedia.org/other/analytics/

 

Those files are also hourly but you make a good case for us including monthly totals, so we're open to that if you can't use the other resources.  (I work on the analytics team)



On Saturday, July 9, 2016, Alex Druk <alex.druk@gmail.com> wrote:

Hi Manny,

 

 

Alex

 

 

 

On Sat, Jul 9, 2016 at 4:22 AM, Manny Manny <mannya897@gmail.com> wrote:

Hello All,

 

I am working on a project the uses page view numbers for wiki articles and I was hoping somebody could help me out. I am using wikipedia redirects to find aliases for query names. Unfortunately there is a lot of noise in the redirects. I was hoping to use the page views as a heuristic to weed out bad redirects. I was looking at the page view files but the ones on stats.grok.se are hourly which is too much to process in a reasonable amount of time. I was wondering if anybody had (or knew where I could access) page view files for a longer amount of time like yearly, monthly, or even daily. I need to able to download the file locally because I will be dealing with a lot of query names. I appreciate any help you can provide. 

 

Thanks,

 

Manny


_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics



 

--

Thank you.

Alex Druk, PhD.


_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics




--

Lane Rasberry

user:bluerasberry on Wikipedia


_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics