Thanks Erik. This is exactly what I'm looking for. Thanks for the help for
everybody else as well.
On Sat, Jul 9, 2016 at 1:42 PM, Erik Zachte <ezachte(a)wikimedia.org> wrote:
Manny,
If you're willing to download large files, monthly totals are here
https://dumps.wikimedia.org/other/pagecounts-ez/merged/
pagecounts-2016-06-views-ge-5.bz2
<https://dumps.wikimedia.org/other/pagecounts-ez/merged/pagecounts-2016-06-views-ge-5.bz2>
has all titles with 5 or more requests per month
It contains monthly totals, plus efficiently packed hourly data, all in
one line.
pagecounts-2016-06-views-ge-5-totals.bz2
<https://dumps.wikimedia.org/other/pagecounts-ez/merged/pagecounts-2016-06-views-ge-5-totals.bz2>
is derived from the previous file, with hourly data stripped.
But monthly totals in the API would also be great (and asked for by
several people ; I count Magnus Manske as one, but his audience would
rejoice as well ;-)
Cheers
Erik
*From:* Analytics [mailto:analytics-bounces@lists.wikimedia.org] *On
Behalf Of *Lane Rasberry
*Sent:* Saturday, July 09, 2016 15:01
*To:* A mailing list for the Analytics Team at WMF and everybody who has
an interest in Wikipedia and analytics.
*Subject:* Re: [Analytics] Wiki Page Views Project
I collected some non-technical tools at
<https://meta.wikimedia.org/wiki/Traffic_reporting>
On Sat, Jul 9, 2016 at 8:29 AM, Dan Andreescu <dandreescu(a)wikimedia.org>
wrote:
If those two don't help there is some raw data available here:
https://dumps.wikimedia.org/other/analytics/
Those files are also hourly but you make a good case for us including
monthly totals, so we're open to that if you can't use the other resources.
(I work on the analytics team)
On Saturday, July 9, 2016, Alex Druk <alex.druk(a)gmail.com> wrote:
Hi Manny,
Have a look at
http://www.wikipediatrends.com.
Alex
On Sat, Jul 9, 2016 at 4:22 AM, Manny Manny <mannya897(a)gmail.com> wrote:
Hello All,
I am working on a project the uses page view numbers for wiki articles and
I was hoping somebody could help me out. I am using wikipedia redirects to
find aliases for query names. Unfortunately there is a lot of noise in the
redirects. I was hoping to use the page views as a heuristic to weed out
bad redirects. I was looking at the page view files but the ones on
stats.grok.se are hourly which is too much to process in a reasonable
amount of time. I was wondering if anybody had (or knew where I could
access) page view files for a longer amount of time like yearly, monthly,
or even daily. I need to able to download the file locally because I will
be dealing with a lot of query names. I appreciate any help you can
provide.
Thanks,
Manny
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
--
Thank you.
Alex Druk, PhD.
wikipediatrens.com
alex.druk(a)gmail.com
(775) 237-8550 Google voice
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
--
Lane Rasberry
user:bluerasberry on Wikipedia
206.801.0814
lane(a)bluerasberry.com
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics