Thank you both for the great pointers!
One more question: if I were interested in the counts based on the
country of origin of a user, is this data publicly available?
Best,
Gheorghe
On Sep 29, 2016 01:36, "Joseph Allemandou" <jallemandou(a)wikimedia.org>
wrote:
Hello Gheorghe,
What Dan said, plus a goodie for easy manual comparisons: the pageview
viewer tool <https://tools.wmflabs.org/pageviews>
For instance, "Suicide Squad (film)" vs "Jason Bourne (film)", user-only
pageviews (no explicit bots) for July 2016:
https://tools.wmflabs.org/pageviews/?project=en.wikipedia.org&platform=all-access&agent=user&start=2016-07-01&end=2016-07-31&pages=Suicide_Squad_(film)|Jason_Bourne_(film)
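A comparison link of this shape can also be assembled programmatically. The following is only a minimal sketch: the query parameter names are copied from the link in this message, while the helper name `pageviews_url` and its defaults are made up for illustration and are not part of the tool.

```python
from urllib.parse import urlencode

# Base address of the pageview viewer tool, as linked in this message.
BASE_URL = "https://tools.wmflabs.org/pageviews/"


def pageviews_url(pages, start, end, project="en.wikipedia.org",
                  platform="all-access", agent="user"):
    """Build a comparison link (hypothetical helper, not part of the tool).

    The parameter names mirror the query string of the example link.
    """
    params = {
        "project": project,
        "platform": platform,
        "agent": agent,        # "user" excludes explicitly flagged bots
        "start": start,        # dates as YYYY-MM-DD
        "end": end,
        "pages": "|".join(pages),  # page titles are separated by "|"
    }
    # Keep parentheses and the pipe unescaped so the link stays readable.
    return BASE_URL + "?" + urlencode(params, safe="()|")


url = pageviews_url(
    ["Suicide_Squad_(film)", "Jason_Bourne_(film)"],
    start="2016-07-01",
    end="2016-07-31",
)
print(url)
```

With the arguments shown, the printed link is the same as the one given above.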
Cheers
Joseph
On Thu, Sep 29, 2016 at 2:37 AM, Dan Andreescu <dandreescu(a)wikimedia.org> wrote:
> Hello Gheorghe, that dataset is deprecated and we have a much cleaner
> one in the same format. Check out:
>
> * the new landing page for analytics dumps:
>   dumps.wikimedia.org/other/analytics and specifically:
>   dumps.wikimedia.org/other/pageviews
>
> * the in-depth documentation of the different datasets we provide:
>   wikitech.wikimedia.org/wiki/Analytics/Data
>
> *From: *Gheorghe Postelnicu
> *Sent: *Wednesday, September 28, 2016 20:32
> *To: *analytics(a)lists.wikimedia.org
> *Reply To: *A mailing list for the Analytics Team at WMF and
> everybody who has an interest in Wikipedia and analytics.
> *Subject: *[Analytics] Question re. PageCounts
>
> Hello,
>
> First of all, many thanks for this wonderful project!
>
> I am writing as I downloaded the July pagecounts data from:
>
> https://dumps.wikimedia.org/other/pagecounts-raw/2016/2016-07/
>
> As I was browsing it, I was surprised to notice that some entities,
> such as the movie "Suicide Squad", only seem to have gotten very sparse
> views in July - see below. In comparison, the clicks for Jason Bourne seem
> to have been much higher for the same period. Below are lines from the logs.
>
> Am I doing something wrong?
>
> Many thanks in advance,
> Gheorghe
>
> *Suicide Squad*:
>
> (pagecounts-20160727-020000.gz,en Suicide_squad_(film) 1 6614)
>
> (pagecounts-20160727-160000.gz,en Suicide_squad_(film) 1 25599)
>
> (pagecounts-20160728-220000.gz,en Suicide_squad_(film) 2 32210)
>
> (pagecounts-20160731-210000.gz,en Suicide_squad_(film) 11 72721)
>
>
> *Jason Bourne*:
>
> (pagecounts-20160731-210000.gz,sv Jason_Bourne_(film) 12 124894)
>
> (pagecounts-20160731-210000.gz,tr Jason_Bourne_(film) 78 1852192)
>
> (pagecounts-20160731-220000.gz,en File:Jason_Bourne_(film).jpg 2 19067)
>
> (pagecounts-20160731-220000.gz,en Jason_Bourne_(film) 2119 73275075)
>
> (pagecounts-20160731-220000.gz,en Talk:Jason_Bourne_(film) 1 10059)
>
> (pagecounts-20160731-220000.gz,fr Jason_Bourne_(film) 55 1226127)
>
> (pagecounts-20160731-220000.gz,hu Jason_Bourne_(film) 3 34335)
>
> (pagecounts-20160731-220000.gz,it Jason_Bourne_(film) 29 579129)
>
> (pagecounts-20160731-220000.gz,nl Jason_Bourne_(film) 11 125928)
>
>
> _______________________________________________
> Analytics mailing list
> Analytics(a)lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
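For anyone who wants to total the quoted log lines by hand, here is a rough sketch. It assumes each line is a "(file,record)" pair as pasted above, and that the record follows the usual pagecounts layout of project code, page title, request count and bytes transferred. The names `parse_line` and `views_by_page` are illustrative only. Titles are compared as exact strings, so different spellings of a title are counted separately.

```python
import re
from collections import defaultdict

# One quoted line: "(<file>,<project> <title> <views> <bytes>)".
# The outer parentheses are optional because some pasted lines lack them.
LINE_RE = re.compile(
    r"\(?(?P<file>pagecounts-\d{8}-\d{6}\.gz),"
    r"(?P<project>\S+) (?P<title>\S+) (?P<views>\d+) (?P<bytes>\d+)\)?"
)


def parse_line(line):
    """Return (file, project, title, views, bytes) or None if no match."""
    m = LINE_RE.fullmatch(line.strip())
    if m is None:
        return None
    return (m["file"], m["project"], m["title"],
            int(m["views"]), int(m["bytes"]))


def views_by_page(lines):
    """Sum the request counts per (project, title) pair."""
    totals = defaultdict(int)
    for line in lines:
        record = parse_line(line)
        if record is not None:
            _, project, title, views, _ = record
            totals[(project, title)] += views
    return dict(totals)


# A few of the lines quoted in this thread.
sample = [
    "(pagecounts-20160727-020000.gz,en Suicide_squad_(film) 1 6614)",
    "(pagecounts-20160727-160000.gz,en Suicide_squad_(film) 1 25599)",
    "(pagecounts-20160728-220000.gz,en Suicide_squad_(film) 2 32210)",
    "(pagecounts-20160731-210000.gz,en Suicide_squad_(film) 11 72721)",
    "(pagecounts-20160731-220000.gz,en Jason_Bourne_(film) 2119 73275075)",
]
totals = views_by_page(sample)
for key, views in sorted(totals.items()):
    print(key, views)
```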
--
*Joseph Allemandou*
Data Engineer @ Wikimedia Foundation
IRC: joal