No, Gheorghe, geographic location is not publicly
available for privacy
reasons. Even in aggregate it could be used to de-anonymize other data
like public editing activity. We do have an algorithm that we believe
makes geo data safe for public consumption, and we're working on that. But
it's very unlikely to make data available at the page level for pages with
relatively low traffic.
On Thu, Sep 29, 2016 at 10:09 AM, Gheorghe Postelnicu <
gheorghe.postelnicu(a)gmail.com> wrote:
Thank you both for the great pointers!
One more question: if I were interested in the counts based on the
country of origin of a user, is this data publicly available?
Best,
Gheorghe
On Sep 29, 2016 01:36, "Joseph Allemandou" <jallemandou(a)wikimedia.org>
wrote:
> Hello Gheorghe,
> What Dan said, plus a goodie for easy manual comparisons: the pageview
> viewer tool <https://tools.wmflabs.org/pageviews>
> For instance, "Suicide Squad (film)" vs "Json Bourne (film)",
user
> only pageviews (no explicit bots) for July 2016:
>
https://tools.wmflabs.org/pageviews/?project=en.wikipedia.or
> g&platform=all-access&agent=user&start=2016-07-01&end=2016-0
> 7-31&pages=Suicide_Squad_(film)|Jason_Bourne_(film)
> Cheers
> Joseph
>
> On Thu, Sep 29, 2016 at 2:37 AM, Dan Andreescu <
> dandreescu(a)wikimedia.org> wrote:
>
>> Hello Gheorghe, that dataset is deprecated and we have a much cleaner
>> one in the same format. Check out:
>>
>> * the new landing page for analytics dumps:
>>
dumps.wikimedia.org/other/analytics and specifically:
>>
dumps.wikimedia.org/other/pageviews
>>
>> * the in-depth documentation of the different datasets we provide:
>>
wikitech.wikimedia.org/wiki/Analytics/Data
>>
>> *From: *Gheorghe Postelnicu
>> *Sent: *Wednesday, September 28, 2016 20:32
>> *To: *analytics(a)lists.wikimedia.org
>> *Reply To: *A mailing list for the Analytics Team at WMF and
>> everybody who has an interest in Wikipedia and analytics.
>> *Subject: *[Analytics] Question re. PageCounts
>>
>> Hello,
>>
>> First of all, many thanks for this wonderful project!
>>
>> I am writing as I downloaded the July pagecounts data from:
>>
>>
https://dumps.wikimedia.org/other/pagecounts-raw/2016/2016-07/
>>
>> As I was browsing it, I was surprised to notice that some entities,
>> such as the movie "Suicide Squad", only seem to have gotten very
sparse
>> views in July - see below. In comparison, the clicks for Jason Bourne seem
>> to have been much higher for the same period. Below are lines from the logs.
>>
>> Am I doing something wrong?
>>
>> Many thanks in advance,
>> Gheorghe
>>
>> *Suicide Squad*:
>>
>> (pagecounts-20160727-020000.gz,en Suicide_squad_(film) 1 6614)
>>
>> (pagecounts-20160727-160000.gz,en Suicide_squad_(film) 1 25599)
>>
>> (pagecounts-20160728-220000.gz,en Suicide_squad_(film) 2 32210)
>>
>> (pagecounts-20160731-210000.gz,en Suicide_squad_(film) 11 72721)
>>
>>
>> *Jason Bourne*:
>>
>> pagecounts-20160731-210000.gz,sv Jason_Bourne_(film) 12 124894)
>>
>> (pagecounts-20160731-210000.gz,tr Jason_Bourne_(film) 78 1852192)
>>
>> (pagecounts-20160731-220000.gz,en File:Jason_Bourne_(film).jpg 2
>> 19067)
>>
>> (pagecounts-20160731-220000.gz,en Jason_Bourne_(film) 2119 73275075)
>>
>> (pagecounts-20160731-220000.gz,en Talk:Jason_Bourne_(film) 1 10059)
>>
>> pagecounts-20160731-220000.gz,fr Jason_Bourne_(film) 55 1226127)
>>
>> (pagecounts-20160731-220000.gz,hu Jason_Bourne_(film) 3 34335)
>>
>> (pagecounts-20160731-220000.gz,it Jason_Bourne_(film) 29 579129)
>>
>> (pagecounts-20160731-220000.gz,nl Jason_Bourne_(film) 11 125928)
>>
>>
>> _______________________________________________
>> Analytics mailing list
>> Analytics(a)lists.wikimedia.org
>>
https://lists.wikimedia.org/mailman/listinfo/analytics
>>
>>
>
>
> --
> *Joseph Allemandou*
> Data Engineer @ Wikimedia Foundation
> IRC: joal
>
> _______________________________________________
> Analytics mailing list
> Analytics(a)lists.wikimedia.org
>
https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org