Hello!
Please post questions such us these to analytics@ e-mail list.
A) ACCESS TRAFFIC
We are interested in wikipedia clicks per world-wide geographical unit
(sub-national, ideally) and per category (
https://en.wikipedia.org/
wiki/Portal:Contents/Categories).
Is there a way to stream (maybe per API) the origin of
wikipedia clicks in
a specific category? Instead it would already be great for us to
find a
statistic that shows regional >access statistics on wikipedia in general
over time.
We do not track click data per category, in fact, as far as I know, very
few things are kept track "per category" (someone will correct me if I am
wrong, I'm sure).
Now, maybe someone on this list can point you to alredy compiled data by
other research project on that regard.
Some data that might be of interest:
Pageviews per country:
https://stats.wikimedia.org/wikimedia/squids/SquidReportPageViewsPerCountry…
Pageviews per article can be obtained either from dumps of pageview API>
data is updated daily but it does not have a country dimension:
https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI
There is a clickstream dataset that does not have geograhic information
either:
https://datahub.io/dataset/wikipedia-clickstream
B) EDITING TRAFFIC
As far as we have seen, it is possible to access the editing statistics
(often with
IP addresses given) for the last 30 days
(
https://en.wikipedia.org/w/index.php?namespace=&tagfilter=&days=30&…
).
Is there likewise a tool or API to steam the editing
process of wikipedia
(IP and editor name, if registered)?
Either there is a user id, or an IP, not both. You only get IPs from
anonymous editors.
You can consume this data on an stream format, see docs here:
https://wikitech.wikimedia.org/wiki/EventStreams#JavaScript
We have recently launched this service and our docs need some work, we will
document in a more friendly way what streams are available but the example
I linked to should work and includes recent changes.
Thanks,
Nuria
On Wed, Feb 8, 2017 at 6:23 PM, Erik Zachte <ezachte(a)wikimedia.org> wrote:
> Dear Fabian,
>
>
>
> Let me relay your question to the WMF Analytics Team.
>
>
>
> Best regards,
>
> Erik Zachte
>
>
>
> *From:* Fabian Stephany [mailto:fns27@cam.ac.uk]
> *Sent:* Sunday, February 05, 2017 20:48
> *To:* erikzachte(a)infodisiac.com; erikzachte(a)wikimedia.org
> *Cc:* Fabian Braesemann
> *Subject:* Research on wikipedia traffic and educational quality
>
>
>
> Dear Erik,
>
>
>
> please allow me to contact you regarding your work for wikipedia (
>
https://stats.wikimedia.org/ <http://stats.grok.se/>). My colleague and
> me (Oxford Internet Institute/University of Cambridge/Wittgenstein Centre
> Vienna) are about to start a research project on educational quality and
> wikipedia traffic.
>
>
>
> Hopefully, you find the time give us some advise on our questions
> regarding wikipedia access and edit traffic. We want to look at the
> category and geographical origin of wikipedia article access and editing.
>
>
>
> A) ACCESS TRAFFIC
>
>
>
> We are interested in wikipedia clicks per world-wide geographical unit
> (sub-national, ideally) and per category (
https://en.wikipedia.org/
> wiki/Portal:Contents/Categories).
>
> Is there a way to stream (maybe per API) the origin of wikipedia clicks in
> a specific category? Instead it would already be great for us to find a
> statistic that shows regional access statistics on wikipedia in general
> over time.
>
>
>
> B) EDITING TRAFFIC
>
>
>
> As far as we have seen, it is possible to access the editing statistics
> (often with IP addresses given) for the last 30 days (
>
https://en.wikipedia.org/w/index.php?namespace=&tagfilter=&days=30&…
> Special%3ARecentChanges
>
<https://en.wikipedia.org/w/index.php?namespace=&tagfilter=&days=30&title=Special:RecentChanges>).
Is there likewise a tool or API to steam the editing
process of wikipedia
> (IP and editor name, if registered)?
>
>
>
>
>
> Thank you very much for your help or likewise for suggesting somebody, who
> could help us out.
>
>
>
> Best wishes,
>
>
>
> Fabian Stephany, PhD MSc, MPhil Cantab
>
>
> fns27(a)cam.ac.uk
> fabian.stephany(a)wu.ac.at
>
fabianstephany.com
>
> UK +44 776 3505 435 <+44%207763%20505435>
> AT +43 680 5015 960 <+43%20680%205015960>
> DE +49 176 3121 5012 <+49%20176%2031215012>
>
>
>
> _______________________________________________
> Analytics mailing list
> Analytics(a)lists.wikimedia.org
>
https://lists.wikimedia.org/mailman/listinfo/analytics
>
>