@Hay: Thank you. The site is still in a very early stage of development. We would like to get constructive criticism from the Wikipedians on this list first. Our resources are very limited and we cannot include all the other wiki projects now. What languages would you like to see first? What projects? Commons is a good idea, but I am not sure many people use these stats.
On Tue, Mar 25, 2014 at 12:00 PM, Hay (Husky) huskyr@gmail.com wrote:
@Alex: that's really awesome. Thanks for providing a stats.grok.se alternative. Really looking forward to other languages as well, and maybe throw in Commons in the mix as well?
-- Hay
On Tue, Mar 25, 2014 at 11:06 AM, Alex Druk alex.druk@gmail.com wrote:
Hi Magnus,
Only en.wp for now. We will wait and see how popular it is before adding other projects. You cannot limit the data range in CSV yet, but the size of the response is usually under 10 KB.
By the way, many thanks for your great work!
Regards, Alex
On Tue, Mar 25, 2014 at 10:53 AM, Magnus Manske magnusmanske@googlemail.com wrote:
Quick question: Does this have en.wp data only, or can I query (as in CSV) other wikipedias/projects? And, can I limit the data range (not really necessary, but less data to transmit)?
On Tue, Mar 25, 2014 at 9:09 AM, Alex Druk alex.druk@gmail.com wrote:
Hi Burton,
We just opened a new site, www.wikipediatrends.com, that shows Wikipedia page view data. Our site is very similar to the existing http://tools.wmflabs.org/wikiviewstats/ and http://stats.grok.se/, but uses a slightly different approach to calculating and presenting data, and also allows comparison of different articles.
I hope it will serve your purpose. I am ready to discuss integration off the list.
Alex Druk
On Mon, Mar 24, 2014 at 11:40 PM, Burton DeWilde burton@harmony-institute.org wrote:
Dear Toby,
I recently saw your comment on a blog post by Magnus Manske regarding the lack of Wikipedia page view data besides the oft-overloaded http://stats.grok.se/. I was wondering if there's been any progress at WMF on building a more stable, central, and complete source for this data?
I ask because I'm a data scientist at a small research non-profit called Harmony Institute, where we study the social impact of media (primarily television and film). I'm currently building an interactive web app that visualizes social impact on a variety of issues by many documentary films.
One indicator of interest is "information-seeking behavior," i.e. are audiences seeking out information about a film or issue. Besides search trends, an excellent proxy for this is Wikipedia page views for both film pages, e.g. Escape Fire, and issue-related pages, e.g. Health care reform.
I'm currently trying to use stats.grok.se to grab raw data in JSON form; unfortunately, the site almost always responds with "Server overloaded, please throttle your requests," and no amount of throttling seems to suffice. I'm aware that there are many TBs of raw data for the downloading, but I don't have the resources to handle that much data, nor do I need more than the tiniest fraction of it.
I would love to show Wikipedia page view statistics for film pages in our app. If you have any updates on progress or suggestions on how I might do this, I would be very appreciative.
Thanks very much for your and all of WMF's hard work - I'm a proud donor to the cause. :)
Best, Burton DeWilde
-- Burton DeWilde
Data Scientist, Harmony Institute, harmony-institute.org
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
-- Thank you.
Alex Druk alex.druk@gmail.com (775) 237-8550 Google voice