Thanks Nuria. I will write a similar pilot research project proposal with concrete parameters and send it over to the analytics@list.wikimedia.org for further review.


2014-08-05 8:49 GMT+01:00 Nuria Ruiz <nuria@wikimedia.org>:
I know this answer comes late as I was on vacation, sorry about that.

At this time the cluster is not ready to be accessed by users not in the analytics team as things are still WIP. Now, in order to get the data you rae interested in you can always ask the research team to retrieve it for you (this is what we did for our pilot, actually). 

Please e-mail: analytics@lists.wikimedia.org and let us know what you are interested in.


On Wed, Jul 30, 2014 at 8:40 PM, Pine W <wiki.pine@gmail.com> wrote:

Nuria and Andrew,

Forwarding a question from Han-teng below.

Pine

Dear Pine, 

   A humorous touch here in your most recent email: "*A $1 fine will be imposed by Oliver Keyes on anyone who misspells Leila's name or misdirects emails to the WMF Executive Director."

   I have one slightly more serious question, on the possibility to use the analytics infrastructure for the upcoming Hackathon. 

   My Hackathon wish is to duplicate and reapply what  Nuria Ruiz and Andrew Otto has done for NARA analytics pilot.  https://commons.wikimedia.org/wiki/Commons:GLAMwiki_Toolset_Project/NARA_analytics_pilot

   So to your knowledge, is it feasible to do so, in terms of (a) setting up basic access for other users to duplicate the pilot, (b) getting some help from Ruiz and/or Otto, and (c) setting up for other GLAM institution that is not NARA.

   Feel free to forward this email to   Nuria Ruiz and/or Andrew Otto because I do not have their contacts.

Best,

--
han-teng liao

"[O]nce the Imperial Institute of France and the Royal Society of London begin to work together on a new encyclopaedia, it will take less than a year to achieve a lasting peace between France and England." - Henri Saint-Simon (1810)

"A common ideology based on this Permanent World Encyclopaedia is a possible means, to some it seems the only means, of dissolving human conflict into unity." - H.G. Wells (1937)





2014-07-18 8:28 GMT+01:00 Pine W <wiki.pine@gmail.com>:
Thanks for this. Forwarding to Analytics and Research for others who are curious.

Pine


On Tue, Jul 15, 2014 at 9:29 AM, Rachel Farrand <rfarrand@wikimedia.org> wrote:
This Tech Talk will be starting in 30 minuets. Thanks!


On Fri, Jul 11, 2014 at 3:30 PM, Rachel Farrand <rfarrand@wikimedia.org>
wrote:

> Hello!
>
> Please join Nuria Ruiz and Andrew Otto next Tuesday, July 15th at 10am SF
> time/5pm UTC
> <http://www.timeanddate.com/worldclock/fixedtime.html?msg=Analytics+Tech+Talk&iso=20140715T10&p1=224&am=30>
> for a 30 min tech talk. You can join our hangout or follow along on
> youtube:
https://plus.google.com/u/0/b/103470172168784626509/events/c53ho5esd0luccd09a1c30rlrmg
> (please note that a link to join the hangout will be posted in the comments
> of this event just as it starts).
>
> You can follow ask questions on IRC during the talk in #wikimedia-dev.
>
> If you are not able to follow along live, a video recording will be posted
> here
> <https://plus.google.com/u/0/b/103470172168784626509/103470172168784626509/videos>,
> to the MediaWiki YouTube channel immediately following the tech talk for
> you to view at any time.
>
> More information about the tech talk:
>
> *Hadoop and Beyond. An overview of Analytics infrastructure*In this tech
> talk we will be presenting the analytics infrastructure that we have
> recently rolled out in production. By now probably everybody knows that
> wikimedia hosts an instance of hadoop from which we are going to extract
> pageview data in the near future. But .. how exactly does the data get
> there?
>
> We will go over the path that webrequest log data takes from varnish to
> kafka (a distributed log buffer) to hadoop and the challenges of deploying
> this java-based infrastructure in production. We will also talk about how
> can we query the data with hive, an SQL-like interface. How can you set up
> this stack on vagrant to play with and, last but not least, how we used
> hive recently to provide GLAM folks with image view stats:
https://commons.wikimedia.org/wiki/Commons:GLAMwiki_Toolset_Project/NARA_analytics_pilot
>
> Thanks!
>
>
_______________________________________________
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l