On Nov 30, 2014, at 16:06, Aaron Halfaker
<ahalfaker(a)wikimedia.org> wrote:
Hey folks,
I just finished a blog post about how I'm incorporating hadoop streaming into my
workflow.
http://socio-technologist.blogspot.com/2014/11/fitting-hadoop-streaming-int…
<http://socio-technologist.blogspot.com/2014/11/fitting-hadoop-streaming-into-my-python.html>
TL;DR: I have strong opinions about Good Ways(TM) to process large datafiles in
interesting ways and hadoop streaming will support them nicely. :)
Props to ottomata for spending a bunch of time helping me get up to speed with our
cluster and to gage for making it easier to find hadoop's error messages.
-Aaron
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics