One of the more interesting analytics stacks being built on top of Hadoop is coming out of UC Berkeley.
https://amplab.cs.berkeley.edu/software/
Particularly interesting is SparkR, which they just blogged about here:
https://amplab.cs.berkeley.edu/2014/01/26/large-scale-data-analysis-made-eas...
We don't want to get ahead of ourselves; after all, we're just starting to get some page view data into HDFS, but it's important to understand that one of the reasons we like Hadoop is the ecosystem of open source tools built around it.
-Toby
OO Tachyon looks nice.
On Jan 31, 2014, at 12:48 PM, Toby Negrin tnegrin@wikimedia.org wrote:
One of the more interesting analytics stacks being built on top of Hadoop is coming out of UC Berkeley.
https://amplab.cs.berkeley.edu/software/
Particularly interesting is SparkR, which they just blogged about here:
https://amplab.cs.berkeley.edu/2014/01/26/large-scale-data-analysis-made-eas...
We don't want to get ahead of ourselves; after all, we're just starting to get some page view data into HDFS, but it's important to understand that one of the reasons we like Hadoop is the ecosystem of open source tools built around it.
-Toby _______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics