Analytics folks, what would you say to setting aside some time to collaborate on this? Maybe pair a mobile web engineer with an analytics engineer for a sprint or two? It would be great if we can get this figured out sooner rather than later since we rely so heavily on the data and its presentation.


On Thu, Nov 14, 2013 at 6:40 AM, Dan Andreescu <dandreescu@wikimedia.org> wrote:


> > > We
> > > could start with a spike investigating if there is a framework for
> > > aggregating the sums [...]

Our approaches are hard-wired into our legacy code. So we do not use a
common, solid framework for it.

I haven't done any research on whether or not such frameworks
exist. But if you find some good framework, please let us know, it
would certainly be interesting.

There are certainly products like Cassandra and Spark that make working with big (or bunches of small) data easy and fast.

There are more sophisticated but less mature products like Druid that work with dimensional data.

We have solid options, we just have to decide that this is a priority and move on it.  The pageviews API sprint was nice but we abandoned it after a week of work because of changed priorities.



--
Arthur Richards
Software Engineer, Mobile
[[User:Awjrichards]]
IRC: awjr
+1-415-839-6885 x6687