Oh, and in general, you can dump query results to a location on stat1002 that gets rsynced to a public place.  We need people to be very careful with that, though, so we usually want any code that does it to go through code review.  Reportupdater is a tool that lets you write SQL or bash scripts that make the process of "publishing" data a little easier.
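
To make that a bit more concrete, here is a rough sketch of the conversion step only, turning tab-separated query output into CSV.  It assumes the Hive CLI prints one tab-separated row per line.  The function name hive_tsv_to_csv and the sample rows are made up for illustration, and the sketch deliberately leaves out where the file should be written, since that depends on the rsynced directory you end up using.

```python
import csv
import io


def hive_tsv_to_csv(tsv_text, header):
    """Convert tab-separated rows to CSV text (hypothetical helper).

    Assumes one row per line with tab-separated fields, which is how
    the Hive CLI is assumed to print results here.
    """
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(header)
    for line in tsv_text.splitlines():
        if not line.strip():
            continue  # skip blank lines in the query output
        writer.writerow(line.split("\t"))
    return out.getvalue()


# Made-up sample rows, not real pageview numbers.
sample = "en.wikipedia\t100\nde.wikipedia\t20\n"
print(hive_tsv_to_csv(sample, ["project", "num"]), end="")
```

Anything along these lines that writes into the public location should still go through code review first.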

On Thu, Nov 5, 2015 at 9:45 AM, Dan Andreescu <dandreescu@wikimedia.org> wrote:
Max, there's a pageview API that we're not fully ready to announce because we haven't finished the documentation, but it works, so I'll tell you about it offline.  It has the data you're looking for in that query.

Anyone else who is interested in the API: we're just finishing up the docs and synchronizing with a blog post.  It won't be long now; the actual code and infrastructure are stable.

On Wed, Nov 4, 2015 at 7:50 PM, Max Semenik <maxsem.wiki@gmail.com> wrote:
Hey, I was wondering if it is possible to export the results of Hive queries to some world-readable place?

What I'm trying to achieve: for my www portals work, I want the results of aggregation (SELECT project, sum(view_count) AS num FROM projectview_hourly WHERE year=2015 AND month=11 AND day=3 GROUP BY project) published somewhere in a machine-readable format. Ideally, this could be published externally (for example, https://stats.wikimedia.org/daily_pageviews.csv or whatever). If that is hard, making it somehow available on the cluster would suffice. What are the options for doing that?

--
Best regards,
Max Semenik ([[User:MaxSem]])

_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics