I'd be willing to work on this on a volunteer basis.
I developed
http://toolserver.org/~emw/wikistats/, a page view analysis tool
that incorporates lots of features that have been requested of Henrik's tool.
The main bottleneck has been that, like MZMcBride mentions, an underlying
database of page view data is unavailable. Henrik's JSON API has limitations
probably tied to the underlying data model. The fact that there aren't any
other such API's is arguably the bigger problem.
I wrote down some initial thoughts on how this data reliability, and WMF's page
view data services generally, could be improved at
http://en.wikipedia.org/w/index.php?title=User_talk:Emw&oldid=442596566…rver.
I've also drafted more specific implementation plans. These plans assume that
I would be working with the basic data in Domas's archives. There is still a
lot of untapped information in that data -- e.g. hourly views -- and potential
for mashups with categories, automated inference of trend causes, etc. If more
detailed (but still anonymized) OWA data were available, however, that would
obviously open up the potential for much richer APIs and analysis.
Getting the archived page view data into a database seems very doable. This
data seems like it would be useful even if there were OWA data available, since
that OWA data wouldn't cover 12/2007 through 2009. As I see it, the main thing
needed from WMF would be storage space on a publicly-available server. Then,
optionally, maybe some funds for the cost of cloud services to process and
compress the data, and put it into a database. Input and advice would be
invaluable, too.
Eric