Glad to be helpful!😁 It is interesting that, contrary to my predictions, number of visitors to wikipediatrends.com and downloads did not drop when API became available. I am wondering if stats.grok.se notice significant drop. Alex
On Fri, Jul 29, 2016 at 6:08 PM, Dan Andreescu dandreescu@wikimedia.org wrote:
That's very useful, Alex, thanks! I guess those requests would need to be covered by dumps anyway, since we only have data back to 2015. I'll ping Henrik too.
On Fri, Jul 29, 2016 at 12:06 PM, Alex Druk alex.druk@gmail.com wrote:
Maybe it make sense to ask Henrik (stats.grok.se) for his download stats. We at wikipediatrends.com usually receive < 5 request/month for full data (from 2008).
On Fri, Jul 29, 2016 at 2:40 PM, Dan Andreescu dandreescu@wikimedia.org wrote:
Dear Pageview API consumers,
We would like to plan storage capacity for our pageview API cluster. Right now, with a reliable RAID setup, we can keep *18 months* of data. If you'd like to query further back than that, you can download dump files (which we'll make easier to use with python utilities).
What do you think? Will you need more than 18 months of data? If so, we need to add more nodes when we get to that point, and that costs money, so we want to check if there is a real need for it.
Another option is to start degrading the resolution for older data (only keep weekly or monthly for data older than 1 year for example). If you need more than 18 months, we'd love to hear your use case and something in the form of:
need daily resolution for 1 year need weekly resolution for 2 years need monthly resolution for 3 years
Thank you!
Dan
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
-- Thank you.
Alex Druk alex.druk@gmail.com (775) 237-8550 Google voice
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics