>Would there happen to be a dataset of that available somewhere?

The data is available on the public Labs replicas, but the SQL is complicated to write and likely to time out because of the volume of data it has to comb through. The data is also available in the Hadoop Data Lake, which is not public yet (we plan to make it so). It has already been used to gather such stats. See: https://phabricator.wikimedia.org/T149021
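
To give a rough idea of the shape of such a query, here is a minimal sketch run against a toy in-memory SQLite database. It is not the query behind the task linked above. The tables only mimic a small subset of MediaWiki's page, revision and archive columns, the sample rows are made up, and the function name creations_per_day is hypothetical. It assumes that a first revision (parent id 0) in the main namespace counts as an article creation, and it ignores redirects, page moves and restored pages, which a real query on the replicas would have to handle.

```python
import sqlite3

# Toy schema: a small subset of MediaWiki-style columns, for illustration only.
SCHEMA = """
CREATE TABLE page (page_id INTEGER PRIMARY KEY, page_namespace INTEGER);
CREATE TABLE revision (rev_id INTEGER PRIMARY KEY, rev_page INTEGER,
                       rev_parent_id INTEGER, rev_timestamp TEXT);
CREATE TABLE archive (ar_id INTEGER PRIMARY KEY, ar_namespace INTEGER,
                      ar_parent_id INTEGER, ar_timestamp TEXT);
"""

# Surviving creations come from revision/page, deleted ones from archive.
# Timestamps are YYYYMMDDHHMMSS strings, so the first 8 characters are the day.
QUERY = """
SELECT day, SUM(surviving), SUM(deleted) FROM (
    SELECT substr(rev_timestamp, 1, 8) AS day, 1 AS surviving, 0 AS deleted
    FROM revision JOIN page ON rev_page = page_id
    WHERE rev_parent_id = 0 AND page_namespace = 0
    UNION ALL
    SELECT substr(ar_timestamp, 1, 8), 0, 1
    FROM archive
    WHERE ar_parent_id = 0 AND ar_namespace = 0
) GROUP BY day ORDER BY day
"""


def creations_per_day(conn):
    """Return (day, surviving_count, deleted_count) rows, ordered by day."""
    return conn.execute(QUERY).fetchall()


def build_sample_db():
    """Create an in-memory database filled with made-up sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO page VALUES (?, ?)",
                     [(1, 0), (2, 0), (3, 1)])  # page 3 is a talk page
    conn.executemany("INSERT INTO revision VALUES (?, ?, ?, ?)", [
        (10, 1, 0, "20170801120000"),   # creation of page 1
        (11, 1, 10, "20170802090000"),  # later edit, not a creation
        (12, 2, 0, "20170801150000"),   # creation of page 2
        (13, 3, 0, "20170801160000"),   # creation outside the main namespace
    ])
    conn.executemany("INSERT INTO archive VALUES (?, ?, ?, ?)", [
        (20, 0, 0, "20170801130000"),   # deleted article, created Aug 1
        (21, 0, 0, "20170802100000"),   # deleted article, created Aug 2
        (22, 0, 21, "20170802110000"),  # later edit to a deleted article
    ])
    return conn


rows = creations_per_day(build_sample_db())
for day, surviving, deleted in rows:
    print(day, surviving, deleted)
```

On the real replicas, the join and the scan over the full revision history are what make a query like this slow, which is the reason for pointing at the Data Lake instead.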

On Sun, Aug 13, 2017 at 10:10 AM, Morten Wang <nettrom@gmail.com> wrote:
Hello everyone,

I'm currently working on gathering data for the Autoconfirmed article creation trial project[1]. One of the measures we're interested in is the number of new articles, both surviving and deleted, that are created per day. I know that recent data is logged through EventBus, but if possible I would also like to have historic stats on this (e.g. going back a handful of years). Would there happen to be a dataset of that available somewhere?


References:
1: https://meta.wikimedia.org/wiki/Research:Autoconfirmed_article_creation_trial

Cheers,
Morten

_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics