Thanks for sending me to https://phabricator.wikimedia.org/T149021! That seems to answer the question I forgot to ask: does the mediawiki_history table include creation of deleted pages, and it looks like it does. I'll reuse the query and findings from that task then. Always great to find shortcuts like that, thanks again!


Cheers,
Morten


On 14 August 2017 at 08:00, Nuria Ruiz <nuria@wikimedia.org> wrote:
>Would there happen to be a dataset of that available somewhere?

Data is available on public labs replicas but sql is complicated to write and likely to time out due the volume of data that is combing. Data is also available on Hadoop Data Lake which is not public yet (it is our plan to make it so). This data has already been used to gather such a stats. See: https://phabricator.wikimedia.org/T149021

On Sun, Aug 13, 2017 at 10:10 AM, Morten Wang <nettrom@gmail.com> wrote:
Hello everyone,

I'm currently working gathering data for the Autoconfirmed article creation trial project[1]. One of the measures we're interested in is the number of new articles, both surviving and deleted, that is created per day. I know that recent data is logged through EventBus, but if possible I'd would also like to have historic stats on this (e.g. going back a handful of years). Would there happen to be a dataset of that available somewhere?


References:

Cheers,
Morten

_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics



_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics