Here's a related bit of work: https://meta.wikimedia.org/wiki/Research:Wikipedia_article_creation
In this research project, I used a mix of both the deletion log and the archive table to get a sense for when pages were being deleted.
Ultimately, I found that the easiest deletion event to operationalize was to look at the most recent ar_timestamp for a page in the archive table. I could only go back to 2008 with this metric because the archive table didn't exist before then.
The archive table is available in quarry. See https://quarry.wmflabs.org/query/38414 for an example query that gets the timestamp of an article's last revision.
The logging table is also in quarry. See https://quarry.wmflabs.org/query/38415 for an example query that gets deletion events.
On Tue, Aug 13, 2019 at 2:51 PM Haifeng Zhang haifeng1@andrew.cmu.edu wrote:
Dear all,
Is there an easy way to get the number of articles deleted over time (e.g., month) in Wikipedia?
Can I use Quarry? What tables should I use?
Thanks,
Haifeng Zhang _______________________________________________ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l