Hi,
Are there any easy to see statistics about the survival rate of newly-created pages in Wikipedias in different languages?
I need this for understanding the success of ContentTranslation, which is primarily an article creation tool
I couldn't find something like this in stats.wikimedia.org. It does have the number of created pages per day. For en.wikipedia, for example, it's about 800. But how many are deleted the same day ("speedy")? Knowing that alone would be very useful, and there are other possible questions, such as: How many are deleted within a week or a month? What is the age distribution of the articles that are deleted every day - how many of them were created the same day, how many were created a year ago, and so on.
Using a simple (and possibly wrong - I don't do this often) query,[1] I found that around 500 or 600 deletions happen each day in the English Wikipedia. Does this sound sensible? Is there a better query that I could run, or a dashboard where I could see such a thing conveniently? And of course, I'd love to see it for all languages and not just English.
Thanks for any help!
[1] SELECT max(ar_id), ar_title, ar_timestamp FROM `archive` WHERE ar_namespace = 0 and ar_timestamp between 20150521000000 and 20150521999999 group by ar_title ORDER BY NULL;
-- Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי http://aharoni.wordpress.com “We're living in pieces, I want to live in peace.” – T. Moore
I'd suggest looking for deleted revisions where the oldid is 0; that's a reliable basis (ish) for identifying page deletions without duplication.
There are certainly no dashboards, but Aaron had done a one-off research project on this that lives somewhere on Meta; I don't have the URL, but he will!
On 31 May 2015 at 08:26, Amir E. Aharoni amir.aharoni@mail.huji.ac.il wrote:
Hi,
Are there any easy to see statistics about the survival rate of newly-created pages in Wikipedias in different languages?
I need this for understanding the success of ContentTranslation, which is primarily an article creation tool
I couldn't find something like this in stats.wikimedia.org. It does have the number of created pages per day. For en.wikipedia, for example, it's about 800. But how many are deleted the same day ("speedy")? Knowing that alone would be very useful, and there are other possible questions, such as: How many are deleted within a week or a month? What is the age distribution of the articles that are deleted every day - how many of them were created the same day, how many were created a year ago, and so on.
Using a simple (and possibly wrong - I don't do this often) query,[1] I found that around 500 or 600 deletions happen each day in the English Wikipedia. Does this sound sensible? Is there a better query that I could run, or a dashboard where I could see such a thing conveniently? And of course, I'd love to see it for all languages and not just English.
Thanks for any help!
[1] SELECT max(ar_id), ar_title, ar_timestamp FROM `archive` WHERE ar_namespace = 0 and ar_timestamp between 20150521000000 and 20150521999999 group by ar_title ORDER BY NULL;
-- Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי http://aharoni.wordpress.com “We're living in pieces, I want to live in peace.” – T. Moore
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
You can presumably replicate https://meta.wikimedia.org/wiki/Research:Wikipedia_article_creation for other languages with the code provided, if you're interested in more than the 8 or so biggest Wikipedias.
Or you can ask the speedy deletion Wikias: http://speedydeletion.wikia.com/wiki/Speedy_deletion_Wiki/Network
Nemo
We have a nice set of Event Logging schemas for this now. This will help you to not need to do all of the analytical backflips that I did.
See
- https://meta.wikimedia.org/wiki/Schema:PageCreation - https://meta.wikimedia.org/wiki/Schema:PageMove - https://meta.wikimedia.org/wiki/Schema:PageDeletion - https://meta.wikimedia.org/wiki/Schema:PageRestoration
On Sun, May 31, 2015 at 10:17 AM, Federico Leva (Nemo) nemowiki@gmail.com wrote:
You can presumably replicate https://meta.wikimedia.org/wiki/Research:Wikipedia_article_creation for other languages with the code provided, if you're interested in more than the 8 or so biggest Wikipedias.
Or you can ask the speedy deletion Wikias: http://speedydeletion.wikia.com/wiki/Speedy_deletion_Wiki/Network
Nemo
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics