In my short experience studying this, there are some articles that might be
deleted very quickly, not even waiting 30 minutes.
If I look here at monthly dumps,
http://dumps.wikimedia.org/enwiki/ then
we would be missing many articles that were created and deleted between the
two dumps. We could look in total what articles were deleted
Historical data, I guess they could be extracted from older dumps, to
extract the list of articles tagged with categories based on a series of
dumps.
right now I am pulling them every thirty minutes, it would be possible to
scan historical dumps and find any articles that are no longer in the newer
dumps.
The deletion logs here
http://en.wikipedia.org/w/index.php?title=Special:Log/delete we could scan
those, but as i said, how to get the text?
the CPU usage for something like that would go way over my current
processing. we could as I said install in on the toolserver, I would have
to work on the code for a bit first.
so, this comes down to the question, do we have a full log of the deleted
articles ?
thanks,
mike
On Mon, Jun 11, 2012 at 5:49 AM, Samuel Klein <meta.sj(a)gmail.com> wrote:
This is great. Thank you, Mike! It would be nice to
see this done
for historically speedied articles, too. Sam.
On Sun, Jun 10, 2012 at 3:37 AM, Mike Dupont
<jamesmikedupont(a)googlemail.com> wrote:
Hi,
I have launched
speedydeletion.wika.com , it is updated every 30 minutes
with the proposed deletions and speedy deletion articles (not notable and
hoaxes, not others).
it is running on the
en.wikipedia.org. the sources for the script are
all
on git hub and are a merger of pywikipediabot and
the wikiteam codebases.
hope you enjoy it,
thanks,
mike
--
James Michael DuPont
Member of Free Libre Open Source Software Kosova
http://flossk.org
Contributor FOSM, the CC-BY-SA map of the world
http://fosm.org
Mozilla Rep
https://reps.mozilla.org/u/h4ck3rm1k3
_______________________________________________
Wikimedia-l mailing list
Wikimedia-l(a)lists.wikimedia.org
Unsubscribe:
https://lists.wikimedia.org/mailman/listinfo/wikimedia-l
--
Samuel Klein identi.ca:sj w:user:sj +1 617
529 4266
_______________________________________________
Wikimedia-l mailing list
Wikimedia-l(a)lists.wikimedia.org
Unsubscribe:
https://lists.wikimedia.org/mailman/listinfo/wikimedia-l
--
James Michael DuPont
Member of Free Libre Open Source Software Kosova
http://flossk.org
Contributor FOSM, the CC-BY-SA map of the world
http://fosm.org
Mozilla Rep
https://reps.mozilla.org/u/h4ck3rm1k3