On 9/26/07, Magnus Manske magnusmanske@googlemail.com wrote:
On 9/26/07, David Gerard dgerard@gmail.com wrote:
On 26/09/2007, David Gerard dgerard@gmail.com wrote:
On 26/09/2007, Magnus Manske magnusmanske@googlemail.com wrote:
Try http://tools.wikimedia.de/~magnus/missing_images.php with "Living people" as category, and check "Skip articles that have an image". Will take a while, but it's rather comprehensive :-)
Excellent! Is there any way to get that data as just a list of article names, rather than as formatted HTML?
Oh - does that use a database dump or the live database? It doesn't pick up articles that already have a "Replace this image" placeholder, but maybe that's just new ones.
It uses the toolserver database, which has some replication lag, depending on the language (currently up to two days, but decreasing).
I can add optional CSV or wiki output. Which one would you prefer?
CSV is better for AWB, pywikipedia takes flat text input (one entry per line, no brackets)