On Wed, Sep 07, 2011 at 11:57:33AM +0100, Nuno Tavares wrote:
Maarten, this has already been done, actually :(
But which sortkey field are you talking about?
The text after the | is stored in the database as a sortkey. As an example, the following database query will give you the number of unique IDs photographed for WLM 2010 in NL:
SELECT COUNT(DISTINCT(clB.cl_sortkey))
FROM image
JOIN page ON img_name=page_title
JOIN categorylinks AS clA ON page_id=clA.cl_from
JOIN categorylinks AS clB ON page_id=clB.cl_from
WHERE page_namespace=6
  AND page_is_redirect=0
  AND clA.cl_to='Images_from_Wiki_Loves_Monuments'
  AND clB.cl_to='Rijksmonumenten_with_known_IDs'
  AND (
    ( img_timestamp > '20100901020000' AND img_timestamp < '20101001020000' )
    OR
    ( img_user_text = 'BotMultichillT' AND img_timestamp > '20101001020000' )
  );
The key part here is the COUNT(DISTINCT ...) on the sortkey.
Using the database that is available on the toolserver is certainly much quicker than scanning all the pages on Commons.
(Thanks to Maarten, who helped me with these queries last year.)
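
To illustrate why the distinct count matters, here is a minimal sketch in Python using an in-memory SQLite table. The table is a cut-down, hypothetical stand-in for categorylinks (only cl_from, cl_to and cl_sortkey), the rows are made up, and the helper name count_unique_ids is mine; the real toolserver schema and sortkey storage may differ.

```python
import sqlite3


def count_unique_ids(rows, category):
    """Count distinct sortkeys among the pages linked to one category."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical, simplified stand-in for the categorylinks table.
    conn.execute(
        "CREATE TABLE categorylinks (cl_from INTEGER, cl_to TEXT, cl_sortkey TEXT)"
    )
    conn.executemany("INSERT INTO categorylinks VALUES (?, ?, ?)", rows)
    (count,) = conn.execute(
        "SELECT COUNT(DISTINCT cl_sortkey) FROM categorylinks WHERE cl_to = ?",
        (category,),
    ).fetchone()
    conn.close()
    return count


# Made-up rows: three files, two of which carry the same monument ID.
SAMPLE_ROWS = [
    (1, "Rijksmonumenten_with_known_IDs", "12345"),
    (2, "Rijksmonumenten_with_known_IDs", "12345"),
    (3, "Rijksmonumenten_with_known_IDs", "67890"),
]

print(count_unique_ids(SAMPLE_ROWS, "Rijksmonumenten_with_known_IDs"))
```

A plain COUNT(*) over the same rows would count files; the DISTINCT on the sortkey counts monuments, which is the number asked for here.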
Regards,
Andre
Did we miss something here: http://commons.wikimedia.org/w/index.php?title=File:Canada_do_Inferno_6.jpg&... ?
Or should we adapt the upload wizard to put a sortkey in the category? (That would *have helped* a lot...)
-- Nuno Tavares Wikimedia Portugal http://www.wikimedia.pt
Imagine a world in which every person is given the possibility of free access to the sum of all human knowledge. That is what we are doing.
Take part too: http://www.wikimedia.pt
On 07-09-2011 06:47, Maarten Dammers wrote:
Hi Nuno,
On 7 Sep 2011 at 05:24, Nuno Tavares nuno.tavares@wikimedia.pt wrote:
Dear fellows,
Thanks for the positive feedback; the statistics will soon get more robust. One of the things I'd like to have is the ID that each uploaded image refers to. This makes it necessary to parse the image description page and extract the template each of you is using.
That's not needed. You can find the id in the sortkey field :-)
For Portugal it is {{WLM-PT|xxxxx}}
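
For anyone who does end up parsing description pages, a rough sketch of the extraction in Python follows. The template name WLM-PT is the one given above; the function name extract_monument_id and the assumption that the ID is always the first unnamed parameter are mine, and the sample wikitext in the test values is invented.

```python
import re


def extract_monument_id(wikitext, template="WLM-PT"):
    """Return the first parameter of {{<template>|<id>}}, or None if absent.

    Assumes the ID is the first unnamed parameter of the template.
    """
    pattern = re.compile(
        r"\{\{\s*" + re.escape(template) + r"\s*\|\s*([^|}]+?)\s*[|}]"
    )
    match = pattern.search(wikitext)
    return match.group(1) if match else None


# Invented description-page text, for illustration only.
sample = "== Summary ==\n{{WLM-PT|12345}}\n[[Category:Example]]"
print(extract_monument_id(sample))
```

A page where the function returns None would be a candidate for the "missing ID" list.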
I'd like to know what yours is, for each country. Replying to me will allow you not only to automatically detect missing IDs in submitted images (thanks France, for sending more than 10 French-uploaded images to us) but also to compute statistics on which objects got new images, which ones didn't have any, etc.
Before you start duplicating work: the Python configuration file contains a lot of these identifier templates.
Maarten
Please, send me also an example.
Best,
-- Nuno Tavares Wikimedia Portugal http://www.wikimedia.pt
Wiki Loves Monuments mailing list WikiLovesMonuments@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikilovesmonuments http://www.wikilovesmonuments.eu