Hello all,
Lately I have been searching Commons for files that lack the Information template or any of the other maintained infobox templates listed in Commons:Infoboxes [1], and adding them to Category:Media_missing_infobox_template [2] so that others and I can add the missing templates. One reason a file might appear to be missing an infobox template is that the existing Information template fails to parse, usually because of some sort of {{}} or [[]] bracket imbalance; such files go into Category:Pages_using_Information_template_with_parsing_errors <https://commons.wikimedia.org/wiki/Category:Pages_using_Information_template_with_parsing_errors>. Files with this problem are found through CatScan3 queries for files that do not transclude {{Information}} but do contain the string "{{Information".

Lately I have had a large number of "false positive" files that, according to CatScan, do not transclude {{Information}} but clearly have the template. This happens occasionally, but usually the edit required to add the category is enough for the database to refresh its links, and a second CatScan query like [5] can then find them. This time, however, that does not work.
Any idea how to purge such files? See also [6]
Jarek T. (user:Jarekt)
[1] https://commons.wikimedia.org/wiki/Commons:Infoboxes
[2] https://commons.wikimedia.org/wiki/Category:Media_missing_infobox_template
[3] https://commons.wikimedia.org/wiki/Commons:Bots/Work_requests#Adding_the_Inf...
[4] https://commons.wikimedia.org/wiki/Category:Pages_using_Information_template...
[5] https://tools.wmflabs.org/catscan3/catscan2.php?language=commons&project...
[6] https://commons.wikimedia.org/wiki/User_talk:Jarekt#file:Amsterdam_P1080037....
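The bracket imbalance mentioned in the message above can be illustrated with a rough check on a page's wikitext. This is only a sketch under stated assumptions, not how CatScan or MediaWiki detects parsing errors: the function names are hypothetical, the template name is matched case-insensitively as an assumption, and plain substring counts ignore <nowiki> sections and HTML comments, so the result is a heuristic at best.

```python
def bracket_imbalance(wikitext):
    """Return opening-minus-closing counts for {{ }} and [[ ]] pairs.

    Crude heuristic: plain substring counts, no real wikitext parsing.
    """
    return {
        "{{}}": wikitext.count("{{") - wikitext.count("}}"),
        "[[]]": wikitext.count("[[") - wikitext.count("]]"),
    }


def looks_like_parsing_error(wikitext):
    """True if the text contains "{{Information" but its brackets do not balance."""
    # Assumption: match the template name regardless of letter case.
    has_information = "{{information" in wikitext.lower()
    counts = bracket_imbalance(wikitext)
    return has_information and any(diff != 0 for diff in counts.values())


if __name__ == "__main__":
    # Hypothetical wikitext snippets, for illustration only.
    balanced = "{{Information|description=A [[canal]] in Amsterdam}}"
    unbalanced = "{{Information|description=A [[canal in Amsterdam}}"
    print(looks_like_parsing_error(balanced))
    print(looks_like_parsing_error(unbalanced))
```

A file flagged this way would be a candidate for the parsing-errors category rather than for the missing-infobox category, but each hit would still need a manual look.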
Hello Jarek,
On Tuesday, 10 February 2015, 14:08:43, Tuszynski, Jarek W. wrote:
> Lately I was searching the Commons files for files without Information template or other maintained infobox templates mentioned in Commons:Infoboxes and adding them to Category:Media_missing_infobox_template [2]
> Lately I had a large number of "false positive" files that according to the CatScan do not transclude {{Information}}, but which clearly have the template. That seems to happen occasionally but usually edit required to add the category is enough for the database to refresh its links and a second CatScan query like [5] can find them. However this time it does not work.
> Any idea how to purge such files? See also [6]
I'm not sure if this addresses the core of your question, but one solution could be to generate the list as you currently do, purge the file pages using touch.py or similar, then re-generate the list and see which titles still appear.
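The purge-and-recheck idea above could be sketched roughly as follows. This is a minimal illustration under assumptions, not a tested workflow: `action=purge` and `forcelinkupdate` are parameters of the MediaWiki API, but the helper names, the batch size of 50, and the example titles are hypothetical, and no request is actually sent here (a real purge would have to be POSTed to the Commons API, for instance through Pywikibot).

```python
def build_purge_params(titles, batch_size=50):
    """Yield one action=purge parameter dict per batch of page titles.

    forcelinkupdate asks MediaWiki to rebuild the links tables
    (including template links) for the purged pages.
    """
    titles = list(titles)
    for start in range(0, len(titles), batch_size):
        batch = titles[start:start + batch_size]
        yield {
            "action": "purge",
            "format": "json",
            "forcelinkupdate": "1",
            "titles": "|".join(batch),
        }


def still_missing(before, after):
    """Titles that appear in both the pre-purge and post-purge lists, in original order."""
    after_set = set(after)
    return [title for title in before if title in after_set]


if __name__ == "__main__":
    # Hypothetical CatScan results before and after purging.
    before = ["File:A.jpg", "File:B.jpg", "File:C.jpg"]
    after = ["File:B.jpg"]
    for params in build_purge_params(before, batch_size=2):
        print(params)
    print(still_missing(before, after))
```

Titles that survive the second list are the ones where purging did not refresh the transclusion data, which narrows down the set worth investigating by hand.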
I was thinking about that, but touch.py just makes an empty edit to trigger an update of the database tables. An actual edit that adds a category should have the same effect. I did go further, though, and tried purging a couple of pages and then checking whether CatScan detects them, but no such luck.
Jarek (user:Jarekt)
-----Original Message-----
From: commons-l-bounces@lists.wikimedia.org [mailto:commons-l-bounces@lists.wikimedia.org] On Behalf Of Guillaume Paumier
Sent: Tuesday, February 10, 2015 2:22 PM
To: Wikimedia Commons Discussion List
Subject: Re: [Commons-l] Database transclusion table inconsistent with the file metadata
-- Guillaume Paumier
_______________________________________________
Commons-l mailing list
Commons-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/commons-l