Hi,
we have a diligent editor who puts defaultsorts and categories on file description pages whose files are on Commons, not on huwiki. We tend to remove these categories and defaultsorts, so we have to review the description pages of files that do not exist locally for categories. Do we have any tool for this task, or do I have to write one from scratch?
To be more concrete:
- How do I decide whether a page in the file namespace contains an actual file or
just a description page for a file located on Commons?
- How do I list all description pages without a local file?
Please, can anyone help with an idea?
-- Bináris
Another idea from cs.wiki: the file namespace is editable by sysops only, so cs.wiki hosts only the Wikipedia logo locally and all other pictures come from Commons :-)
JAnD
On 12 March 2012 11:20, Bináris wikiposta@gmail.com wrote:
Please, can anyone help with an idea?
Pywikipedia-l mailing list Pywikipedia-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/pywikipedia-l
2012/3/13 Jan Dudík jan.dudik@gmail.com
Another idea from cs.wiki: the file namespace is editable by sysops only, so cs.wiki hosts only the Wikipedia logo locally and all other pictures come from Commons :-)
That is a different question; the Hungarian community has discussed it a few times and rejected that solution. My question is of a technical nature, but thanks.
2012/3/8 Bináris wikiposta@gmail.com
Do we have any tool for this task, or do I have to write one from scratch?
Your problem is so specific that I don't think such a tool exists.
On 8 March 2012 23:39, Bináris wikiposta@gmail.com wrote:
- How do I decide if a page in file namespace contains an actual file or
just a description page to a file located in Commons?
print wikipedia.ImagePage.fileIsOnCommons.__doc__
Return True if the image is stored on Wikimedia Commons
- How do I list all description pages without a local file?
This is more a request for a database query, but you could also just loop on all the pages in the Image namespace.
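For readers without database access, the looping approach can be sketched against the MediaWiki web API instead of pywikipedia. This is only an illustration, not the method used in this thread: it assumes the API reports an `imagerepository` value ("local", "shared" or empty) for each page when `prop=imageinfo` is requested, and the endpoint, the function names and the User-Agent string are placeholders chosen for the example.

```python
import json
import urllib.parse
import urllib.request

API_URL = "https://hu.wikipedia.org/w/api.php"  # assumed endpoint for huwiki


def pages_without_local_file(api_result):
    """Return sorted titles of File: pages whose file is not stored locally."""
    pages = api_result.get("query", {}).get("pages", {})
    titles = []
    for page in pages.values():
        # Assumption: "imagerepository" is "local" for files uploaded to this
        # wiki, "shared" for files served from Commons, "" when no file exists.
        if page.get("ns") == 6 and page.get("imagerepository", "") != "local":
            titles.append(page["title"])
    return sorted(titles)


def fetch_batch(continue_params=None):
    """Fetch one batch of non-redirect File: pages (needs network access)."""
    params = {
        "action": "query",
        "format": "json",
        "generator": "allpages",
        "gapnamespace": "6",
        "gapfilterredir": "nonredirects",
        "gaplimit": "50",
        "prop": "imageinfo",
    }
    # Pass the "continue" dict of the previous result to get the next batch.
    params.update(continue_params or {})
    url = API_URL + "?" + urllib.parse.urlencode(params)
    request = urllib.request.Request(
        url, headers={"User-Agent": "file-page-check-sketch/0.1"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling `pages_without_local_file(fetch_batch())` would classify the first batch; repeating the call with the `continue` values from each result walks the whole namespace.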
The query is: select count(*) from page left join image on page_title=img_name where page_namespace=6 and page_is_redirect=0 and img_name is null;
The 461 pages are listed here: http://toolserver.org/~valhallasw/huwiki_voor_binaris/ (all utf-8 encoded).
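The join can be tried out on a toy database. The sketch below uses Python's `sqlite3` with a heavily simplified, made-up subset of the `page` and `image` tables (the real MediaWiki tables have many more columns), and it selects `page_title` rather than `count(*)` so that it returns the titles themselves.

```python
import sqlite3


def pages_without_image_row(conn):
    """Return titles of non-redirect File: pages with no row in `image`."""
    rows = conn.execute(
        """
        SELECT page_title
        FROM page
        LEFT JOIN image ON page_title = img_name
        WHERE page_namespace = 6
          AND page_is_redirect = 0
          AND img_name IS NULL
        ORDER BY page_title
        """
    )
    return [title for (title,) in rows]


def build_toy_database():
    """Create an in-memory database with invented sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE page (
            page_title TEXT, page_namespace INTEGER, page_is_redirect INTEGER
        );
        CREATE TABLE image (img_name TEXT);
        INSERT INTO page VALUES ('Local.png', 6, 0);
        INSERT INTO page VALUES ('OnCommons.jpg', 6, 0);
        INSERT INTO page VALUES ('Redirect.jpg', 6, 1);
        INSERT INTO page VALUES ('Some_article', 0, 0);
        INSERT INTO image VALUES ('Local.png');
        """
    )
    return conn


if __name__ == "__main__":
    print(pages_without_image_row(build_toy_database()))
```

The left join keeps every `page` row, and the `img_name IS NULL` filter then leaves only those description pages that have no matching local file.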
Best, Merlijn
Merlijn, thanks a lot! I am starting to process them. Query requests are not handled very quickly on huwiki, so I will store the query for later use.
2012/3/16 Merlijn van Deen valhallasw@arctus.nl
The query is: select count(*) from page left join image on page_title=img_name where page_namespace=6 and page_is_redirect=0 and img_name is null;
It would be better as
select page_title
as it appears above the first list, wouldn't it?
Merlijn, how do you run these queries? From a dump? Or with a toolserver account? Does a toolserver account let you run queries directly from the command line?
On 18 March 2012 09:55, Bináris wikiposta@gmail.com wrote:
Merlijn, how do you run these queries? From a dump? Or with a toolserver account? Does a toolserver account let you run queries directly from the command line?
From the toolserver (where you can indeed run queries directly on a database).
The query above the saved file is correct; the other one just dumps the number of pages.
Best, Merlijn
2012/3/18 Merlijn van Deen valhallasw@arctus.nl
From the toolserver (where you can indeed run queries directly on a database).
Thanks, that's a good reason to apply for an account. :-) I saved the list, you may delete the original. Thanks again!
I hacked together a script that writes the contents of each description page next to the filename, so it is now easy to see which ones should be deleted: http://hu.wikipedia.org/w/index.php?title=Szerkeszt%C5%91:Bin%C3%A1ris/Commo...
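The script itself was not posted to the list. As a rough idea of what such an overview could look like, here is a minimal sketch that turns (title, description text) pairs into a wikitable; the function name and the table layout are invented for this example, and fetching the page texts is left out.

```python
def description_overview(pages):
    """Build a wikitable with one row per (title, description text) pair."""
    lines = ['{| class="wikitable"', "! File !! Description page contents"]
    for title, text in pages:
        lines.append("|-")
        # The leading colon makes a plain link instead of embedding the image.
        lines.append("| [[:%s]]" % title)
        # <nowiki> keeps categories and defaultsorts in the text from taking
        # effect on the overview page itself.
        lines.append("| <nowiki>%s</nowiki>" % text)
    lines.append("|}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(description_overview([("Fájl:Example.jpg", "[[Kategória:Teszt]]")]))
```

Titles are expected to carry their namespace prefix, as in the example call.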