https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
Web browser: --- Bug ID: 55414 Summary: Existing pages does not exist Product: Pywikibot Version: compat (1.0) Hardware: All OS: All Status: NEW Severity: critical Priority: Unprioritized Component: interwiki.py Assignee: Pywikipedia-bugs@lists.wikimedia.org Reporter: jan.dudik@gmail.com Classification: Unclassified Mobile Platform: ---
I run interwiki.py
Many existing categories and it's interwiki links are reported as missing https://cs.wikinews.org/w/index.php?title=Kategorie:Srpen_2013&curid=737...
https://cs.wikinews.org/w/index.php?title=Kategorie:21._%C4%8Dervenec_2013&a...
and many others
https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
--- Comment #1 from JAn Dudík jan.dudik@gmail.com --- the same for wiktionary main namespace https://cs.wiktionary.org/w/index.php?title=%D1%81%D0%B5%D0%BE%D1%81%D0%BA%D...
other changes I stopped
https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
JAn Dudík jan.dudik@gmail.com changed:
What |Removed |Added ---------------------------------------------------------------------------- Priority|Unprioritized |High Severity|critical |major
https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
xqt info@gno.de changed:
What |Removed |Added ---------------------------------------------------------------------------- CC| |info@gno.de
--- Comment #2 from xqt info@gno.de --- Could you give any hints, tracebacks, messages while processing iw.py?
https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
--- Comment #3 from JAn Dudík jan.dudik@gmail.com --- Created attachment 13448 --> https://bugzilla.wikimedia.org/attachment.cgi?id=13448&action=edit log from interwiki py
See attached log. But now seems to work correctly
https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
--- Comment #4 from JAn Dudík jan.dudik@gmail.com --- try interwiki.py -lang:tr -family:wikinews -subcatsr:2013
Although all categories have all members, in february some categories "does not exist"
When I run -subcatsr:2013/02 exist all
It seems that bot takes only first 50 pages from some languages, because on interwiki.py -lang:cs -family:wikinews -new -namespace:14 it deleted some links and these languages now have only 50 pages to work.
Additionally there is bug https://bugzilla.wikimedia.org/show_bug.cgi?id=55374 very slow run - loading about 1 page per second and error messages every few minutes
------------ interwiki.py -lang:tr -family:wikinews -subcatsr:2013 ... NOTE: [[tr:Kategori:2013/02/18]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/19]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/20]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/21]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/22]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/23]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/24]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/25]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/26]] does not exist. Skipping. NOTE: [[tr:Kategori:2013/02/27]] does not exist. Skipping. ... NOTE: The first unfinished subject is [[tr:Kategori:2013/01]] NOTE: Number of pages queued is 50, trying to add 60 more. Getting [[Kategori:2013/02/27]] list... Getting [[Kategori:2013/02/28]] list... Getting [[Kategori:2013/03]] list... Getting [[Kategori:2013/03/01]] list... Getting [[Kategori:2013/03/02]] list... Getting [[Kategori:2013/03/03]] list... Getting [[Kategori:2013/03/04]] list... Getting [[Kategori:2013/03/05]] list... Getting [[Kategori:2013/03/06]] list... Getting [[Kategori:2013/03/07]] list... Getting [[Kategori:2013/03/08]] list...
https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
Malafaya malafaya@clix.pt changed:
What |Removed |Added ---------------------------------------------------------------------------- CC| |malafaya@clix.pt
--- Comment #5 from Malafaya malafaya@clix.pt --- *** Bug 55655 has been marked as a duplicate of this bug. ***
https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
Malafaya malafaya@clix.pt changed:
What |Removed |Added ---------------------------------------------------------------------------- Severity|major |critical
--- Comment #6 from Malafaya malafaya@clix.pt --- I can add that the interwikis removed are somewhat random. In two consecutive runs, interwiki.py readds the interwikis removed in the previous run, and sometimes also removes others it didn't remove in the previous run. The problem is definitely related to the "NOTE: [[**:***]] does not exist. Skipping." message, which sometimes isn't correct.
https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
--- Comment #7 from Malafaya malafaya@clix.pt --- Got it:
https://no.wiktionary.org/w/api.php?action=query&prop=info&format=xm...
returns results (just 50...) but also:
<warnings> <query xml:space="preserve">Too many values supplied for parameter 'titles': the limit is 50</query> </warnings>
Default query value in pywiki is 60... Setting it to 50 or lower (-query:50) should do the trick. This is probably some new change to the API.
https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
--- Comment #8 from Gerrit Notification Bot gerritadmin@wikimedia.org --- Change 89500 had a related patch set uploaded by Xqt: (Bug 55414) Initial bugfix for non existing pages
https://gerrit.wikimedia.org/r/89500
https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
Gerrit Notification Bot gerritadmin@wikimedia.org changed:
What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |PATCH_TO_REVIEW
https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
--- Comment #9 from Gerrit Notification Bot gerritadmin@wikimedia.org --- Change 89500 merged by Xqt: (Bug 55414) Initial bugfix for non existing pages
https://gerrit.wikimedia.org/r/89500
https://bugzilla.wikimedia.org/show_bug.cgi?id=55414
xqt info@gno.de changed:
What |Removed |Added ---------------------------------------------------------------------------- Status|PATCH_TO_REVIEW |RESOLVED Resolution|--- |FIXED Assignee|Pywikipedia-bugs@lists.wiki |info@gno.de |media.org |
--- Comment #10 from xqt info@gno.de --- Decreased maxquerysize to 50 which is the same value as in core
pywikipedia-bugs@lists.wikimedia.org