Support Requests item #1813173, was opened at 2007-10-14 14:22
Message generated for change (Comment added) made by wikipedian
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603139&aid=1813173...

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

Category: None
Group: None
Status: Closed
Priority: 5
Private: No
Submitted By: André Malafaya Baptista (malafaya)
Assigned to: Nobody/Anonymous (nobody)
Summary: 'Title ... not found in list. Expected one of ...'
Initial Comment: I'm constantly getting this message while processing [[en:Democratic Republic of the Congo]]:
---
Getting 1 pages from wikipedia:ka...
BUG>> title kongos demokratiuli respublika ([[ka:kongos demokratiuli respublika]]) not found in list
Expected one of: [[ka:kongos demokratiuli respublika?]]
---
The ka article (title is yellow/transliterated) exists without the '?'. Why does the bot expect the page title to have that '?' at the end?
----------------------------------------------------------------------
Comment By: Daniel Herding (wikipedian)
Date: 2008-02-12 02:42
Message: Logged In: YES user_id=880694 Originator: NO
Left-to-right and right-to-left markers in page titles are now removed in the Page constructor and thus ignored by the bot framework, so this bug is fixed.
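
For illustration only, the normalization described above could look roughly like the sketch below. This is not the actual Page constructor code: the helper name strip_direction_marks is hypothetical, and the sketch assumes the markers in question are U+200E (LEFT-TO-RIGHT MARK) and U+200F (RIGHT-TO-LEFT MARK).

```python
# Hypothetical helper, not the framework's real code.
# Assumption: the markers are U+200E (LRM) and U+200F (RLM).
DIRECTION_MARKS = "\u200e\u200f"


def strip_direction_marks(title):
    """Return the title with every LRM/RLM character removed."""
    return "".join(ch for ch in title if ch not in DIRECTION_MARKS)


# A title carrying a trailing invisible marker now compares equal
# to the plain title.
dirty = "kongos demokratiuli respublika\u200e"
clean = strip_direction_marks(dirty)
print(clean == "kongos demokratiuli respublika")
```

Normalizing once, at the point where titles enter the framework, means every later comparison sees the same string regardless of which wiki the link was read from.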
----------------------------------------------------------------------
Comment By: André Malafaya Baptista (malafaya)
Date: 2007-10-24 23:03
Message: Logged In: YES user_id=1037345 Originator: YES
Is there any way of working around this? Maybe by using the article name returned by Special:Export instead of the name given by the interwiki (while being aware of redirects)? This problem is spreading, because one bad new interwiki is enough for the bot to propagate the mistake to all Wikipedias, and then it is harder to correct.
----------------------------------------------------------------------
Comment By: André Malafaya Baptista (malafaya)
Date: 2007-10-24 01:14
Message: Logged In: YES user_id=1037345 Originator: YES
It seems interwiki.py itself is adding those invisible characters. Take a look at: http://en.wikipedia.org/w/index.php?title=Williamsburg%2C_Colorado&diff=... There you can see a first bot edit (by Rei-bot) that adds a pt interwiki containing an invisible character. The next change, by MalafayaBot, happens after I manually corrected one of the interwikis by removing the invisible character.
----------------------------------------------------------------------
Comment By: Daniel Herding (wikipedian)
Date: 2007-10-15 00:48
Message: Logged In: YES user_id=880694 Originator: NO
This happens with titles that include invisible left-to-right or right-to-left control characters. These are omitted in Special:Export or something like that; I have forgotten what exactly happens.
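
To check whether a given title carries such characters, something like the following sketch may help. It is only an illustration: the function name describe_invisible is hypothetical, and it assumes the offending characters fall in the Unicode "Cf" (format) category, which is where the left-to-right and right-to-left marks live.

```python
import unicodedata


def describe_invisible(title):
    """List (index, code point, Unicode name) for each format (Cf) character.

    Assumption: the invisible characters are in Unicode category "Cf".
    """
    return [
        (i, "U+%04X" % ord(ch), unicodedata.name(ch, "UNKNOWN"))
        for i, ch in enumerate(title)
        if unicodedata.category(ch) == "Cf"
    ]


# Example with a made-up title that ends in a left-to-right mark.
print(describe_invisible("abc\u200e"))
```

An empty list means the title contains no format characters, so any mismatch would have to come from something else.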
----------------------------------------------------------------------
Comment By: André Malafaya Baptista (malafaya)
Date: 2007-10-14 15:05
Message: Logged In: YES user_id=1037345 Originator: YES
I think I have reached a conclusion: there seems to have been an invisible character in the ka interwiki on all Wikipedias. After I deleted and retyped the interwiki on English Wikipedia (http://en.wikipedia.org/w/index.php?title=Democratic_Republic_of_the_Congo&a...), the bot now reports 2 interwikis while processing: one with the plain title and the other with the yellow '?'. The problem is that when the bot fetches the page 'kongos demokratiuli respublika?', the request apparently succeeds and returns the page 'kongos demokratiuli respublika' (as if there were an implicit redirect). Another problem is that when the bot tries to replace the bad interwiki (the one with '?') with the correct one (without it), MediaWiki detects 'no changes' and simply ignores the page update. So the incorrect interwiki will persist until the bot makes a major update to the page.
----------------------------------------------------------------------