https://bugzilla.wikimedia.org/show_bug.cgi?id=55227
Web browser: ---
Bug ID: 55227
Summary: interwiki problems in km wikipedia
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: interwiki.py
Assignee: Pywikipedia-bugs@lists.wikimedia.org
Reporter: legoktm.wikipedia@gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1382/
Reported by: Anonymous user
Created on: 2011-11-27 12:57:44
Subject: interwiki problems in km wikipedia
Original description:
It seems that interwiki bots running different Python versions read Khmer text differently. Please see http://en.wikipedia.org/w/index.php?title=Angelina%5C_Jolie&action=histo.... A bot running Python 2.7.1 adds a link to km and a bot running Python 2.5.1 removes it, but when you follow the removed link it in fact points to nothing. Is there any way to fix the problem?
--- Comment #1 from Kunal Mehta (Legoktm) legoktm.wikipedia@gmail.com ---
It seems Unicode bug #3081100 is back.
--- Comment #2 from Kunal Mehta (Legoktm) legoktm.wikipedia@gmail.com ---
Interwiki bots running under Python 2.7.1 should just be blocked indefinitely for not paying attention to the pwb mailing list and console warnings.
--- Comment #3 from Kunal Mehta (Legoktm) legoktm.wikipedia@gmail.com ---
I guess it is the other way around: the Python 2.5.1 bot is the one that fails, although its Unicode test looks OK [1]. I checked these links and found that the 2.5.1 bot drops the last 3 characters.
[1]: http://ru.wikipedia.org/wiki/%D0%9E%D0%B1%D1%81%D1%83%D0%B6%D0%B4%D0%B5%D0%B...
Kunal Mehta (Legoktm) legoktm.wikipedia@gmail.com changed:
What     |Removed |Added
----------------------------------------------------------------------------
See Also |        |https://sourceforge.net/p/pywikipediabot/bugs/1382
xqt info@gno.de changed:
What |Removed |Added
----------------------------------------------------------------------------
CC   |        |info@gno.de

--- Comment #4 from xqt info@gno.de ---
That means we should drop Python 2.5 for running pwb bots. This would make things easier.
Merlijn van Deen valhallasw@arctus.nl changed:
What       |Removed |Added
----------------------------------------------------------------------------
Status     |NEW     |RESOLVED
CC         |        |valhallasw@arctus.nl
See Also   |        |https://bugzilla.wikimedia.org/show_bug.cgi?id=27446
Resolution |---     |DUPLICATE

--- Comment #5 from Merlijn van Deen valhallasw@arctus.nl ---
Duplicate of #55256. The cause is a buggy page name: km:អែនជេលីណា ចូលី ends in \u200b (zero width space).

Not #3081100, but related. This history entry most clearly shows what is happening:

00:09, 12 November 2012 ElphiBot m (95,243 bytes) (+10) (r2.7.1) (Robot: Modifying km:អែនជេលីណា ចូលី to km:អែនជេលីណា ចូលី)

This is combined with a change in behavior. To cite myself:

To clarify: the pywikipedia bug was caused by calling .strip() on the page title. When working with Unicode < 4.0 (Python < 2.7), this will strip the U+200B character; with Unicode > 4.0 (Python >= 2.7), this will *not* strip the U+200B character.
*** This bug has been marked as a duplicate of bug 55256 ***
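The .strip() behavior described in comment #5 can be checked with a short Python 3 sketch. This is only an illustration, not the Pywikibot fix: normalize_title is a hypothetical helper and the sample title is made up. On Python 3, U+200B is classified as a format character (Cf) rather than whitespace, so str.strip() leaves it in place and a title has to be cleaned explicitly if every bot should agree on it.

```python
import unicodedata

ZERO_WIDTH_SPACE = "\u200b"


def normalize_title(title):
    """Hypothetical helper (not part of Pywikibot): trim whitespace and
    zero width spaces from both ends of a page title."""
    previous = None
    # Repeat until nothing changes, so mixed runs of spaces and U+200B go too.
    while previous != title:
        previous = title
        title = title.strip().strip(ZERO_WIDTH_SPACE)
    return title


# Made-up sample title ending in an invisible zero width space.
raw_title = "Example title" + ZERO_WIDTH_SPACE

# U+200B has category Cf, so str.strip() does not treat it as whitespace.
category = unicodedata.category(ZERO_WIDTH_SPACE)
stripped = raw_title.strip()
cleaned = normalize_title(raw_title)

print(category)                    # Cf
print(stripped == raw_title)       # True: plain strip() keeps the U+200B
print(cleaned == "Example title")  # True: explicit cleanup removes it
```

Whether a bot should strip such characters or leave the title untouched is a design choice. The point of the sketch is only that the result should not depend on the Python version doing the stripping.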
--- Comment #6 from Merlijn van Deen valhallasw@arctus.nl ---
Oh, and this specific issue was solved by this page move:
https://km.wikipedia.org/w/index.php?title=%E1%9E%A2%E1%9F%82%E1%9E%93%E2%80...
--- Comment #7 from Merlijn van Deen valhallasw@arctus.nl ---
*** This bug has been marked as a duplicate of bug 55246 ***
pywikipedia-bugs@lists.wikimedia.org