https://bugzilla.wikimedia.org/show_bug.cgi?id=55155
Web browser: ---
Bug ID: 55155
Summary: LEFT-TO-RIGHT MARK makes categories() fail
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs@lists.wikimedia.org
Reporter: legoktm.wikipedia@gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1571/
Reported by: nu11zer0
Created on: 2013-01-30 19:36:16
Subject: LEFT-TO-RIGHT MARK makes categories() fail

Original description:
In categories() of class Page, if a LEFT-TO-RIGHT MARK or another hidden character appears inside "[[Category:", pywikibot fails to capture that category.

I noticed that the constructor of class Page already contains code that strips many hidden characters. Why don't we apply the same stripping to categories and to other links? (I'm not sure what else needs hidden characters stripped. Interwiki links? At the least, I see many regular expressions for capturing links, and those can fail on hidden characters too.)

Pywikipedia trunk/pywikipedia/ (r11014, 2013/01/28, 20:55:48, ok)
Python 2.7.3 (default, Sep 26 2012, 21:53:58)
[GCC 4.7.2]
config-settings:
use_api = True
use_api_login = True
unicode test: ok
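
To illustrate the suggestion in the description, here is a minimal sketch of stripping hidden characters before matching category links. It is not pywikibot's actual code: the names HIDDEN_CHARS, CATEGORY_LINK, strip_hidden and find_categories are hypothetical, the list of invisible characters is an assumption rather than the set used in the Page constructor, and the regular expression is a simplified stand-in for the real one.

```python
import re

# Assumed set of invisible characters (LRM, RLM, zero-width space/joiners,
# word joiner, BOM). The set stripped by pywikibot's Page constructor may differ.
HIDDEN_CHARS = re.compile(u"[\u200e\u200f\u200b\u200c\u200d\u2060\ufeff]")

# Simplified, hypothetical pattern for [[Category:Name]] or [[Category:Name|key]].
# It only handles the English namespace name.
CATEGORY_LINK = re.compile(
    r"\[\[\s*Category\s*:\s*([^\]|]+)(?:\|[^\]]*)?\]\]", re.IGNORECASE
)


def strip_hidden(text):
    """Return text with the assumed invisible characters removed."""
    return HIDDEN_CHARS.sub(u"", text)


def find_categories(wikitext):
    """Return category names found after stripping hidden characters."""
    cleaned = strip_hidden(wikitext)
    return [m.group(1).strip() for m in CATEGORY_LINK.finditer(cleaned)]
```

With this sketch, find_categories(u"[[\u200eCategory:Foo]]") finds the category Foo, whereas applying CATEGORY_LINK directly to the same string finds nothing, because the LEFT-TO-RIGHT MARK sits between "[[" and "Category". Stripping the whole page text is the bluntest option; a real fix might prefer to strip only inside link targets so that intentional directional marks in article text are preserved.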