Feature Requests item #3602684 was opened at 2013-01-30 10:33
Message generated for change (Comment added) made by xqt
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=3602684...
Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.
Category: None
Group: None
Status: Closed
Resolution: Duplicate
Priority: 5
Private: No
Submitted By: Nullzer0 (nu11zer0)
Assigned to: Nobody/Anonymous (nobody)
Summary: LEFT-TO-RIGHT MARK makes categories() fail
Initial Comment: In categories() from class Page, if there is a LEFT-TO-RIGHT MARK or another hidden character in "[[Category:", pywikibot fails to capture that category. I noticed that the constructor of class Page contains code for stripping many hidden characters. Why don't we apply this code to categories and other links as well? (I'm not sure what else needs hidden characters stripped. Interwiki links? At the least, I see many regular expressions for capturing links, and these can fail on hidden characters too.)
Pywikipedia trunk/pywikipedia/ (r11014, 2013/01/28, 20:55:48, ok)
Python 2.7.3 (default, Sep 26 2012, 21:53:58) [GCC 4.7.2]
config-settings:
use_api = True
use_api_login = True
unicode test: ok
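To illustrate the request above, here is a minimal sketch of stripping hidden characters before matching category links. It is not pywikibot's actual code: the names HIDDEN_CHARS, CATEGORY_RE, strip_hidden and find_categories are hypothetical, the set of characters removed is an assumption and not the list used in the Page constructor, and the pattern only handles the English "Category" namespace name. It is written for Python 3.

```python
import re

# Invisible formatting characters to drop before matching.
# ASSUMPTION: illustrative set (LRM, RLM, bidi embeddings/overrides,
# zero-width space, BOM); not the list from pywikibot's Page constructor.
HIDDEN_CHARS = re.compile("[\u200e\u200f\u202a-\u202e\u200b\ufeff]")

# Simplified, hypothetical pattern for [[Category:Name]] and
# [[Category:Name|sortkey]]; real wikis also use localized namespace names.
CATEGORY_RE = re.compile(
    r"\[\[\s*Category\s*:\s*([^\]|]+)(?:\|[^\]]*)?\]\]",
    re.IGNORECASE,
)


def strip_hidden(text):
    """Return text with the hidden characters above removed."""
    return HIDDEN_CHARS.sub("", text)


def find_categories(wikitext):
    """Return category names found after stripping hidden characters."""
    cleaned = strip_hidden(wikitext)
    return [name.strip() for name in CATEGORY_RE.findall(cleaned)]


if __name__ == "__main__":
    raw = "[[\u200eCategory:Foo]] [[Category:Bar\u200e|sort]]"
    # Matching the raw text misses the first link because of the mark.
    print(CATEGORY_RE.findall(raw))
    print(find_categories(raw))
```

Stripping once, before any pattern is applied, would cover interwiki and other link patterns as well, without changing each regular expression separately.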
----------------------------------------------------------------------
Comment By: xqt (xqt)
Date: 2013-01-30 15:21
Message: Duplicate of 3602690
----------------------------------------------------------------------
Comment By: Nullzer0 (nu11zer0)
Date: 2013-01-30 11:36
Message: Please delete this. I created this topic in the wrong tracker and have now moved it to the bug tracker. Sorry.
----------------------------------------------------------------------
pywikipedia-bugs@lists.wikimedia.org