Bugs item #2941628, was opened at 2010-01-28 14:30
Message generated for change (Tracker Item Submitted) made by nobody
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=294162…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: General
Group: None
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
Assigned to: Nobody/Anonymous (nobody)
Summary: BOM (inserted by notepad.exe) should be stripped
Initial Comment:
python version.py
Pywikipedia [http]
trunk/pywikipedia (r7897, 2010/01/28, 12:22:31)
Python 2.6.2 (r262:71605, Apr 14 2009, 22:40:02) [MSC v.1500 32 bit (Intel)]
----
If a win32 user uses notepad.exe as the external editor for editarticle.py, Notepad may
insert a 3-byte UTF-8 BOM at the start of the file if it was a BOM-free UTF-8 file.
As a result, pages saved by pywikipedia (e.g. by solve_disambiguation.py) contain U+FEFF
(the decoded BOM), which breaks tables, since U+FEFF then precedes "{|".
http://ko.wikipedia.org/w/index.php?diff=prev&oldid=4442519
The BOM should be stripped, since Python's open() does not strip it automatically.
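
A minimal sketch of the kind of stripping meant here, assuming the edited file is read back as raw bytes before decoding. The names strip_bom and edited are hypothetical and are not taken from the pywikipedia source; where exactly editarticle.py reads the temporary file is not shown in this report.

```python
import codecs

def strip_bom(data):
    """Decode UTF-8 bytes to text, dropping one leading BOM if present.

    Hypothetical helper for illustration; not part of pywikipedia.
    """
    if data.startswith(codecs.BOM_UTF8):
        # Remove the 3-byte sequence EF BB BF that Notepad may prepend.
        data = data[len(codecs.BOM_UTF8):]
    return data.decode('utf-8')

# Simulate a file saved by Notepad: BOM followed by the start of a wiki table.
edited = codecs.BOM_UTF8 + u'{|\n| cell\n|}'.encode('utf-8')
text = strip_bom(edited)
```

With this, the text starts with "{|" again and contains no U+FEFF. Decoding with the standard 'utf-8-sig' codec gives the same result and may be the simpler choice if the file is always UTF-8.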
----------------------------------------------------------------------