On 2 March 2011 07:46, xqt@svn.wikimedia.org wrote:
Log Message:
hak-wiki is also affected due to bug #3081100
Modified: trunk/pywikipedia/interwiki.py
rmPage.site().lang in ['hi', 'cdo'] and \
rmPage.site().lang in ['hak', 'hi', 'cdo'] and \
Note that *any* wiki can be affected - it's a problem with certain combinations of characters (multiple accents etc). This only commonly happens in some languages, but in principle any language that uses several accents can be affected. Not quite sure what the correct way of handling this would be though -- blocking all bots that trigger the unicode bug might be a bit too much.
Best regards, Merlijn
Note that *any* wiki can be affected - it's a problem with certain combinations of characters (multiple accents etc). This only commonly happens in some languages, but in principle any language that uses several accents can be affected. Not quite sure what the correct way of handling this would be though -- blocking all bots that trigger the unicode bug might be a bit too much.
Best regards, Merlijn
Shouldn't we publish a small patch for python >2.6.5 as I proposed some weeks ago? This means the compiled C-coded stuff and the two python lib files. I guess this would help some bot owners to cleanup this bug.
Greetings xqt
On 6 March 2011 11:42, info@gno.de wrote:
Shouldn't we publish a small patch for python >2.6.5 as I proposed some weeks ago? This means the compiled C-coded stuff and the two python lib files. I guess this would help some bot owners to cleanup this bug.
This is a possibility, but not completely trivial. We would have to compile a version for both python 2.6 and 2.7, and for both win32 and several flavours of linux. Of course, most linux users should be able to compile the .c themselves, so this would only require win32 builds. Might not be such a bad idea then, I guess, although I'm not a big fan of cleaning up the mess upstream makes ;-)
Merlijn
pywikipedia-l@lists.wikimedia.org