Revision: 8090 Author: xqt Date: 2010-04-14 13:18:54 +0000 (Wed, 14 Apr 2010)
Log Message: ----------- weblinkchecker.py - stripping ")" (bug #2985621, fix #2985625 by masti)
Modified Paths: -------------- trunk/pywikipedia/pywikibot/textlib.py
Modified: trunk/pywikipedia/pywikibot/textlib.py =================================================================== --- trunk/pywikipedia/pywikibot/textlib.py 2010-04-14 09:50:27 UTC (rev 8089) +++ trunk/pywikipedia/pywikibot/textlib.py 2010-04-14 13:18:54 UTC (rev 8090) @@ -677,10 +677,10 @@ # RFC 2396 says that URLs may only contain certain characters. # For this regex we also accept non-allowed characters, so that the bot # will later show these links as broken ('Non-ASCII Characters in URL'). - # Note: While allowing parenthesis inside URLs, MediaWiki will regard - # right parenthesis at the end of the URL as not part of that URL. - # The same applies to dot, comma, colon and some other characters. - notAtEnd = ']\s).:;,<>"|' + # Note: While allowing dots inside URLs, MediaWiki will regard + # dots at the end of the URL as not part of that URL. + # The same applies to comma, colon and some other characters. + notAtEnd = ']\s.:;,<>"|' # So characters inside the URL can be anything except whitespace, # closing squared brackets, quotation marks, greater than and less # than, and the last character also can't be parenthesis or another
pywikipedia-svn@lists.wikimedia.org