https://bugzilla.wikimedia.org/show_bug.cgi?id=55246
Web browser: ---
Bug ID: 55246
Summary: Problem with Tibetan script
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1295/
Reported by: ganz-ru
Created on: 2011-02-15 20:40:15
Subject: Problem with Tibetan script
Original description:
There is a heavy edit war here:
http://en.wikipedia.org/w/index.php?title=Podolsk&action=history . Bots running
an older Python version add an incorrect Tibetan interwiki link, while a bot
running Python 2.7.1 adds it correctly.
--
You are receiving this mail because:
You are the assignee for the bug.
https://bugzilla.wikimedia.org/show_bug.cgi?id=55318
Web browser: ---
Bug ID: 55318
Summary: weblinkchecker.py error
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/789/
Reported by: wikishizhao
Created on: 2008-09-03 14:56:27
Subject: weblinkchecker.py error
Original description:
see:
Exception in thread 中華民國國旗 -
http://law.moj.gov.tw/Scripts/Query1A.asp?no=1D0020020&K1=國旗:
Traceback (most recent call last):
  File "/usr/lib/python2.5/threading.py", line 486, in __bootstrap_inner
    self.run()
  File "weblinkchecker.py", line 504, in run
    linkChecker = LinkChecker(self.url, HTTPignore = self.HTTPignore)
  File "weblinkchecker.py", line 302, in __init__
    self.changeUrl(url)
  File "weblinkchecker.py", line 357, in changeUrl
    self.query = unicode(urllib.quote(self.query.encode(encoding), '=&'))
UnicodeEncodeError: 'latin-1' codec can't encode characters in position 17-18:
ordinal not in range(256)
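A minimal sketch of the failure, assuming Python 3 (where urllib.quote is available as urllib.parse.quote) rather than the Python 2.5 shown in the traceback. The query string is taken from the URL above; the variable names are illustrative only. It shows that the query cannot be encoded as Latin-1 but can be encoded as UTF-8 before quoting.

```python
from urllib.parse import quote

# Query part of the URL from the report above.
query = "no=1D0020020&K1=國旗"

# Encoding with Latin-1 fails because the CJK characters are outside its range.
try:
    query.encode("latin-1")
    latin1_ok = True
except UnicodeEncodeError:
    latin1_ok = False

# Encoding as UTF-8 first and then percent-quoting succeeds.
# '=' and '&' are kept as-is, matching the safe characters in the traceback.
encoded = quote(query.encode("utf-8"), safe="=&")
```

Whether UTF-8 is the right choice for every server is a separate question; the sketch only shows that it avoids the exception.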
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55145
Web browser: ---
Bug ID: 55145
Summary: weblinkchecker URL unicode problems
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1613/
Reported by: valhallasw
Created on: 2013-04-13 19:55:05
Subject: weblinkchecker URL unicode problems
Original description:
As reported by Anima in
https://sourceforge.net/tracker/?func=detail&aid=3602096&group_id=93107&at…
Weblinkchecker jumps through some strange unicode hoops. There is no such thing
as a unicode URL - URLs are /always/ urlencoded UTF-8 strings, so:
>>> urllib.quote(u"ö".encode('utf-8'))
'%C3%B6'
anything else is *wrong*, including things like asking what encoding the web
server uses: that is only relevant for decoding the page *text*.
Basic test case:
>>> import weblinkchecker
>>> lc = weblinkchecker.LinkChecker(u"http://svoya-igra.org/Райков Александр Вадимович/")
Contacting server svoya-igra.org to find out its default encoding...
Error retrieving server's default charset. Using ISO 8859-1.
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "weblinkchecker.py", line 218, in __init__
    self.changeUrl(url)
  File "weblinkchecker.py", line 275, in changeUrl
    self.path = unicode(urllib.quote(self.path.encode(encoding)))
UnicodeEncodeError: 'latin-1' codec can't encode characters in position 1-6:
ordinal not in range(256)
valhallasw@lisilwen:~/src/pywikipedia/trunk/pywikipedia$ python version.py
Pywikipedia [svn+ssh] valhallasw@trunk/pywikipedia (r11368, 2013/04/13,
08:16:45, ok)
Python 2.7.3 (default, Aug 1 2012, 05:14:39)
[GCC 4.6.3]
config-settings:
use_api = True
use_api_login = True
unicode test: ok
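The approach described in this report (always encode as UTF-8, then percent-quote) could look roughly like the sketch below. It assumes Python 3 and urllib.parse; encode_url is a hypothetical helper, not part of weblinkchecker.py, and it leaves the host name untouched (non-ASCII host names would need separate handling).

```python
from urllib.parse import quote, urlsplit, urlunsplit


def encode_url(url):
    """Percent-encode the path and query of a URL as UTF-8 (hypothetical helper)."""
    parts = urlsplit(url)
    # Keep '/' in the path and '=&' in the query unescaped.
    path = quote(parts.path.encode("utf-8"), safe="/")
    query = quote(parts.query.encode("utf-8"), safe="=&")
    return urlunsplit((parts.scheme, parts.netloc, path, query, parts.fragment))


result = encode_url("http://svoya-igra.org/Райков Александр Вадимович/")
```

No request to the server is needed to pick an encoding here. Note that a URL which is already percent-encoded would be quoted a second time by this sketch.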
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55050
Web browser: ---
Bug ID: 55050
Summary: Reporting with templates in weblinkchecker.py
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: enhancement
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/feature-requests/281/
Reported by: dixond
Created on: 2010-12-29 09:41:01
Subject: Reporting with templates in weblinkchecker.py
Original description:
It would be nice to have an option to report dead links with templates like
http://en.wikipedia.org/wiki/Template:Dead_link instead of adding a section to
the talk page.
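As a rough illustration of what such an option might do, the sketch below appends a Dead link template directly after the first occurrence of a URL in the page text. tag_dead_link is a hypothetical helper, not an existing weblinkchecker.py function, and the use of the template's date parameter is an assumption.

```python
import re


def tag_dead_link(wikitext, url, date):
    """Append {{Dead link}} after the first occurrence of url (hypothetical helper)."""
    tag = "{{Dead link|date=%s}}" % date
    escaped = re.escape(url)
    # Prefer a bracketed external link "[url label]"; otherwise match the bare URL.
    pattern = re.compile(r"\[" + escaped + r"[^\]]*\]|" + escaped)
    return pattern.sub(lambda m: m.group() + tag, wikitext, count=1)


tagged = tag_dead_link(
    "See [http://example.org/x Example] here.",
    "http://example.org/x",
    "March 2011",
)
```

The bare-URL branch also matches a URL that is only a prefix of a longer one, so a real implementation would need a stricter boundary check.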
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=58810
Web browser: ---
Bug ID: 58810
Summary: PYWP-23 Have weblinkchecker.py optionally apply
w:template:dead link directly to the page
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: UNCONFIRMED
Severity: major
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: wmf.bugconverter(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
This issue was converted from https://jira.toolserver.org/browse/PYWP-23.
Summary: Have weblinkchecker.py optionally apply w:template:dead link directly
to the page
Issue type: New Feature - A new feature of the product, which has yet to be
developed.
Priority: Major
Status: Open
Assignee: Merlijn van Deen <valhallasw(a)arctus.nl>
On Tue, 17 Jan 2012 04:53:34, Jeff G. <jeff-wmj(a)usclec.net> opened the
following bug:
> Why does it "post to article talk pages instead of applying
> w:template:dead link directly to the page"? per
> http://en.wikipedia.org/wiki/Wikipedia:Bots/Requests_for_approval/JeffGBot
> as posted in edit
> http://en.wikipedia.org/w/index.php?title=Wikipedia:Bots/Requests_for_appro…
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55282
Web browser: ---
Bug ID: 55282
Summary: weblinkchecker.py - anoying exceptions
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: ASSIGNED
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1148/
Reported by: masti01
Created on: 2010-03-17 21:49:18
Subject: weblinkchecker.py - anoying exceptions
Assigned to: xqt
Original description:
While processing external links, weblinkchecker often throws this exception:
Exception while processing URL
http://www.cev.lu/mmp-cgi/show.pl?cmd=tmpl&id=851&id2=150&id3=359&id4=4&id5…
in page Mistrzostwa Europy w Piłce Siatkowej Mężczyzn 1997
Exception in thread Mistrzostwa Europy w Piłce Siatkowej Mężczyzn 1997 -
http://www.cev.lu/mmp-cgi/show.pl?cmd=tmpl&id=851&id2=150&id3=359&id4=4&id5…:
Traceback (most recent call last):
  File "/usr/lib64/python2.6/threading.py", line 525, in __bootstrap_inner
    self.run()
  File "weblinkchecker.py", line 492, in run
    ok, message = linkChecker.check()
  File "weblinkchecker.py", line 423, in check
    msg = error[1]
IndexError: tuple index out of range
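The IndexError suggests that the caught error carried fewer than two arguments. A defensive way to extract a message might look like the sketch below. It assumes Python 3 and that error is an exception instance; error_message is a hypothetical helper, not the actual code at line 423.

```python
def error_message(error):
    """Return a readable message without assuming how many args the error has."""
    args = getattr(error, "args", ())
    if len(args) > 1:
        # Typical (errno, message) pair, as raised by socket-level errors.
        return str(args[1])
    if args:
        # Single-argument errors such as a plain "timed out".
        return str(args[0])
    # No arguments at all: fall back to the exception's repr.
    return repr(error)
```

For example, error_message(Exception(111, "Connection refused")) gives the second argument, while an exception built from a single string yields that string.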
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55276
Web browser: ---
Bug ID: 55276
Summary: weblinkchecker should ignore URLs inside some tags,
part 2
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: ASSIGNED
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1164/
Reported by: djbarrett
Created on: 2010-04-12 18:33:11
Subject: weblinkchecker should ignore URLs inside some tags, part 2
Assigned to: xqt
Original description:
This is a followup to [pywikipediabot-Bugs-1969051] "weblinkchecker should
ignore URLs inside some tags"
The fix in pyrev:8076 by xqt is appreciated, but not an appropriate solution.
The particular tag I listed in the ticket, "<sql>", was just an
example. The fix by xqt simply hard-coded this example (bogus) tag into the
Pywikipedia source code:
svn diff -c8076 http://svn.wikimedia.org/svnroot/pywikipedia/trunk/pywikipedia
A better fix would be to recognize when you are reading a tag attribute:
<AnyTagGoesHere ... attr='http://whatever' ...>
{{AnyTemplateOrParserFunction | attr=http://whatever
and ignore the URL in these situations.
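A rough sketch of the tag-attribute half of this suggestion, assuming Python 3 and simple regular expressions rather than a real wikitext parser. urls_outside_tags and both patterns are hypothetical and not taken from weblinkchecker.py; template parameters are not handled here.

```python
import re

# Any <...> tag, including its attributes (no nesting support).
TAG_RE = re.compile(r"<[^<>]*>")
# A deliberately simple URL pattern for illustration.
URL_RE = re.compile(r"https?://[^\s<>\[\]|]+")


def urls_outside_tags(text):
    """Return URLs found in text, skipping those inside tag attributes."""
    # Replace each tag with a space so URLs in attributes disappear
    # while the text between tags is kept.
    stripped = TAG_RE.sub(" ", text)
    return URL_RE.findall(stripped)


sample = "<sql dburl='http://db.example/x'>select 1</sql> see http://example.org/page"
found = urls_outside_tags(sample)
```

Because only the tags themselves are dropped, a URL between <ref> and </ref> is still checked, and no tag name needs to be hard-coded.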
$ python version.py
Pywikipedia [http] trunk/pywikipedia (r8050, 2010/04/01, 15:43:14)
Python 2.4.3 (#1, Sep 3 2009, 15:37:37)
[GCC 4.1.2 20080704 (Red Hat 4.1.2-46)]
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55269
Web browser: ---
Bug ID: 55269
Summary: weblinkchecker.py don't report death links
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1207/
Reported by: Anonymous user
Created on: 2010-07-08 12:06:18
Subject: weblinkchecker.py don't report death links
Original description:
version.py:
Pywikipedia [http] trunk/pywikipedia (r8347, 2010/07/08, 06:47:27)
Python 2.6.1 (r261:67515, Feb 11 2010, 00:51:29)
[GCC 4.2.1 (build 5646)]
config-settings:
use_api = True
use_api_login = True
weblinkchecker.py correctly finds dead links, but with -talk it does not
report them on the talk page; only the help text is displayed. I have already
waited 8 days and run -repeat 4 times. The links are listed in the
deadlinks file.
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55233
Web browser: ---
Bug ID: 55233
Summary: Weblinkchecker reports live links as dead
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1352/
Reported by: hiw
Created on: 2011-09-26 22:35:45
Subject: Weblinkchecker reports live links as dead
Original description:
The weblinkchecker.py reported several web links as dead due to additional
braces at the end of the url.
- http://de.wikipedia.org/wiki/Diskussion:DARC_(Protein)
e.g:
Article:
<ref name="pmid19290478">{{cite journal |author=Horne K, Woolley IJ
|title=Shedding light on DARC: the role of the Duffy antigen/receptor for
chemokines in inflammation, infection and malignancy |journal=Inflamm. Res.
|volume=58 |issue=8 |pages=431–5 |year=2009 |month=August |pmid=19290478
|doi=10.1007/s00011-009-0023-9
|url=http://dx.doi.org/10.1007/s00011-009-0023-9}}</ref>
Response by pywikipedia on talk page:
Dead link found:
http://dx.doi.org/10.1007/s00011-009-0023-9}}
version.py
Pywikipedia [http] trunk/pywikipedia (r9558, 2011/09/25, 20:30:54)
Python 2.7.2 (default, Jun 24 2011, 12:21:10) [MSC v.1500 32 bit (Intel)]
config-settings:
use_api = True
use_api_login = True
unicode test: ok
Using Active Python with Microsoft Windows XP [Version 5.1.2600]
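One possible way to avoid picking up the template's closing braces is sketched below, assuming Python 3. extract_urls and its URL pattern are hypothetical, not the code used by weblinkchecker.py. Trailing "}" characters are dropped only while the URL contains more closing than opening braces.

```python
import re

# A deliberately simple URL pattern for illustration.
URL_RE = re.compile(r"https?://[^\s<>\[\]|]+")


def extract_urls(wikitext):
    """Find URLs and strip unbalanced trailing '}' left over from templates."""
    urls = []
    for match in URL_RE.finditer(wikitext):
        url = match.group()
        # Remove closing braces that have no matching opening brace in the URL.
        while url.endswith("}") and url.count("}") > url.count("{"):
            url = url[:-1]
        urls.append(url)
    return urls


cite = "|url=http://dx.doi.org/10.1007/s00011-009-0023-9}}</ref>"
links = extract_urls(cite)
```

With the citation from this report, the extracted link no longer ends in "}}", while a URL that contains a balanced pair of braces is left unchanged.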
--