https://bugzilla.wikimedia.org/show_bug.cgi?id=55314
Web browser: ---
Bug ID: 55314
Summary: removing sr interwiki
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: interwiki.py
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/830/
Reported by: Anonymous user
Created on: 2009-01-03 22:54:37
Subject: removing sr interwiki
Original description:
There is a problem with sr.wikipedia: Latin characters in the URL or in an
interwiki link are automatically converted to Cyrillic, but the bot does not
recognize this and wants to remove the valid interwiki link.
See
http://cs.wikipedia.org/w/index.php?title=Nikdo_v%C3%A1s_nesm%C3%AD_b%C3…
--
You are receiving this mail because:
You are the assignee for the bug.
https://bugzilla.wikimedia.org/show_bug.cgi?id=55322
Web browser: ---
Bug ID: 55322
Summary: PreloadingGenerator does not set editRestriction
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/680/
Reported by: nicdumz
Created on: 2008-04-23 09:09:47
Subject: PreloadingGenerator does not set editRestriction
Original description:
I realized that sometimes, in blockpageschecker.py,
someTotalProtectedPage.canBeEdited() would wrongly return True (I have no
sysop account, so on every [edit=sysop] protected page it should return
False).
I first tried a single canBeEdited() call on that fully protected page:
>>> import wikipedia; s = wikipedia.Site('fr', 'wikipedia'); protectedpage = wikipedia.Page(s, 'Zentrum')
Checked for running processes. 1 processes currently running, including the current process.
>>> protectedpage.canBeEdited()
Getting 1 pages from wikipedia:fr...
False
Which is fine.
However:
>>> import wikipedia; s = wikipedia.Site('fr', 'wikipedia'); protectedpage = wikipedia.Page(s, 'Zentrum')
Checked for running processes. 1 processes currently running, including the current process.
>>> import pagegenerators; gen = pagegenerators.PreloadingGenerator([protectedpage])
>>> for p in gen: print p.canBeEdited()
...
Getting 1 pages from wikipedia:fr...
True
>>> protectedpage.canBeEdited()
True
Which is wrong.
I then thought that calling canBeEdited on a fully protected page from
PreloadingGenerator would _every_time_ wrongly return True, but I was wrong:
>>> import wikipedia; s = wikipedia.Site('en', 'wikipedia'); protectedpage = wikipedia.Page(s, 'Wikipedia:site support')
Checked for running processes. 1 processes currently running, including the current process.
>>> import pagegenerators; gen = pagegenerators.PreloadingGenerator([protectedpage])
>>> for p in gen: print p.canBeEdited()
...
Getting 1 pages from wikipedia:en...
False
So apparently, in some special cases, calling canBeEdited on a fully protected
page from PreloadingGenerator returns True. However, I have not been able to
find out *when* exactly this happens.
Any thoughts?
PS: An easy fix for this is to drop the current implementation of
canBeEdited and use the new API-based getRestrictions instead
(getRestrictions()['edit']). But we should find out what is happening
here, since we might be missing something important.
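The workaround from the PS could look roughly like the sketch below. It
assumes that getRestrictions() returns a mapping whose 'edit' entry is None
for an unprotected page and a (level, expiry) pair otherwise; that shape and
the helper name can_edit are assumptions made for illustration and are not
confirmed by this report.

```python
def can_edit(restrictions, is_sysop=False):
    """Decide editability from a restrictions mapping.

    Assumed (hypothetical) shape: {'edit': None} for an unprotected page,
    {'edit': ('sysop', 'infinity')} for a fully protected one.
    """
    edit = restrictions.get('edit')
    if not edit:
        return True       # no edit restriction recorded
    level = edit[0]
    if level == 'sysop':
        return is_sysop   # fully protected: only sysop accounts may edit
    # Assumption: any other level (e.g. autoconfirmed) is editable by the bot.
    return True


print(can_edit({'edit': None}))
print(can_edit({'edit': ('sysop', 'infinity')}))
```

Deriving the answer from the restrictions data on every call avoids relying
on a cached flag that a preloading step may or may not have set.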
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55271
Web browser: ---
Bug ID: 55271
Summary: undetected edit conficts
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1204/
Reported by: xqt
Created on: 2010-07-06 10:41:29
Subject: undetected edit conficts
Original description:
The bot should detect edit conflicts caused by its own parallel task; see
http://zh.wikipedia.org/w/index.php?title=%E9%BA%92%E9%BA%9F%E6%93%8D%E4%BD…
and
http://zh.wikipedia.org/w/index.php?title=%E9%BA%92%E9%BA%9F%E6%93%8D%E4%BD…
version.py
Pywikipedia [http] trunk/pywikipedia (r8344, 2010/07/05, 17:49:14)
Python 2.5.2 (r252:60911, Feb 21 2008, 13:11:45) [MSC v.1310 32 bit (Intel)]
config-settings:
use_api = True
use_api_login = True
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55209
Web browser: ---
Bug ID: 55209
Summary: replace.py error reading xml file
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1417/
Reported by: paolobenve
Created on: 2012-03-04 18:05:03
Subject: replace.py error reading xml file
Original description:
$ python version.py
Pywikipedia [http] trunk/pywikipedia (r9969, 2012/03/04, 15:55:41)
Python 2.6.6 (r266:84292, Dec 26 2010, 22:31:48)
[GCC 4.4.5]
config-settings:
use_api = True
use_api_login = True
unicode test: triggers problem #3081100
It seems that replace.py cannot read my xml file:
$ python replace.py -ns:0 -dotall -xml:cathopedia.it.xml "1" "\1"
unicode test: triggers problem #3081100
Reading XML dump...
Traceback (most recent call last):
  File "/var/wwwc/cathopedia.org/pywikipedia/pagegenerators.py", line 1182, in __iter__
    for page in self.wrapped_gen:
  File "/var/wwwc/cathopedia.org/pywikipedia/pagegenerators.py", line 1039, in NamespaceFilterPageGenerator
    for page in generator:
  File "/var/wwwc/cathopedia.org/pywikipedia/pagegenerators.py", line 1084, in DuplicateFilterPageGenerator
    for page in generator:
  File "replace.py", line 224, in __iter__
    for entry in self.parser:
  File "/var/wwwc/cathopedia.org/pywikipedia/xmlreader.py", line 320, in new_parse
    for rev in self._parse(event, elem):
  File "/var/wwwc/cathopedia.org/pywikipedia/xmlreader.py", line 325, in _parse_only_latest
    if event == "end" and elem.tag == "{%s}page" % self.uri:
AttributeError: 'XmlDump' object has no attribute 'uri'
'XmlDump' object has no attribute 'uri'
0 pages were changed.
My XML file was generated with the command
mysqldump --xml -ubackup -p***** --databases db > cathopedia.it.xml
The version is
$ mysqldump --version
mysqldump Ver 10.13 Distrib 5.1.49, for debian-linux-gnu (x86_64)
I'm wondering whether I'm missing some mysqldump option, but I don't think so.
Actually, another script (add_text.py) reads the XML file without any
problem, and I have been able to use it many times.
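One possible explanation, which this report does not confirm, is that the
file is not a MediaWiki export at all: mysqldump --xml writes a generic
database dump, while the traceback shows xmlreader building the tag name
"{%s}page" from a namespace URI. The sketch below checks whether an XML file
declares a default namespace in the first place. The helper name
default_namespace and both sample documents are made up for illustration.

```python
import io
import xml.etree.ElementTree as ET


def default_namespace(source):
    """Return the default namespace URI declared on the root, or None."""
    for event, data in ET.iterparse(source, events=("start-ns", "start")):
        if event == "start-ns":
            prefix, uri = data
            if not prefix:        # empty prefix means xmlns="..."
                return uri
        else:
            # Root element reached without any default namespace.
            return None
    return None


# Illustrative samples only; real dumps are far larger.
export_like = b'<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.4/"><page/></mediawiki>'
mysqldump_like = b'<mysqldump xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"><database name="db"/></mysqldump>'

print(default_namespace(io.BytesIO(export_like)))
print(default_namespace(io.BytesIO(mysqldump_like)))
```

If the second case applies to cathopedia.it.xml, exporting the pages through
Special:Export or a MediaWiki dump script instead of mysqldump would be the
thing to try.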
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55173
Web browser: ---
Bug ID: 55173
Summary: Line ending inconsistency when using XmlReader
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1508/
Reported by: Anonymous user
Created on: 2012-08-31 19:08:06
Subject: Line ending inconsistency when using XmlReader
Original description:
The following example explains it all:
import wikipedia
p = wikipedia.Page(wikipedia.getSite(), "User:WGH")
wikipedia.getall(wikipedia.getSite(), [p])
print "\r\n" in p.get() # True (using Special:Export and, in turn, xmlreader)
print "\r\n" in p.get(force=True) # False (direct api)
It's caused by xmlreader replacing \n with \r\n for some reason. I suggest
removing that replacement altogether, because universal newlines are usually
used in Python. Also, \r\n line endings confuse the mwlib parser.
Patch:
https://sourceforge.net/tracker/?func=detail&aid=3563805&group_id=93107&at…
% python version.py
Pywikipedia trunk/pywikipedia/ \(r10511, 2012/08/30, 20:40:38\)
Python 2.7.3 \(default, Aug 31 2012, 16:05:44\)
[GCC 4.2.1 20070831 patched [FreeBSD]]
config-settings:
use\_api = True
use\_api\_login = True
unicode test: ok
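Until both code paths return the same line endings, a caller could normalize
the text itself. This is a minimal sketch; the helper name
normalize_newlines is hypothetical and not part of the framework.

```python
def normalize_newlines(text):
    """Convert CRLF and lone CR line endings to LF."""
    return text.replace("\r\n", "\n").replace("\r", "\n")


sample = "first line\r\nsecond line\nthird line"
print("\r\n" in normalize_newlines(sample))
```

Replacing "\r\n" before the lone "\r" keeps a CRLF pair from turning into two
newlines.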
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55259
Web browser: ---
Bug ID: 55259
Summary: xmlreader.py fails a lot
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1245/
Reported by: emijrp
Created on: 2010-10-03 13:51:00
Subject: xmlreader.py fails a lot
Original description:
Hi all;
I think there is an error in xmlreader.py. When parsing a full-history XML
dump (in this case [1]) using this code [2] (look at the try/except; it
writes to the console when it fails), I correctly get the username, timestamp
and revision id, but sometimes the page title and the page id are None or an
empty string.
The first error is:
['', None, 'QuartierLatin1968', '2004-10-10T04:24:14Z', '4267'] # note the
empty string for the title and the None for the page id
But if we do:
7za e -bd -so kwwiki-20100926-pages-meta-history.xml.7z 2>/dev/null | egrep -i '2004-10-10T04:24:14Z' -C20
We get this [3], which is fine: the page title and the page id are available
in the XML but are not parsed correctly. And this is not the only page title
and page id that fails.
Perhaps I have missed something, because I'm still learning to parse XML.
Sorry in that case.
Regards,
emijrp
[1] http://download.wikimedia.org/kwwiki/20100926/kwwiki-20100926-pages-meta-hi…
[2] http://pastebin.ca/1951930
[3] http://pastebin.ca/1951937
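With an incremental parser, an element's text is only guaranteed to be
complete at its "end" event, so reading a title or id at the "start" event
can give None or an empty string. Whether that is what goes wrong in
xmlreader.py is not established in this report. The sketch below reads
page-level fields at their "end" events using only the standard library; the
function name iter_revisions and the sample dump are made up for
illustration.

```python
import io
import xml.etree.ElementTree as ET


def iter_revisions(source):
    """Yield (title, page_id, user, timestamp, rev_id) for each revision."""
    stack = []                      # local names of the currently open elements
    title = page_id = None
    for event, elem in ET.iterparse(source, events=("start", "end")):
        tag = elem.tag.rsplit("}", 1)[-1]   # strip "{namespace}" if present
        if event == "start":
            stack.append(tag)
            continue
        stack.pop()
        parent = stack[-1] if stack else None
        if parent == "page" and tag == "title":
            title = elem.text               # complete: this is the end event
        elif parent == "page" and tag == "id":
            page_id = elem.text
        elif tag == "revision":
            ns = elem.tag[:-len("revision")]    # "{uri}" or ""
            yield (title, page_id,
                   elem.findtext(ns + "contributor/" + ns + "username"),
                   elem.findtext(ns + "timestamp"),
                   elem.findtext(ns + "id"))
            elem.clear()                    # free the finished revision
        elif tag == "page":
            title = page_id = None


# Made-up sample in the general shape of a MediaWiki export.
sample_dump = (
    b'<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.4/">'
    b'<page><title>Example</title><id>7</id>'
    b'<revision><id>100</id><timestamp>2004-10-10T04:24:14Z</timestamp>'
    b'<contributor><username>SomeUser</username><id>3</id></contributor>'
    b'</revision></page></mediawiki>'
)

for row in iter_revisions(io.BytesIO(sample_dump)):
    print(row)
```

Tracking the parent element keeps the page id apart from the revision and
contributor ids, which all use the same tag name.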
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55243
Web browser: ---
Bug ID: 55243
Summary: replaceCategoryLinks misplaces featured article links
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/1320/
Reported by: silvonen
Created on: 2011-06-12 10:47:24
Subject: replaceCategoryLinks misplaces featured article links
Original description:
I don't know if this is a known problem, but the function replaceCategoryLinks
in pywikibot/textlib.py moves featured/good article templates (e.g.,
{{Link FA|en}}) above categories, although these templates are usually placed
between categories and interwiki links. This has caused problems in one of the
bot scripts that we use in the Finnish Wikipedia.
I am attaching a patch proposed by zache.
>python version.py
Pywikipedia [http] trunk/pywikipedia (r9291, 2011/06/10, 04:46:24)
Python 2.5.4 (r254:67916, Jan 29 2009, 12:02:11) [MSC v.1310 32 bit (Intel)]
config-settings:
use_api = True
use_api_login = True
unicode test: ok
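The ordering described above (categories first, then featured/good article
templates, then interwiki links) can be sketched as follows. This is not the
attached patch and not the textlib implementation; the function name
build_footer and the blank line between blocks are assumptions for
illustration.

```python
def build_footer(categories, link_fa_templates, interwiki_links):
    """Join the footer blocks in the order described in the report."""
    blocks = [categories, link_fa_templates, interwiki_links]
    # Skip empty blocks so that no stray blank lines are produced.
    return "\n\n".join("\n".join(block) for block in blocks if block)


print(build_footer(["[[Category:Example]]"],
                   ["{{Link FA|en}}"],
                   ["[[en:Example]]", "[[sv:Example]]"]))
```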
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55654
Web browser: ---
Bug ID: 55654
Summary: Recursive clone over ssh fails
Product: Pywikibot
Version: core (2.0)
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: maarten(a)mdammers.nl
Classification: Unclassified
Mobile Platform: ---
<~/pywikibot>git clone --recursive ssh://multichill@gerrit.wikimedia.org:29418/pywikibot/core.git
Cloning into 'core'...
Warning: Permanently added the RSA host key for IP address '[208.80.154.81]:29418' to the list of known hosts.
Enter passphrase for key 'key':
remote: Counting objects: 10842, done
remote: Finding sources: 100% (10842/10842)
remote: Getting sizes: 100% (1495/1495)
remote: Compressing objects: 100% (2292733/2292733)
remote: Total 10842 (delta 7190), reused 10548 (delta 7165)
Receiving objects: 100% (10842/10842), 23.42 MiB | 326 KiB/s, done.
Resolving deltas: 100% (7337/7337), done.
Submodule 'externals/httplib2' (https://gerrit.wikimedia.org/r/p/pywikibot/externals/httplib2.git) registered for path 'externals/httplib2'
Submodule 'i18n' (https://gerrit.wikimedia.org/r/p/pywikibot/i18n.git) registered for path 'scripts/i18n'
Cloning into 'externals/httplib2'...
error: SSL certificate problem: self signed certificate in certificate chain while accessing https://gerrit.wikimedia.org/r/p/pywikibot/externals/httplib2.git/info/refs…
fatal: HTTP request failed
Clone of 'https://gerrit.wikimedia.org/r/p/pywikibot/externals/httplib2.git' into submodule path 'externals/httplib2' failed
I'm cloning over ssh. Why are the submodules being loaded over https?
--
https://bugzilla.wikimedia.org/show_bug.cgi?id=55308
Web browser: ---
Bug ID: 55308
Summary: Fullurl in imageharvest.py does not work
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs(a)lists.wikimedia.org
Reporter: legoktm.wikipedia(a)gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/bugs/897/
Reported by: dasch88
Created on: 2009-04-04 09:54:06
Subject: Fullurl in imageharvest.py does not work
Original description:
When a page gives its image links as full URLs, they are appended to the
page URL instead of being used as they are.
Script progress:
python imageharvest.py http://commons.wikimedia.org/wiki/User:DaSch/Bot-Export
Checked for running processes. 1 processes currently running, including the current process.
What text should be added at the end of the description of each image from this url? [[Import::04.04.2009]] [[Kategorie:Commons-Import]]
Include image http://commons.wikimedia.org/wiki/User:DaSch/http://commons.wikipedia.org/a… ([y]es, [N]o, [s]top) N
Include image http://commons.wikimedia.org/wiki/User:DaSch/http://upload.wikimedia.org/wi… ([y]es, [N]o, [s]top) y
Give the description of this image: 1789
Reading file http://commons.wikimedia.org/wiki/User:DaSch/http://upload.wikimedia.org/wi…
Traceback (most recent call last):
  File "imageharvest.py", line 117, in <module>
    main(url,image_url)
  File "imageharvest.py", line 95, in main
    uploadBot.run()
  File "/root/pywecowi/upload.py", line 296, in run
    return self.upload_image()
  File "/root/pywecowi/upload.py", line 119, in upload_image
    file = uo.open(self.url)
  File "/usr/lib/python2.5/urllib.py", line 190, in open
    return getattr(self, name)(url)
  File "/usr/lib/python2.5/urllib.py", line 338, in open_http
    return self.http_error(url, fp, errcode, errmsg, headers)
  File "/usr/lib/python2.5/urllib.py", line 355, in http_error
    return self.http_error_default(url, fp, errcode, errmsg, headers)
  File "/root/pywecowi/wikipedia.py", line 6954, in http_error_default
    raise PageNotFound(u'Page %s could not be retrieved. Check your family file ?' % url)
wikipedia.PageNotFound: Page //commons.wikimedia.org/wiki/User:DaSch/http://upload.wikimedia.org/wikipedia/commons/1/1b/electoralcollege1789.svg could not be retrieved. Check your family file ?
Version:
Pywikipedia [http] trunk/pywikipedia (r6569, Apr 03 2009, 16:17:41)
Python 2.5.2 (r252:60911, Jul 31 2008, 17:28:52)
[GCC 4.2.3 (Ubuntu 4.2.3-2ubuntu7)]
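The output above suggests that image links are concatenated onto the page
URL. A common way to handle both relative and absolute links is
urljoin from the standard library (urllib.parse in Python 3, urlparse in the
Python 2 used here). This is a minimal sketch, not the imageharvest.py code;
the helper name resolve_image_url and the example file names are made up.

```python
from urllib.parse import urljoin


def resolve_image_url(page_url, link):
    """Resolve link against page_url; absolute links are kept unchanged."""
    return urljoin(page_url, link)


page = "http://commons.wikimedia.org/wiki/User:DaSch/Bot-Export"
print(resolve_image_url(page, "http://upload.wikimedia.org/example.svg"))
print(resolve_image_url(page, "example.svg"))
```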
--