Bugs item #1859078, was opened at 2007-12-27 17:07
Message generated for change (Settings changed) made by leogregianin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=1859078&group_…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: None
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 5
Private: No
Submitted By: Nicolas Dumazet (nicdumz)
Assigned to: Nobody/Anonymous (nobody)
Summary: interwiki.py causing crash in wikipedia::removeLanguageLinks
Initial Comment:
The script appeared to run correctly for a while, then crashed:
Dump fr (wikipedia) saved
Traceback (most recent call last):
  File "interwiki.py", line 1587, in <module>
    bot.run()
  File "interwiki.py", line 1364, in run
    self.queryStep()
  File "interwiki.py", line 1338, in queryStep
    self.oneQuery()
  File "interwiki.py", line 1334, in oneQuery
    subject.workDone(self)
  File "interwiki.py", line 707, in workDone
    elif page.isEmpty() and not page.isCategory():
  File "/home/nico/projets/pywikipedia/wikipedia.py", line 846, in isEmpty
    txt = removeLanguageLinks(txt)
  File "/home/nico/projets/pywikipedia/wikipedia.py", line 3019, in removeLanguageLinks
    ['nowiki', 'comment', 'math', 'pre'], marker=marker)
  File "/home/nico/projets/pywikipedia/wikipedia.py", line 2795, in replaceExcept
    import weblinkchecker
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 3: ordinal not in range(128)
For the record, the exact command was:
python interwiki.py -autonomous -skipauto -start:!
Thanks,
Nicolas Dumazet.
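For reference, the class of error in the traceback can be reproduced in isolation. This is only a minimal sketch in present-day Python 3 syntax; the sample bytes and the helper name try_ascii are assumptions chosen to match the reported byte and position, not data or code from the actual run.

```python
# Hypothetical sample: the UTF-8 bytes for "café". Byte 0xc3 sits at index 3.
raw = b"caf\xc3\xa9"


def try_ascii(data):
    """Try an ASCII decode; return (True, text) or (False, (byte, position))."""
    try:
        return True, data.decode("ascii")
    except UnicodeDecodeError as err:
        # err.start is the index of the first byte the codec rejected.
        return False, (data[err.start], err.start)


ok, info = try_ascii(raw)

# Decoding the same bytes with the codec they were written in succeeds.
text = raw.decode("utf-8")
```

The failing byte and position reported by the helper line up with the message in the traceback; decoding with the right codec avoids the error.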
----------------------------------------------------------------------
Comment By: Rotem Liss (rotemliss)
Date: 2007-12-28 10:39
Message:
Logged In: YES
user_id=1327030
Originator: NO
Fixed in r4766.
----------------------------------------------------------------------
Comment By: Nicolas Dumazet (nicdumz)
Date: 2007-12-27 17:48
Message:
Logged In: YES
user_id=1963242
Originator: YES
This is apparently a consequence of r4765, since reverting to r4764 is a
temporary fix.
----------------------------------------------------------------------
Bugs item #1861698, was opened at 2008-01-01 12:14
Message generated for change (Settings changed) made by leogregianin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=1861698&group_…
Category: other
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 5
Private: No
Submitted By: Alex S.H. Lin (lin4h)
Assigned to: Nobody/Anonymous (nobody)
Summary: Featured.py cannot detect FA mark template with underscore
Initial Comment:
From the Japanese Wikipedia:
http://ja.wikipedia.org/w/index.php?title=%E3%83%8D%E3%82%B3&diff=17082088&…
This article is tagged {{Link_FA}}, written with an underscore, but the script didn't detect it.
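MediaWiki treats underscores and spaces in titles as the same character, so a detector has to accept both spellings. A minimal sketch of such a check follows; the function name has_link_fa and its default argument are hypothetical and are not taken from featured.py or from the fix applied later.

```python
import re


def has_link_fa(text, template="Link FA"):
    """Return True if `text` uses the template, spelled with spaces or underscores."""
    # Split the template name on spaces/underscores, then allow either
    # separator between the words when matching.
    words = re.split(r"[ _]+", template.strip())
    name = r"[ _]+".join(re.escape(w) for w in words)
    # The name must be followed by a parameter separator or the closing braces.
    pattern = re.compile(r"\{\{\s*" + name + r"\s*(?:\||\}\})")
    return pattern.search(text) is not None
```

With this pattern, {{Link FA|en}} and {{Link_FA|en}} are both detected, while a different template such as {{Link GA|en}} is not.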
----------------------------------------------------------------------
Comment By: Rotem Liss (rotemliss)
Date: 2008-01-05 05:19
Message:
Logged In: YES
user_id=1327030
Originator: NO
Should be fixed in r4800.
----------------------------------------------------------------------
Patches item #1861136, was opened at 2007-12-30 19:48
Message generated for change (Settings changed) made by leogregianin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603140&aid=1861136&group_…
Category: None
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 5
Private: No
Submitted By: Gerben van der Stouwe (gerbennn)
Assigned to: Nobody/Anonymous (nobody)
Summary: image.py namespace translation
Initial Comment:
This patches image.py to also look for images using the local namespace name. For example, on nl.wp
it will now look for [[Afbeelding:<arg>]] in addition to [[Image:<arg>]].
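The idea can be sketched as a single regular expression that accepts several namespace names. This is not the submitted patch: the function name image_link_regex and the hard-coded namespace tuple are assumptions for illustration, and a real bot would take the local name from its site configuration.

```python
import re


def image_link_regex(filename, namespaces=("Image", "Afbeelding")):
    """Build a regex matching [[<namespace>:<filename>...]] for any given namespace."""
    ns = "|".join(re.escape(n) for n in namespaces)
    # Spaces and underscores are interchangeable in file names.
    words = re.split(r"[ _]+", filename)
    name = r"[ _]+".join(re.escape(w) for w in words)
    # Optional "|caption or options" part before the closing brackets.
    return re.compile(
        r"\[\[\s*(?:" + ns + r")\s*:\s*" + name + r"\s*(?:\|[^\]]*)?\]\]"
    )


regex = image_link_regex("Foo bar.jpg")
```

The compiled pattern matches [[Afbeelding:Foo bar.jpg|thumb]] as well as [[Image:Foo_bar.jpg]].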
----------------------------------------------------------------------
Comment By: Rotem Liss (rotemliss)
Date: 2007-12-31 17:27
Message:
Logged In: YES
user_id=1327030
Originator: NO
Should be fixed in r4787 and r4788.
----------------------------------------------------------------------
Patches item #1864004, was opened at 2008-01-04 12:29
Message generated for change (Settings changed) made by leogregianin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603140&aid=1864004&group_…
Category: None
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 5
Private: No
Submitted By: Alex S.H. Lin (lin4h)
Assigned to: Nobody/Anonymous (nobody)
Summary: Fix wikipedia.py for MediaWiki r29225 update
Initial Comment:
The script cannot get all page information because of this MediaWiki update:
http://svn.wikimedia.org/viewvc/mediawiki/trunk/phase3/includes/SpecialAllp…
Double-checked on my bot server and my laptop.
----------------------------------------------------------------------
Comment By: Rotem Liss (rotemliss)
Date: 2008-01-04 14:20
Message:
Logged In: YES
user_id=1327030
Originator: NO
Fixed in r4795, in a backwards-compatible way. Note that the first part of
the patch seems to be a mistake.
----------------------------------------------------------------------
Comment By: Alex S.H. Lin (lin4h)
Date: 2008-01-04 13:45
Message:
Logged In: YES
user_id=1902991
Originator: YES
File Added: svnupdate.patch
----------------------------------------------------------------------
Feature Requests item #1515577, was opened at 2006-07-01 11:48
Message generated for change (Comment added) made by leogregianin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1515577&group_…
Category: None
Group: None
>Status: Closed
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
Assigned to: Nobody/Anonymous (nobody)
Summary: Wikier for lonely pages
Initial Comment:
There is a list of lonely pages in Wikipedia at
Special:Lonelypages. Some other pages, especially new
ones, also need links pointing to them, since linking
is part of the idea of Wikipedia. So I tried to code an
intrawikier module for pywikipediabot, but I failed.
The idea is that the bot crawls through Wikipedia
and tries to find the stem of the wanted word. The stem
could be supplied by hand, and when the bot has found
something, it would ask whether a link to the wanted
article should be added.
It must be quite close to replace.py, but I couldn't
manage it. Could someone here code it?
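The request can be sketched as a small text transformation: find the first word that starts with a given stem and is not already inside a link, and propose linking it. This is only an illustration of the idea; the function name propose_link is hypothetical, the confirmation prompt is left out, and nothing here is taken from replace.py or lonelypages.py.

```python
import re

# Existing wiki links; the capturing group keeps them in the split result.
LINK = re.compile(r"(\[\[.*?\]\])", re.DOTALL)


def propose_link(text, stem, target):
    """Link the first unlinked word starting with `stem` to the page `target`.

    Returns the modified text, or None if no candidate was found.
    """
    word = re.compile(r"\b(" + re.escape(stem) + r"\w*)")
    parts = LINK.split(text)  # even indexes: plain text, odd indexes: links
    for i in range(0, len(parts), 2):
        m = word.search(parts[i])
        if m:
            found = m.group(1)
            if found == target:
                link = "[[%s]]" % target
            else:
                link = "[[%s|%s]]" % (target, found)
            parts[i] = parts[i][:m.start()] + link + parts[i][m.end():]
            return "".join(parts)
    return None
```

A bot built on this would show the proposed text to the operator and save only after confirmation, as the request describes.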
----------------------------------------------------------------------
>Comment By: Leonardo Gregianin (leogregianin)
Date: 2008-01-05 08:10
Message:
Logged In: YES
user_id=1136737
Originator: NO
now you can run lonelypages.py ;)
----------------------------------------------------------------------
Comment By: Andre Engels (a_engels)
Date: 2006-12-19 16:13
Message:
Logged In: YES
user_id=843018
Originator: NO
I'm not really sure what you are looking for here... Could you explain
further?
----------------------------------------------------------------------
Revision: 4805
Author: rotem
Date: 2008-01-05 11:00:10 +0000 (Sat, 05 Jan 2008)
Log Message:
-----------
Getting moveRestriction and not only editRestriction from the HTML text, making Page._getEditPage return only the wiki text and not the other information, which is kept in the page variables instead.
Modified Paths:
--------------
trunk/pywikipedia/wikipedia.py
Modified: trunk/pywikipedia/wikipedia.py
===================================================================
--- trunk/pywikipedia/wikipedia.py 2008-01-05 10:18:19 UTC (rev 4804)
+++ trunk/pywikipedia/wikipedia.py 2008-01-05 11:00:10 UTC (rev 4805)
@@ -407,6 +407,7 @@
self._title = t
self.editRestriction = None
+ self.moveRestriction = None
self._permalink = None
self._userName = None
self._ipedit = None
@@ -592,7 +593,7 @@
# Make sure we did try to get the contents once
if not hasattr(self, '_contents'):
try:
- self._contents, self._isWatched, self.editRestriction = self._getEditPage(get_redirect = get_redirect, throttle = throttle, sysop = sysop, nofollow_redirects=nofollow_redirects)
+ self._contents = self._getEditPage(get_redirect = get_redirect, throttle = throttle, sysop = sysop, nofollow_redirects=nofollow_redirects)
hn = self.section()
if hn:
m = re.search("=+ *%s *=+" % hn, self._contents)
@@ -625,12 +626,8 @@
oldid - Retrieve an old revision (by id), not the current one
get_redirect - Get the contents, even if it is a redirect page
- This method returns a 3-tuple containing the raw wiki text as a
- unicode string, the watchlist status, and any edit restrictions.
-
+ This method returns the raw wiki text as a unicode string.
"""
- isWatched = False
- editRestriction = None
if verbose:
output(u'Getting page %s' % self.aslink())
path = self.site().edit_address(self.urlname())
@@ -704,12 +701,19 @@
# If read-only, check blocks.
if readonly and checkBlocks and self.site().isBlocked():
raise UserBlocked(self.site(), self.aslink(forceInterwiki = True))
- # Check locks
+ # Check for restrictions
m = re.search('var wgRestrictionEdit = \\["(\w+)"\\]', text)
if m:
if verbose:
output(u"DBG> page is locked for group %s" % m.group(1))
- editRestriction = m.group(1);
+ self.editRestriction = m.group(1);
+ else:
+ self.editRestriction = ''
+ m = re.search('var wgRestrictionMove = \\["(\w+)"\\]', text)
+ if m:
+ self.moveRestriction = m.group(1);
+ else:
+ self.moveRestriction = ''
# Look for the edit token
tokenloc = Rwatch.search(text)
if tokenloc:
@@ -742,7 +746,9 @@
# Look if the page is on our watchlist
matchWatching = Rwatchlist.search(text)
if matchWatching:
- isWatched = True
+ self._isWatched = True
+ else:
+ self._isWatched = False
# Now process the contents of the textarea
m = self.site().redirectRegex().match(text[i1:i2])
if m:
@@ -768,7 +774,7 @@
while x and x[-1] in '\n ':
x = x[:-1]
- return x, isWatched, editRestriction
+ return x
def getOldVersion(self, oldid, force=False, get_redirect=False,
throttle=True, sysop=False, nofollow_redirects=False,
@@ -784,7 +790,7 @@
sysop=sysop, oldid=oldid,
nofollow_redirects=nofollow_redirects,
change_edit_time=change_edit_time
- )[0]
+ )
def permalink(self):
"""Return the permalink URL for current revision of this page."""
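The restriction parsing added in r4805 can be sketched on its own. The two regular expressions are the ones from the diff above; the function name parse_restrictions and the sample HTML are assumptions made for illustration, and the real code stores the results on the Page object instead of returning them.

```python
import re

# Patterns as used in the r4805 diff (written here as raw strings).
R_EDIT = re.compile(r'var wgRestrictionEdit = \["(\w+)"\]')
R_MOVE = re.compile(r'var wgRestrictionMove = \["(\w+)"\]')


def parse_restrictions(html):
    """Return (edit_restriction, move_restriction); '' means none was found."""
    def first_group(regex):
        m = regex.search(html)
        return m.group(1) if m else ''
    return first_group(R_EDIT), first_group(R_MOVE)


# Hypothetical fragment of an edit page: edit is restricted, move is not.
sample = 'var wgRestrictionEdit = ["sysop"];\nvar wgRestrictionMove = [];'
restrictions = parse_restrictions(sample)
```

As in the diff, an empty string means the page was fetched and carries no restriction, which is distinct from the initial None set before any fetch.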
Patches item #1862810, was opened at 2008-01-03 01:41
Message generated for change (Settings changed) made by alleborgo
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603140&aid=1862810&group_…
Category: None
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 5
Private: No
Submitted By: Nicolas Dumazet (nicdumz)
Assigned to: Nobody/Anonymous (nobody)
Summary: wikipedia.py:html2unicode : html chars from #128 to #159
Initial Comment:
Codepoints from #128 to #159 are not printable characters in either ISO-8859-1 or Unicode (they are C1 control codes), hence HTML entities numbered in this range *are* illegal.
In practice, though, many websites use these entities, browsers all render them, and some of them can be found on our wikis.
I found this while working on a page containing #155 ( › ): html2unicode would convert it to unichr(155), which is a non-printable control character and far less suitable than the original entity. (Yes, that is actually the result you get on a wiki page.)
#128, which produces €, was also being converted to unichr(128), another control character.
Cheers,
Nicolas Dumazet.
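Browsers read these entities as Windows-1252 bytes, so one way to handle them is to decode the range 128 to 159 through that codec. The following is a sketch in present-day Python 3 syntax and is not the patch that was applied; the function name fix_c1_entity is hypothetical.

```python
def fix_c1_entity(codepoint):
    """Map a numeric HTML entity to a character, reading 128-159 as Windows-1252."""
    if 128 <= codepoint <= 159:
        try:
            return bytes([codepoint]).decode("cp1252")
        except UnicodeDecodeError:
            # A few bytes in this range (for example 0x81) are undefined in
            # Python's cp1252 codec; fall back to the raw codepoint for those.
            return chr(codepoint)
    return chr(codepoint)
```

With this mapping, #128 yields the euro sign and #155 yields the single right-pointing angle quotation mark, while codepoints outside the range are unchanged.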
----------------------------------------------------------------------
Comment By: Rotem Liss (rotemliss)
Date: 2008-01-05 09:29
Message:
Logged In: YES
user_id=1327030
Originator: NO
Applied, after some testing, to r4801. Thanks for the patch.
----------------------------------------------------------------------
Patches item #1862987, was opened at 2008-01-03 10:18
Message generated for change (Settings changed) made by alleborgo
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603140&aid=1862987&group_…
Category: None
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 5
Private: No
Submitted By: Swind_ja (swind_ja)
Assigned to: Nobody/Anonymous (nobody)
Summary: Adding ja message in isbn.py
Initial Comment:
As summary.
----------------------------------------------------------------------
Comment By: Rotem Liss (rotemliss)
Date: 2008-01-05 09:31
Message:
Logged In: YES
user_id=1327030
Originator: NO
Applied to r4802.
----------------------------------------------------------------------
Feature Requests item #1864162, was opened at 2008-01-04 22:11
Message generated for change (Comment added) made by rotemliss
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1864162&group_…
Category: None
Group: None
Status: Open
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
Assigned to: Nobody/Anonymous (nobody)
Summary: Error interwiki.py and sysop account
Initial Comment:
I'm using a sysop bot to add interwiki links to protected pages on pt.wikinews. The bot finds new interwiki links, but it doesn't save them.
This bot has sysop rights on pt.wikinews and es.wikinews, because both wikis protect pages to keep out spam bots and other problems.
Here is an example:
======Post-processing [[pt:Milhares de pessoas fogem do furacão Katrina em Nova Orleães, nos EUA]]======
Updating links on page [[es:Numerosas personas huyen del huracán Katrina en Nueva Orleans]].
No changes needed
Updating links on page [[pt:Milhares de pessoas fogem do furacão Katrina em Nova Orleães, nos EUA]].
Changes to be made: Modificando: [[en:Mandatory evacuation declared in New Orleans, US]]
+ [[de:Angst vor dem Hurrikan: New Orleans wird evakuiert]]
+ [[en:Mandatory evacuation declared in New Orleans, US]]
- [[en:Mandatory Evacuation declared in New Orleans]]
- [[de:Angst vor dem Hurrikan: New Orleans wird evakuiert]]
NOTE: Updating live wiki...
Page is locked, using sysop account.
Sleeping for 3.0 seconds, 2008-01-04 17:05:23
Changing page [[pt:Milhares de pessoas fogem do furacão Katrina em Nova Orleães, nos EUA]]
200 OK
WARNING: wikinews: [[pt:Milhares de pessoas fogem do furacão Katrina em Nova Orleães, nos EUA]] does not link to [[en:Mandatory evacuation declared in New Orleans, US]] but to [[en:Mandatory Evacuation declared in New Orleans]]
WARNING: wikinews: [[en:Mandatory evacuation declared in New Orleans, US]] links to incorrect [[nl:New-Orleans houdt hart vast voor komst Katrina]]
----------------------------------------------------------------------
Comment By: Rotem Liss (rotemliss)
Date: 2008-01-05 10:57
Message:
Logged In: YES
user_id=1327030
Originator: NO
It seems like an error Page.put didn't detect. Please replace line 1114 in
your copy of interwiki.py:
wikipedia.output(u'%s %s' % (status, reason))
(if this is not the line, update the file first) to:
wikipedia.output("Data: %s" % data)
(make sure to keep the spaces before the line!) and copy the results
here. This will output the whole HTML data from the server in case of an error
(note that it will contain many lines). Make sure to change the line back to the
previous text before using interwiki.py again.
----------------------------------------------------------------------