Bugs item #2065095, was opened at 2008-08-21 15:45
Message generated for change (Settings changed) made by purodha
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=2065095&group_…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: interwiki
Group: None
Status: Open
Resolution: None
>Priority: 7
Private: No
Submitted By: Purodha B Blissenbach (purodha)
Assigned to: Nobody/Anonymous (nobody)
>Summary: interwiki - Bogus "Bot blocked" ?
Initial Comment:
"Bot blocked" message but no block found.
python /home/purodha/pywikipedia/interwiki.py -v -putthrottle:1 -initialredirect -new:2
Checked for running processes. 1 processes currently running, including the current process.
Pywikipediabot (r5821 (wikipedia.py), Aug 20 2008, 15:32:53)
Python 2.5.2 (r252:60911, Aug 14 2008, 13:31:58)
[GCC 4.3.1]
Retrieving mediawiki messages from Special:Allmessages
WARNING: No character set found.
NOTE: Number of pages queued is 0, trying to add 60 more.
Getting 2 pages from wikipedia:ksh...
[[Betty Hutton]]: [[ksh:Betty Hutton]] gives new interwiki [[cy:Betty Hutton]]
--- few lines skipped ---
Updating links on page [[da:Betty Hutton]].
Changes to be made: Tilfjer: [[ksh:Betty Hutton]]
+ [[ksh:Betty Hutton]]
NOTE: Updating live wiki...
Getting information for site wikipedia:da
WARNING: Your account on wikipedia:da is blocked. Editing using this account will stop the run.
Getting information for site wikipedia:da
Getting information for site wikipedia:da
Dump ksh (wikipedia) saved
Traceback (most recent call last):
File "/home/purodha/pywikipedia/interwiki.py", line 1760, in <module>
bot.run()
File "/home/purodha/pywikipedia/interwiki.py", line 1497, in run
self.queryStep()
File "/home/purodha/pywikipedia/interwiki.py", line 1476, in queryStep
subj.finish(self)
File "/home/purodha/pywikipedia/interwiki.py", line 1057, in finish
if self.replaceLinks(page, new, bot):
File "/home/purodha/pywikipedia/interwiki.py", line 1215, in replaceLinks
status, reason, data = page.put(newtext, comment = wikipedia.translate(page.site().lang, msg)[0] + mods)
File "/home/purodha/pywikipedia/wikipedia.py", line 1264, in put
self.site().checkBlocks(sysop = sysop)
File "/home/purodha/pywikipedia/wikipedia.py", line 4191, in checkBlocks
raise UserBlocked('User is blocked in site %s' % self)
wikipedia.UserBlocked: User is blocked in site wikipedia:da
I've inspected all the logs on dawiki, there is n block of user:Purbo_T or toolserver.org (91.198.174.203) to be found.
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=2065095&group_…
Bugs item #2065095, was opened at 2008-08-21 15:45
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=2065095&group_…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: interwiki
Group: None
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: Purodha B Blissenbach (purodha)
Assigned to: Nobody/Anonymous (nobody)
Summary: pywikipedia - Bogus "Bot blocked" ?
Initial Comment:
"Bot blocked" message but no block found.
python /home/purodha/pywikipedia/interwiki.py -v -putthrottle:1 -initialredirect -new:2
Checked for running processes. 1 processes currently running, including the current process.
Pywikipediabot (r5821 (wikipedia.py), Aug 20 2008, 15:32:53)
Python 2.5.2 (r252:60911, Aug 14 2008, 13:31:58)
[GCC 4.3.1]
Retrieving mediawiki messages from Special:Allmessages
WARNING: No character set found.
NOTE: Number of pages queued is 0, trying to add 60 more.
Getting 2 pages from wikipedia:ksh...
[[Betty Hutton]]: [[ksh:Betty Hutton]] gives new interwiki [[cy:Betty Hutton]]
--- few lines skipped ---
Updating links on page [[da:Betty Hutton]].
Changes to be made: Tilfjer: [[ksh:Betty Hutton]]
+ [[ksh:Betty Hutton]]
NOTE: Updating live wiki...
Getting information for site wikipedia:da
WARNING: Your account on wikipedia:da is blocked. Editing using this account will stop the run.
Getting information for site wikipedia:da
Getting information for site wikipedia:da
Dump ksh (wikipedia) saved
Traceback (most recent call last):
File "/home/purodha/pywikipedia/interwiki.py", line 1760, in <module>
bot.run()
File "/home/purodha/pywikipedia/interwiki.py", line 1497, in run
self.queryStep()
File "/home/purodha/pywikipedia/interwiki.py", line 1476, in queryStep
subj.finish(self)
File "/home/purodha/pywikipedia/interwiki.py", line 1057, in finish
if self.replaceLinks(page, new, bot):
File "/home/purodha/pywikipedia/interwiki.py", line 1215, in replaceLinks
status, reason, data = page.put(newtext, comment = wikipedia.translate(page.site().lang, msg)[0] + mods)
File "/home/purodha/pywikipedia/wikipedia.py", line 1264, in put
self.site().checkBlocks(sysop = sysop)
File "/home/purodha/pywikipedia/wikipedia.py", line 4191, in checkBlocks
raise UserBlocked('User is blocked in site %s' % self)
wikipedia.UserBlocked: User is blocked in site wikipedia:da
I've inspected all the logs on dawiki, there is n block of user:Purbo_T or toolserver.org (91.198.174.203) to be found.
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=2065095&group_…
Revision: 5828
Author: purodha
Date: 2008-08-21 15:37:26 +0000 (Thu, 21 Aug 2008)
Log Message:
-----------
Documentation enhancements and typos
Modified Paths:
--------------
trunk/pywikipedia/interwiki.py
Modified: trunk/pywikipedia/interwiki.py
===================================================================
--- trunk/pywikipedia/interwiki.py 2008-08-21 14:32:44 UTC (rev 5827)
+++ trunk/pywikipedia/interwiki.py 2008-08-21 15:37:26 UTC (rev 5828)
@@ -115,10 +115,12 @@
of the page always, rather than doing so only when being
asked for (by typing '?'). Only useful in combination
with a hint-asking option like -untranslated, -askhints
- or -untranslatedonly. (note: without ending colon)
+ or -untranslatedonly.
+ (note: without ending colon)
- -noauto: Do not use the automatic translation feature for years and
+ -noauto Do not use the automatic translation feature for years and
dates, only use found links and hints.
+ (note: without ending colon)
These arguments define how much user confirmation is required:
@@ -146,10 +148,10 @@
These arguments specify in which way the bot should follow interwiki links:
- -noredirect do not follow redirects (note: without ending columns).
+ -noredirect do not follow redirects (note: without ending colon)
-initialredirect work on target if a redirect is entered on the command
- line (note: without ending columns).
+ line (note: without ending colon)
-neverlink: used as -neverlink:xx where xx is a language code:
Disregard any links found to language xx. You can also
Bugs item #2064976, was opened at 2008-08-21 16:44
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=2064976&group_…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: General
Group: None
Status: Open
Resolution: None
Priority: 7
Private: No
Submitted By: Multichill (multichill)
Assigned to: Nobody/Anonymous (nobody)
Summary: All pages soup problems
Initial Comment:
While running python2.4 imageuncat.py -start:Image:Chironomidae
Working on Image:Cicada.ogg
Got category Category:Images transwikied by BetacommandBot
Working on Image:Cicada.png
Got category Category:Magicicada
Working on Image:Cicada0001.jpg
Got category Category:Cicadellidae
Traceback (most recent call last):
File "/home/bot/pywikipedia/pagegenerators.py", line 755, in __iter__
for page in self.wrapped_gen:
File "/home/bot/pywikipedia/pagegenerators.py", line 688, in DuplicateFilterPageGenerator
for page in generator:
File "/home/bot/pywikipedia/pagegenerators.py", line 239, in AllpagesPageGenerator
for page in site.allpages(start = start, namespace = namespace, includeredirects = includeredirects):
File "/home/bot/pywikipedia/wikipedia.py", line 5169, in allpages
for p in soup.api.query.allpages:
AttributeError: 'NoneType' object has no attribute 'query'
'NoneType' object has no attribute 'query'
Pywikipedia [http] trunk/pywikipedia (r5827, Aug 21 2008, 14:32:44)
Python 2.4.4 (#1, Jun 11 2007, 23:35:50)
[GCC 3.3.3 (NetBSD nb3 20040520)]
Why are we using BeautifulSoup anyway? We dont need to screen-scrape the API.
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=2064976&group_…
Revision: 5826
Author: multichill
Date: 2008-08-21 14:26:24 +0000 (Thu, 21 Aug 2008)
Log Message:
-----------
Country filter and option to only filter.
Modified Paths:
--------------
trunk/pywikipedia/imagerecat.py
Modified: trunk/pywikipedia/imagerecat.py
===================================================================
--- trunk/pywikipedia/imagerecat.py 2008-08-21 13:23:10 UTC (rev 5825)
+++ trunk/pywikipedia/imagerecat.py 2008-08-21 14:26:24 UTC (rev 5826)
@@ -23,23 +23,41 @@
category_blacklist = [u'Hidden categories',
u'Stub pictures']
-def categorizeImages(generator):
+countries = []
+
+def getCountries():
+ '''
+ Get the list of countries from Commons.
+ '''
+ result = []
+ countryPage = wikipedia.Page(wikipedia.getSite(), u'User:Multichill/Countries')
+ for country in countryPage.linkedPages():
+ result.append(country.titleWithoutNamespace())
+ return result
+
+def categorizeImages(generator, onlyfilter):
+ '''
+ Loop over all images in generator and try to categorize them. Get category suggestions from CommonSense.
+ '''
for page in generator:
if page.exists() and (page.namespace() == 6) and (not page.isRedirectPage()):
imagepage = wikipedia.ImagePage(page.site(), page.title())
#imagepage.get()
wikipedia.output(u'Working on ' + imagepage.title());
currentCats = getCurrentCats(imagepage)
- commonshelperCats = getCommonshelperCats(imagepage)
+ if(onlyfilter):
+ commonshelperCats = []
+ else:
+ commonshelperCats = getCommonshelperCats(imagepage)
newcats = filterBlacklist(commonshelperCats+currentCats)
newcats = filterDisambiguation(newcats)
newcats = followRedirects(newcats)
- #newcats = filterCountries(newcats)
+ newcats = filterCountries(newcats)
newcats = filterParents(newcats)
- if len(newcats) > 0:
+ if (len(newcats) > 0 and not(set(currentCats)==set(newcats))):
for cat in newcats:
wikipedia.output(u' Found new cat: ' + cat);
- saveImagePage(imagepage, newcats)
+ saveImagePage(imagepage, newcats, onlyfilter)
def getCurrentCats(imagepage):
@@ -54,7 +72,7 @@
def getCommonshelperCats(imagepage):
'''
- Get category suggestions from commonshelper. Parse them and return a list of suggestions.
+ Get category suggestions from CommonSense. Parse them and return a list of suggestions.
'''
result = []
parameters = urllib.urlencode({'i' : imagepage.titleWithoutNamespace().encode('utf-8'), 'r' : 'on', 'go-clean' : 'Find+Categories', 'cl' : 'li'})
@@ -82,6 +100,9 @@
def filterBlacklist(categories):
+ '''
+ Filter out categories which are on the blacklist.
+ '''
result = []
for cat in categories:
if (cat not in category_blacklist):
@@ -90,6 +111,9 @@
def filterDisambiguation(categories):
+ '''
+ Filter out disambiguation categories.
+ '''
result = []
for cat in categories:
if(not wikipedia.Page(wikipedia.getSite(), u'Category:' + cat).isDisambig()):
@@ -97,6 +121,9 @@
return result
def followRedirects(categories):
+ '''
+ If a category is a redirect, replace the category with the target.
+ '''
result = []
for cat in categories:
categoryPage = wikipedia.Page(wikipedia.getSite(), u'Category:' + cat)
@@ -110,13 +137,40 @@
def filterCountries(categories):
- result = []
- return result
+ '''
+ Try to filter out ...by country categories.
+ First make a list of any ...by country categories and try to find some countries.
+ If a by country category has a subcategoy containing one of the countries found, add it.
+ The ...by country categories remain in the set and should be filtered out by filterParents.
+ '''
+ result = categories
+ listByCountry = []
+ listCountries = []
+ for cat in categories:
+ if (cat.endswith(u'by country')):
+ listByCountry.append(cat)
+
+ #If cat contains 'by country' add it to the list
+ #If cat contains the name of a country add it to the list
+ else:
+ for country in countries:
+ if not(cat.find(country)==-1):
+ listCountries.append(country)
+
+ if(len(listByCountry) > 0):
+ for bc in listByCountry:
+ category = catlib.Category(wikipedia.getSite(), u'Category:' + bc)
+ for subcategory in category.subcategories():
+ for country in listCountries:
+ if (subcategory.titleWithoutNamespace().endswith(country)):
+ result.append(subcategory.titleWithoutNamespace())
+
+ return list(set(result))
def filterParents(categories):
'''
- Remove the current categories from the suggestions and remove blacklisted cats.
+ Remove all parent categories from the set to prevent overcategorization.
'''
result = []
toFilter = u''
@@ -135,18 +189,29 @@
return result
-def saveImagePage(imagepage, newcats):
+def saveImagePage(imagepage, newcats, onlyfilter):
+ '''
+ Remove the old categories and add the new categories to the image.
+ '''
newtext = wikipedia.removeCategoryLinks(imagepage.get(), imagepage.site())
newtext = removeTemplates(newtext) + u'{{subst:chc}}\n'
for category in newcats:
newtext = newtext + u'[[Category:' + category + u']]\n'
+
+ if(onlyfilter):
+ comment = u'Filtering categories'
+ else:
+ comment = u'Image is categorized by a bot using data from [[Commons:Tools#CommonSense|CommonSense]]'
wikipedia.showDiff(imagepage.get(), newtext)
- imagepage.put(newtext, u'Image is categorized by a bot using data from [[Commons:Tools#CommonSense|CommonSense]]')
+ imagepage.put(newtext, comment)
return
def removeTemplates(oldtext = u''):
+ '''
+ Remove {{Uncategorized}} and {{Check categories}} templates
+ '''
result = u''
result = re.sub(u'\{\{\s*([Uu]ncat(egori[sz]ed( image)?)?|[Nn]ocat|[Nn]eedscategory)[^}]*\}\}', u'', oldtext)
result = re.sub(u'<!-- Remove this line once you have added categories -->', u'', result)
@@ -158,8 +223,8 @@
'''
Main loop. Get a generator. Set up the 3 threads and the 2 queue's and fire everything up.
'''
- generator = None;
-
+ generator = None
+ onlyfilter = False
genFactory = pagegenerators.GeneratorFactory()
site = wikipedia.getSite(u'commons', u'commons')
@@ -170,12 +235,15 @@
generator = [wikipedia.Page(site, wikipedia.input(u'What page do you want to use?'))]
else:
generator = [wikipedia.Page(site, arg[6:])]
+ elif arg == '-onlyfilter':
+ onlyfilter = True
else:
generator = genFactory.handleArg(arg)
if not generator:
generator = pagegenerators.CategorizedPageGenerator(catlib.Category(site, u'Category:Media needing categories'), recurse=True)
-
- categorizeImages(generator)
+ global countries
+ countries = getCountries()
+ categorizeImages(generator, onlyfilter)
wikipedia.output(u'All done')
Bugs item #1879122, was opened at 2008-01-24 20:26
Message generated for change (Comment added) made by wikipedian
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=1879122&group_…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: other
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 6
Private: No
Submitted By: Andr Malafaya Baptista (malafaya)
Assigned to: Nobody/Anonymous (nobody)
Summary: Double redirect fixes not coding special characters in EO.WP
Initial Comment:
A user of the Esperanto Wikipedia alerted me to this problem.
The command:
redirect.py -lang:eo double
messes up redirects to articles containing the special character convention in Esperanto sx, cx, ux, etc.. Check this edit made by my bot:
http://eo.wikipedia.org/w/index.php?title=Steindorf&action=edit&oldid=14450…
Instead of using the special character '', it mangles it as 'Sxx' (not even 'Sx'), causing it to be broken (I believe it's even not well interpreted by the Wiki, because 'Sxx' is being rendered as 'Sx').
----------------------------------------------------------------------
>Comment By: Daniel Herding (wikipedian)
Date: 2008-08-21 12:23
Message:
Logged In: YES
user_id=880694
Originator: NO
I think this is fixed now:
http://eo.wikipedia.org/w/index.php?title=Bourne_%C5%9Celo&diff=1900153&old…
----------------------------------------------------------------------
Comment By: AndreasJS (andreasjs)
Date: 2008-01-26 03:49
Message:
Logged In: YES
user_id=1738850
Originator: NO
Titles have to be x-decoded to work as links.
See patch 1880140
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=1879122&group_…
Bugs item #1988290, was opened at 2008-06-08 20:18
Message generated for change (Comment added) made by wikipedian
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=1988290&group_…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: interwiki
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 5
Private: No
Submitted By: Marc-Etienne Vargenau (vargenau)
Assigned to: Nobody/Anonymous (nobody)
Summary: Bot wants to remove existing eo: pages
Initial Comment:
[Ordinateur-de-Cathy-Vargenau:~/Applications/Escarbot/pywikipedia] vargenau% python interwiki.py en:vreux
Checked for running processes. 1 processes currently running, including the current process.
Getting 1 pages from wikipedia:en...
[[en:vreux]]: [[en:vreux]] gives new interwiki [[ro:vreux]]
[[en:vreux]]: [[en:vreux]] gives new interwiki [[eo:vre]]
[[en:vreux]]: [[en:vreux]] gives new interwiki [[es:vreux]]
Getting 1 pages from wikipedia:fr...
Getting 1 pages from wikipedia:zh...
Getting 1 pages from wikipedia:eo...
NOTE: [[eo:vre]] does not exist
Getting 1 pages from wikipedia:es...
======Post-processing [[en:vreux]]======
Updating links on page [[ro:vreux]].
Changes to be made: nlturat: [[eo:vre]]
- [[eo:vre]]
ERROR: Found incorrect link to eo in [[ro:vreux]]
Submit? ([y]es, [n]o, [g]ive up)
----------------------------------------------------------------------
>Comment By: Daniel Herding (wikipedian)
Date: 2008-08-21 12:20
Message:
Logged In: YES
user_id=880694
Originator: NO
The interwiki bot now runs on all the described examples without any
problems.
Also, I just fixed the summary message bug that julianmendez described.
----------------------------------------------------------------------
Comment By: Julian Mendez (julianmendez)
Date: 2008-08-02 18:55
Message:
Logged In: YES
user_id=1600163
Originator: NO
Hello,
The interwiki links seem to be right, but the summaries are still wrong.
Please, have a look at:
http://eo.wikipedia.org/w/index.php?title=Belorusio&diff=1862920&oldid=1859…
(r5771)
Thank you.
Regards,
Julian.
----------------------------------------------------------------------
Comment By: Marc-Etienne Vargenau (vargenau)
Date: 2008-06-11 10:11
Message:
Logged In: YES
user_id=1118700
Originator: YES
Hello,
I ran my bot on page vreux, the interwikis should be OK not.
Regards,
Marc-Etienne
----------------------------------------------------------------------
Comment By: Multichill (multichill)
Date: 2008-06-10 21:54
Message:
Logged In: YES
user_id=1777493
Originator: NO
Ok i now see your problem. I did
http://nl.wikipedia.org/w/index.php?title=.cx&diff=12689610&oldid=12597668
to have a hint ( i added [[eo:.cx]] )
Getting 1 pages from wikipedia:eo...
NOTE: [[eo:.c]] is redirect to [[eo:.c]]
Getting 1 pages from wikipedia:cv...
That's really strange. Further on:
======Post-processing [[nl:.cx]]======
Updating links on page [[zh:.cx]].
No changes needed
Updating links on page [[ro:.cx]].
No changes needed
Updating links on page [[th:.cx]].
No changes needed
Updating links on page [[tk:.cx]].
Changes to be made: Kaldiriliyor: [[eo:.c]]
- [[eo:.cx]]
ERROR: Found incorrect link to eo in [[tk:.cx]]
Submit? ([y]es, [n]o, [g]ive up)
ERROR: Found incorrect link to eo in [[tk:.cx]]
Submit? ([y]es, [n]o, [g]ive up) n
Updating links on page [[az:.cx]].
No changes needed
Updating links on page [[tg:.cx]].
No changes needed
Updating links on page [[pl:.cx]].
No changes needed
Updating links on page [[no:.cx]].
No changes needed
Updating links on page [[ja:.cx]].
No changes needed
Updating links on page [[nl:.cx]].
Changes to be made: Eraf: [[eo:.c]]
- [[eo:.cx]]
ERROR: Found incorrect link to eo in [[nl:.cx]]
Submit? ([y]es, [n]o, [g]ive up)
etc etc etc,
I noticed these two functions in wikipedia.py:
decodeEsperantoX: decode Esperanto text using the x convention.
encodeEsperantoX: convert wikitext to the Esperanto x-encoding.
def decodeEsperantoX(text):
"""
Decode Esperanto text encoded using the x convention.
E.g., Cxefpagxo and CXefpagXo will both be converted to
<C4><88>efpa<C4><9D>
o.
Note that to encode non-Esperanto words like Bordeaux, one uses a
double x, i.e. Bordeauxx or BordeauxX.
def encodeEsperantoX(text):
"""
Convert standard wikitext to the Esperanto x-encoding.
Double X-es where necessary so that we can submit a page to an
Esperanto
wiki. Again, we have to keep stupid stuff like cXxXxxX in mind. Maybe
someone wants to write about the Sony Cyber-shot DSC-Uxx camera series
on
eo: ;)
"""
These functions are probably broken. Why is this hack anyway?
----------------------------------------------------------------------
Comment By: Carsrac (carsrac)
Date: 2008-06-10 21:39
Message:
Logged In: YES
user_id=2066902
Originator: NO
An other example is [[.cx]] try add an iw link that from any of the
popular languages to eo. There is an eo article called [[eo:.cx]], but no
redirect can make sure that the correct eo will detected by the bot. Maybe
there is an overtranslation going on. If I translate [[ro:vreux]] to eo I
will try to create [[eo:vreux]] and not [[eo:vre]].
In the past did it work, but for several days it doesn't work any more.
----------------------------------------------------------------------
Comment By: Marc-Etienne Vargenau (vargenau)
Date: 2008-06-09 10:46
Message:
Logged In: YES
user_id=1118700
Originator: YES
Hello,
Perhaps a better example is fr:Deux-S%C3%A8vres that links to
eo:Deux-S%C3%A8vres
My example with vreux was bad as in fact the interwiki was indeed
eo:vre
Best regards,
Marc-Etienne
----------------------------------------------------------------------
Comment By: Multichill (multichill)
Date: 2008-06-09 10:28
Message:
Logged In: YES
user_id=1777493
Originator: NO
Does any of the pages link to the correct eo article?
http://en.wikipedia.org/wiki/%C3%89vreux links to
http://eo.wikipedia.org/wiki/%C3%89vre%C5%AD ( [[eo:vre]] )
----------------------------------------------------------------------
Comment By: Marc-Etienne Vargenau (vargenau)
Date: 2008-06-09 10:07
Message:
Logged In: YES
user_id=1118700
Originator: YES
Hello,
Sorry if I was not clear enough, but you totally missed the point.
Of course, page eo:vre does not exist. The correct page is eo:vreux
The problem is that the bot incorrectly translates "ux" to "".
Best regards,
Marc-Etienne
----------------------------------------------------------------------
Comment By: Multichill (multichill)
Date: 2008-06-08 21:28
Message:
Logged In: YES
user_id=1777493
Originator: NO
This is not an error.
http://eo.wikipedia.org/wiki/%C3%89vre%C5%AD does not exist
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=1988290&group_…
Bugs item #1988771, was opened at 2008-06-09 12:17
Message generated for change (Comment added) made by wikipedian
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=1988771&group_…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: other
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 6
Private: No
Submitted By: siebrand (siebrand)
Assigned to: Nobody/Anonymous (nobody)
Summary: Encoding issues with Esperanto
Initial Comment:
Output:
siebrand@v05-s341:~/nl$ python nowcommons.py -lang:eo -autonomous -replacealways -replaceloose
WARNING: Configuration variable 'language' is defined but unknown. Misspelled?
Checked for running processes. 1 processes currently running, including the current process.
Getting references to [[ablono:Nun en komunejo]]
NOTE: You have new messages on wikipedia:eo
>>> Dosiero:LieioMapoAmel.png <<<
Traceback (most recent call last):
File "nowcommons.py", line 231, in <module>
main()
File "nowcommons.py", line 227, in main
bot.run()
File "nowcommons.py", line 222, in run
wikipedia.output(u'%s' % e)
UnicodeEncodeError: 'ascii' codec can't encode character u'\u011d' in position 35: ordinal not in range(128)
This is about the page: http://eo.wikipedia.org/wiki/Dosiero:LiegxioMapoFleron.png
Esperanto uses LanguageEo.php with some iconv magic. (http://svn.wikimedia.org/viewvc/mediawiki/trunk/phase3/languages/classes/La…) .
----------------------------------------------------------------------
>Comment By: Daniel Herding (wikipedian)
Date: 2008-08-21 12:13
Message:
Logged In: YES
user_id=880694
Originator: NO
This is fixed now that I reverted this:
r5305 has changed Page::title() in wikipedia.py ("Page.title(): Esperanto
decoding, fix bug 1425206 and patch 1880140")
----------------------------------------------------------------------
Comment By: AndreasJS (andreasjs)
Date: 2008-06-23 23:44
Message:
Logged In: YES
user_id=1738850
Originator: NO
With this chante, bug [ 1879122 ] Double redirect fixes not coding special
characters in EO.WP reappears. A solution has to be found that fixes both
bugs.
----------------------------------------------------------------------
Comment By: siebrand (siebrand)
Date: 2008-06-12 10:39
Message:
Logged In: YES
user_id=1107255
Originator: YES
Ouch. No. It works if I remove those 4 lines, indeed. Now why are they in
there! :)
----------------------------------------------------------------------
Comment By: Francesco Cosoleto (cosoleto)
Date: 2008-06-12 10:23
Message:
Logged In: YES
user_id=181280
Originator: NO
I don't fixed the bug in SVN. Have you edited wikipedia.py in
Page::title() as described?
----------------------------------------------------------------------
Comment By: siebrand (siebrand)
Date: 2008-06-12 10:16
Message:
Logged In: YES
user_id=1107255
Originator: YES
It appears there is another issue related/behind this:
python nowcommons.py -autonomous -replaceloose -replacealways -lang:eo
WARNING: Configuration variable 'language' is defined but unknown.
Misspelled?
Checked for running processes. 1 processes currently running, including
the current process.
Getting references to [[ablono:Nun en komunejo]]
>>> Dosiero:LieioMapoAmel.png <<<
Image file URL for [[eo:Dosiero:LieioMapoAmel.png]] not found.
----------------------------------------------------------------------
Comment By: Francesco Cosoleto (cosoleto)
Date: 2008-06-12 08:38
Message:
Logged In: YES
user_id=181280
Originator: NO
>>> import wikipedia
>>> s=wikipedia.Site('eo')
>>> p=wikipedia.ImagePage(s, 'Dosiero:LiegxioMapoAmel.png')
>>> p.title()
u'Dosiero:Lie\u011dioMapoAmel.png'
>>> p.get()
[...]
wikipedia.NoPage: (wikipedia:eo,
u'[[eo:Dosiero:Lie\u011dioMapoAmel.png]]')
r5305 has changed Page::title() in wikipedia.py ("Page.title(): Esperanto
decoding, fix bug 1425206 and patch 1880140") [*],
If I remove:
if self.site().lang == 'eo':
title = decodeEsperantoX(title)
pages are found and the nowcommons.py seems full working. But maybe there
are problems with redirect.py.
[*]
[ 1880140 ] Decode Esperanto titles
http://sourceforge.net/tracker/index.php?func=detail&aid=1880140&group_id=9…
[ 1425206 ] Esperanto X-Convention and interwiki
http://sourceforge.net/tracker/index.php?func=detail&aid=1425206&group_id=9…
----------------------------------------------------------------------
Comment By: Marc-Etienne Vargenau (vargenau)
Date: 2008-06-11 10:25
Message:
Logged In: YES
user_id=1118700
Originator: NO
See also bug 1988290
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=1988771&group_…