Bugs item #2825996, was opened at 2009-07-23 17:01
Message generated for change (Comment added) made by silvonen
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=282599…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: JAn (jandudik)
Assigned to: Nobody/Anonymous (nobody)
Summary: neverending loading
Initial Comment:
I run bot
interwiki.py -force -whenneeded -continue -lang:na
In dump was list of articles, every line unique
But bot still loads new pages,
(NOTE: The first unfinished subject is [[na:Uetersen]]
NOTE: Number of pages queued is 99, trying to add 60 more. )
so I interrupted it and in dump there were many links more than 1 time. (see
attachments)
after [[Żagań]] loaded [[Tarnów]]
----------------------------------------------------------------------
Comment By: Mikko Silvonen (silvonen)
Date: 2009-07-29 20:16
Message:
Bots now get stuck also when you use the -start parameter near the end of
the wiki. I do think this is a bug.
Here's an example on a small wiki:
python version.py
Pywikipedia [http]
trunk/pywikipedia (r7101, 2009/07/27, 15:54:13)
Python 2.5.4 (r254:67916, Jan 29 2009, 12:02:11) [MSC v.1310 32 bit
(Intel)]
pagegenerators.py -lang:fiu-vro -start:Z
Ähijärv
Äksi kihlkund
Ängä oja
Äniniidü lump
Ärnu jõgi
Ärq ei lääq
Ärqseletüs
Äühvoja
Õdagu-Viro maakund
Õdagumeresoomõ keeleq
Õdagumeri
Õdri järv
Õigustiidüs
Õuraasia
Õuro
Õuruupa
Õuruupa Liit
Üleherküs
Ülembjärv
Ülemine järv
Ülemäne jõgi
Ülene internetitunnus
Üräski oja
Ütidse osaga hulgaq
Ütidse osalda hulgaq
Ütine osa
Ütiskuningriik
Üts ummamuudu liin
Ütsik täht
Üvvärjärv
Üü tulõk
Üübjärv
Dông Hoi
Swinoujscie
Swietochlowice
Zemaidi kiil
When I press Ctrl+C, I get this traceback.
Traceback (most recent call last):
File "C:\svn\pywikipedia\pagegenerators.py", line 1138, in <module>
for page in gen:
File "C:\svn\pywikipedia\pagegenerators.py", line 738, in
DuplicateFilterPageGenerator
for page in generator:
File "C:\svn\pywikipedia\pagegenerators.py", line 263, in
AllpagesPageGenerator
for page in site.allpages(start = start, namespace = namespace,
includeredirects = includeredirects):
File "c:\svn\pywikipedia\wikipedia.py", line 5681, in allpages
get_throttle()
File "c:\svn\pywikipedia\wikipedia.py", line 3373, in __call__
time.sleep(waittime)
KeyboardInterrupt
----------------------------------------------------------------------
Comment By: Nobody/Anonymous (nobody)
Date: 2009-07-24 10:14
Message:
It is bug. In older versions bot loads only rest of pages and then stop.
Now there is *infinite loop*.
- restore is useful for dump where is the last article present.
----------------------------------------------------------------------
Comment By: xqt (xqt)
Date: 2009-07-24 08:24
Message:
It's not a bug. Use -restore instead of the -continue option and it will
stop.
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=282599…