Bugs item #2825996, was opened at 2009-07-23 17:01 Message generated for change (Comment added) made by silvonen You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=603138&aid=2825996...
Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: JAn (jandudik) Assigned to: Nobody/Anonymous (nobody) Summary: neverending loading
Initial Comment: I run bot interwiki.py -force -whenneeded -continue -lang:na In dump was list of articles, every line unique
But bot still loads new pages, (NOTE: The first unfinished subject is [[na:Uetersen]] NOTE: Number of pages queued is 99, trying to add 60 more. )
so I interrupted it and in dump there were many links more than 1 time. (see attachments) after [[Żagań]] loaded [[Tarnów]]
----------------------------------------------------------------------
Comment By: Mikko Silvonen (silvonen) Date: 2009-07-29 20:16
Message: Bots now get stuck also when you use the -start parameter near the end of the wiki. I do think this is a bug.
Here's an example on a small wiki:
python version.py
Pywikipedia [http] trunk/pywikipedia (r7101, 2009/07/27, 15:54:13) Python 2.5.4 (r254:67916, Jan 29 2009, 12:02:11) [MSC v.1310 32 bit (Intel)]
pagegenerators.py -lang:fiu-vro -start:Z
Ähijärv Äksi kihlkund Ängä oja Äniniidü lump Ärnu jõgi Ärq ei lääq Ärqseletüs Äühvoja Õdagu-Viro maakund Õdagumeresoomõ keeleq Õdagumeri Õdri järv Õigustiidüs Õuraasia Õuro Õuruupa Õuruupa Liit Üleherküs Ülembjärv Ülemine järv Ülemäne jõgi Ülene internetitunnus Üräski oja Ütidse osaga hulgaq Ütidse osalda hulgaq Ütine osa Ütiskuningriik Üts ummamuudu liin Ütsik täht Üvvärjärv Üü tulõk Üübjärv Dông Hoi Swinoujscie Swietochlowice Zemaidi kiil
When I press Ctrl+C, I get this traceback.
Traceback (most recent call last): File "C:\svn\pywikipedia\pagegenerators.py", line 1138, in <module> for page in gen: File "C:\svn\pywikipedia\pagegenerators.py", line 738, in DuplicateFilterPageGenerator for page in generator: File "C:\svn\pywikipedia\pagegenerators.py", line 263, in AllpagesPageGenerator for page in site.allpages(start = start, namespace = namespace, includeredirects = includeredirects): File "c:\svn\pywikipedia\wikipedia.py", line 5681, in allpages get_throttle() File "c:\svn\pywikipedia\wikipedia.py", line 3373, in __call__ time.sleep(waittime) KeyboardInterrupt
----------------------------------------------------------------------
Comment By: Nobody/Anonymous (nobody) Date: 2009-07-24 10:14
Message: It is bug. In older versions bot loads only rest of pages and then stop. Now there is *infinite loop*. - restore is useful for dump where is the last article present.
----------------------------------------------------------------------
Comment By: xqt (xqt) Date: 2009-07-24 08:24
Message: It's not a bug. Use -restore instead of the -continue option and it will stop.
----------------------------------------------------------------------
You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=603138&aid=2825996...
pywikipedia-bugs@lists.wikimedia.org