Getting 60 pages from wiktionary:fr...
[*1 CPU is at 100%]
[*WiFi manually disconnected: network down]
[*Around 1 minute elapses till the next console log]
No changes needed on page [[*****]]
No changes needed on page [[*****]]
No changes needed on page [[*****]]
Getting 15 pages from wiktionary:ro...
ERROR: URLError: [Errno 11001] getaddrinfo failed>
WARNING: Could not open 'http://ro.wiktionary.org/w/index.php?title=Special:Expo
rt&useskin=monobook'. Maybe the server or your connection is down. Retrying in 1 minutes...
[*1 CPU occupation drops to 0% while waiting for retry]

As it is visible in this log, it's the processing of pages that is taking too long and occupying too much CPU. Because I disconnected the WiFi, the bot was not in the process of page retrieval anymore. So what makes it take so long to process whatever it has to process after those 60 pages are retrieved? BTW, I had set -array:50 to make sure not too many pages are being processed.
Also, this only happens after running for a couple of hours. In the beginning of the bot run, these symptoms do not exist. Memory occupation seems normal at all times.


[bugs:#1646] interwiki.py CPU occupation too high (hogging)

Status: open
Created: Wed Jul 24, 2013 02:20 PM UTC by André Malafaya Baptista
Last Updated: Fri Aug 16, 2013 11:26 AM UTC
Owner: nobody

Since some time ago (I think less than 2 months), running interwiki.py hogs one CPU after running for some time. As I have four CPUs, I get a constant 25% CPU occupation by python.exe. The console output gets also sluggish.

Example:

interwiki.py -lang:nl -family:wiktionary -auto -async -cleanup -pt:1 -start:Category:!

Pywikipedia trunk/pywikipedia/ (r11780, 2013/07/20, 11:39:29, ok)
Python 2.7.2 (default, Jun 12 2011, 14:24:46) [MSC v.1500 64 bit (AMD64)]
config-settings:
use_api = True
use_api_login = True


Sent from sourceforge.net because Pywikipedia-bugs@lists.wikimedia.org is subscribed to https://sourceforge.net/p/pywikipediabot/bugs/

To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/pywikipediabot/admin/bugs/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.