I have posted to this mailing list in January with a library that I
wanted to contribute to the codebase. This is part of an effort on my
side to refactor code that accumulated over various bot-operator tasks
and make it available to the community. The main part of the code
deals with spellchecking using hunspell
(http://hunspell.sourceforge.net/) instead of the list-based approach
currently used in spellcheck.py. The second part is an interactive
robot to do revision control (Sichten) in the german wikipedia. There
are some api functions that use the "undo" functions of the
action=edit command and an api function that uses the action=review
So I wanted to ask whether somebody had time to have a look at the
code I submitted here
(I uploaded a new file "(moved testSamples)" please us this to test,
the other one seems corrupt and cannot be deleted any more as well).
Thus, is there a code-review process that I can undergo or what do you
suggest is the best way to get the code into trunk (if at all?). Would
it be easier if I talked directly to one of you?
What are the criteria to get SVN commit access -- I was just wondering
what the general rules are.
-----BEGIN PGP SIGNED MESSAGE-----
I got this error (from trunk):
Traceback (most recent call last):
File "../pywikipedia/subster.py", line 251, in subContent
(substed_content, tags) = self.subTemplate(substed_content, item)
File "../pywikipedia/subster.py", line 345, in subTemplate
external_buffer = http.request(self.site, param['url'],
no_hostname = True)
File "../pywikipedia/pywikibot/comms/http.py", line 125, in request
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position
42: ordinal not in range(128)
and was able to trace the code line 125 (http.py):
back to "revision 7591 by alexsh, Wed Nov 4 13:22:17 2009 UTC"
line 5508, please confer . Now this code snipplet is quite old and
I doubt someone is able to remember any details... ;)
...BUT what about the error I got? Does anybody experience the same?
I think it is related to network timeouts (e.g. servers not reachable)
and thus hard to reproduce...
I was using Python 2.7.3:
Python 2.7.3 (default, May 29 2012, 14:54:22)
[GCC 4.6.3 20120306 (Red Hat 4.6.3-2)] on linux2
Greetings and thanks for any hints!
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.12 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
-----END PGP SIGNATURE-----
I was working on the rewrite branch last week (implementing xmlreader, additional retrieving of special pages, hooking cosmetic_changes with new functionality and some other stuff)
At least I am a bit frustrated: I guess nobody uses it as I found some blocker in the code and mainly it runs very very slow. The time consumption is more than 10 times of the trunk release. Maybe the cacherequest is still unfinished but I cannot follow the site request since -verbose gives no additional output about that. Is there another switch?
As I am not able to keep both releases on the same level an merge it vice versa, I would like to ask the next steps and how to proceed in that manner.
btw I created a matrix for python version / pwb release at http://www.mediawiki.org/wiki/Manual:Pywikipediabot/Overview. Because of the unicode bug (which is triggered by py 2.5/2.6/2.71) it is strictly recommended to use python 2.7.2 or 2.7.2 for the trunk. I guess that bug does not affect the rewrite. But we need 2.6 for some method decorators.
And at last: thanks Merlijn for the pwb.py shortener.
I'm seek interlanguage link edits that were undertaken by users who are no bots, these users for
interlanguage link edits would usually be active before bots come in to do their work
maybe it is a good path to look at the summaries of the first interwiki bot edits per entry and take what
happened before that point in time?
if yes, it should be great if anyone on this list could run such a search for me,
am ready to provide details about what is needed,
thanks & cheers,
I generated a toolserver family using generate_family_files.py and then set
the protocol to https (http doesnt work) yet it is not possible to log in.
can someone else talk a look and figure out what Im doing wrong?
If someone feels like participating:
---------- Forwarded message ----------
From: Sumana Harihareswara <sumanah(a)wikimedia.org>
Subject: PyCon Finland
PyCon Finland (Oct 22-23) is looking for talks. Deadline 14 Aug 2012.
I suggest that you submit a talk about pywikipedia, explaining to people
how crucial it is to the health of a resource they depend on every day!
https://lwn.net/Articles/501799/ has more info.
Please forward to the pywikipedia-l list. Thanks.
Engineering Community Manager
> Revision: 9909
> Author: drtrigon
> Date: 2012-02-17 23:09:15 +0000 (Fri, 17 Feb 2012)
> Log Message:
> follow-up to r9905 '-debug' replaced with new '-simulate' that allows
> switching off write access
Trigon, this is great, thank you! I have just tried it and works fine and
helps in testing a lot.
Several scripts use their own mechanism for that with "-dry" or "-test" or
something; we should revise these if they are still neccessary or may be
removed. (As far as I understand -simulate works with API query. What
happens if somebody uses the bot without API?)