There are strange people who make such links (kindof urlencoded?):
[[Második világháború#Partrasz.C3.A1ll.C3.A1s Szic.C3.ADli.C3.A1ban
.28Huskey hadm.C5.B1velet.29|Huskey hadműveletben]]
So the section title must have been copied from the URL.
Do we have a ready tool to fix these?
>From one of my assignments as a bot operator I have some code which
does template parsing and general text parsing (e.g. Image/File tags).
It is not using regex and thus able to correctly parse nested
templates and other such nasty things. I have written those as library
classes and written tests for them which cover almost all of the code.
I would now really like to contribute that code back to the community.
Would you be interested in adding this code to the pywikibot
framework? If yes, can I send the code to someone for code review or
how do you usually operate?
PS: wiki userpage is http://en.wikipedia.org/wiki/User:Hannes_R%C3%B6st
I have posted to this mailing list in January with a library that I
wanted to contribute to the codebase. This is part of an effort on my
side to refactor code that accumulated over various bot-operator tasks
and make it available to the community. The main part of the code
deals with spellchecking using hunspell
(http://hunspell.sourceforge.net/) instead of the list-based approach
currently used in spellcheck.py. The second part is an interactive
robot to do revision control (Sichten) in the german wikipedia. There
are some api functions that use the "undo" functions of the
action=edit command and an api function that uses the action=review
So I wanted to ask whether somebody had time to have a look at the
code I submitted here
(I uploaded a new file "(moved testSamples)" please us this to test,
the other one seems corrupt and cannot be deleted any more as well).
Thus, is there a code-review process that I can undergo or what do you
suggest is the best way to get the code into trunk (if at all?). Would
it be easier if I talked directly to one of you?
What are the criteria to get SVN commit access -- I was just wondering
what the general rules are.
-----BEGIN PGP SIGNED MESSAGE-----
I got this error (from trunk):
Traceback (most recent call last):
File "../pywikipedia/subster.py", line 251, in subContent
(substed_content, tags) = self.subTemplate(substed_content, item)
File "../pywikipedia/subster.py", line 345, in subTemplate
external_buffer = http.request(self.site, param['url'],
no_hostname = True)
File "../pywikipedia/pywikibot/comms/http.py", line 125, in request
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position
42: ordinal not in range(128)
and was able to trace the code line 125 (http.py):
back to "revision 7591 by alexsh, Wed Nov 4 13:22:17 2009 UTC"
line 5508, please confer . Now this code snipplet is quite old and
I doubt someone is able to remember any details... ;)
...BUT what about the error I got? Does anybody experience the same?
I think it is related to network timeouts (e.g. servers not reachable)
and thus hard to reproduce...
I was using Python 2.7.3:
Python 2.7.3 (default, May 29 2012, 14:54:22)
[GCC 4.6.3 20120306 (Red Hat 4.6.3-2)] on linux2
Greetings and thanks for any hints!
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.12 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
-----END PGP SIGNATURE-----
I was working on the rewrite branch last week (implementing xmlreader, additional retrieving of special pages, hooking cosmetic_changes with new functionality and some other stuff)
At least I am a bit frustrated: I guess nobody uses it as I found some blocker in the code and mainly it runs very very slow. The time consumption is more than 10 times of the trunk release. Maybe the cacherequest is still unfinished but I cannot follow the site request since -verbose gives no additional output about that. Is there another switch?
As I am not able to keep both releases on the same level an merge it vice versa, I would like to ask the next steps and how to proceed in that manner.
btw I created a matrix for python version / pwb release at http://www.mediawiki.org/wiki/Manual:Pywikipediabot/Overview. Because of the unicode bug (which is triggered by py 2.5/2.6/2.71) it is strictly recommended to use python 2.7.2 or 2.7.2 for the trunk. I guess that bug does not affect the rewrite. But we need 2.6 for some method decorators.
And at last: thanks Merlijn for the pwb.py shortener.
I'm seek interlanguage link edits that were undertaken by users who are no bots, these users for
interlanguage link edits would usually be active before bots come in to do their work
maybe it is a good path to look at the summaries of the first interwiki bot edits per entry and take what
happened before that point in time?
if yes, it should be great if anyone on this list could run such a search for me,
am ready to provide details about what is needed,
thanks & cheers,
I generated a toolserver family using generate_family_files.py and then set
the protocol to https (http doesnt work) yet it is not possible to log in.
can someone else talk a look and figure out what Im doing wrong?