Hello, I used to end the interwiki bot's daily run by pressing Ctrl+C.
This shortcut stopped the bot and created an interwikidump-wikipedia-xx.txt
file storing the last unfinished pages, so I could continue the next
day. The message was "Interwiki dump created".
But several days ago, after some updates, the shortcut stopped working as
usual: it now shows the message "Keyboard interrupt" and the bot exits
without creating a dump.
How can I create the dump now?
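
For reference, the behaviour I expect can be sketched roughly as below. This
is only an illustration of catching Ctrl+C and saving the unfinished titles;
the function name run_with_dump, the process callback and the one-link-per-line
file layout are my assumptions, not the actual interwiki.py code.

```python
def run_with_dump(pages, process, dump_path):
    """Process titles in order; on Ctrl+C, write the unfinished ones to a file.

    Hypothetical helper: 'process' is any callable taking one page title.
    Returns the list of titles that were not finished.
    """
    remaining = list(pages)
    try:
        while remaining:
            process(remaining[0])
            # Only drop a title once it has been processed completely.
            remaining.pop(0)
    except KeyboardInterrupt:
        # Assumed dump layout: one wiki link per line.
        with open(dump_path, "w", encoding="utf-8") as dump:
            for title in remaining:
                dump.write("[[%s]]\n" % title)
        print("Dump created: %s" % dump_path)
    return remaining
```

The page that was being worked on when the interrupt arrived stays in the
dump, so it is retried on the next run.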
JAn
--
Using Opera's revolutionary e-mail client: http://www.opera.com/mail/
One more request: please update:
disambiguations in cs:
{{rozcestník - příjmení}} instead of {{rozcestník - Příjmení}}
{{rozcestník - kostel}}
{{rozcestník - sakrální stavba}}
articles about numbers in both cs: and sk: use the format
1 (číslo)
100 (číslo)
etc.
JAn
Hello, I found a possible bug:
Getting 2 pages from wikipedia:ro...
[[ro:Wikipedia:Articole fructuoase]] is a redirect
[[ro:Wikipedia:Articole de calitate]] is a redirect
NOTE: [[ro:Wikipedia:Articole fructuoase]] is redirect to [[ro:Articole de
calitate]]
NOTE: Ignoring link from page [[cs:Wikipedie:Nejlepší články]] in
namespace 4 to page [[ro:Articole de calitate]] in namespace 0 because
page [[ro:Wikipedia:Articole fructuoase]] in the correct namespace has
already been found.
NOTE: [[ro:Wikipedia:Articole de calitate]] is redirect to [[ro:Articole
de calitate]]
NOTE: Ignoring link from page [[cs:Wikipedie:Nejlepší články]] in
namespace 4 to page [[ro:Articole de calitate]] in namespace 0 because
page [[ro:Wikipedia:Articole fructuoase]] in the correct namespace has
already been found.
The result is that a correct interwiki link to another namespace was deleted:
ERROR: Found incorrect link to ro in [[no:Wikipedia:Utmerkede artikler]]
JAn
When I use the -hint:all option, the bot tries to find the page in all the
old languages, but not in the "new" ones added since autumn. These
languages include e.g. ru-sib, hsb, cu...
I think a language list somewhere is incomplete.
JAn
A new version of the Python Wikipediabot framework has been released and can
be downloaded at
http://sourceforge.net/project/showfiles.php?group_id=93107
Everyone who does not update from CVS is advised to download it. Note
that this release does not include the wordlists; those who want to use
spellcheck.py should download a wordlist from
http://pywikipediabot.cvs.sourceforge.net/pywikipediabot/pywikipedia/spelli…
and Dutch wordlists are available, the French one is only small).
Using the most recent version of Python (2.5) with the bot is advised.
The main changes since the previous release are:
new bots:
* clean_sandbox.py: empties the Sandbox (except for what should not be
removed)
* disambredir.py: goes through disambiguation pages to (ask whether to)
change redirect links on these pages
* inline_images.py: Searches for and removes images that are linked inline
* isbn.py: Converts ISBN-10 to ISBN-13
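
As an illustration of the ISBN-10 to ISBN-13 conversion mentioned above, here
is a minimal sketch of the standard arithmetic: prefix 978, keep the first
nine digits, and recompute the check digit with alternating weights 1 and 3.
The function name is hypothetical and this is not the code of isbn.py itself.

```python
def isbn10_to_isbn13(isbn10):
    """Convert an ISBN-10 string (hyphens/spaces allowed) to ISBN-13."""
    digits = isbn10.replace("-", "").replace(" ", "")
    if len(digits) != 10:
        raise ValueError("expected 10 characters, got %r" % isbn10)
    # The old ISBN-10 check digit (last character, possibly 'X') is dropped.
    core = "978" + digits[:9]
    if not core.isdigit():
        raise ValueError("non-digit characters in %r" % isbn10)
    # Weights alternate 1, 3, 1, 3, ... over the twelve digits.
    total = sum(int(ch) * (3 if i % 2 else 1) for i, ch in enumerate(core))
    check = (10 - total % 10) % 10
    return core + str(check)


print(isbn10_to_isbn13("0-306-40615-2"))  # 9780306406157
```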
category.py:
* Mode 'listify' added: gets a list of the pages that are in a specific
category.
* category.py remove can have customized edit summaries using the -delsum
option
commons_link.py:
* Can also be used to find categories rather than galleries
* Puts its template above categories and interwiki instead of all at the
bottom
copyright.py:
* Various fixes and improvements
delete.py:
* More options to decide which pages to delete: -ref (pages linking to a
specific page), -page (a single page), -file (a file with the pagenames),
-images (all images on a specific page)
featured.py:
* Now puts the template before categories and interwikis instead of after
the interwiki link
* Gives its command line options when -help is used
image.py:
* New option -loose: replaces the image everywhere it occurs. This
means that no occurrences of the image are skipped, but also that there's
more risk of making errors.
interwiki.py:
* Quicker removal of links to different namespaces (they are removed when
there is a correct link or the option '-autonomous' is used)
* When the -ignore option is used, the page is also ignored if found as a
redirect
* It is now possible to combine -cat and -start to do a part of a category's
pages
* The -whenneeded option becomes slightly stricter (it does not change links
if the only problem was that there were links to be removed)
* Option -link renamed -links
movepages.py:
* Option -new to work on the new pages
* Options -from and -to to specify on the command line from and to which
title to move the page
pagefromfile.py:
* Option -notitle to not include the line containing the title in the page
to be created
replace.py:
* New options -allowoverlap and -recursive to change overlapping and
recursive occurrences
* New option -nocase to make all regexes case insensitive
* New option -summary for custom edit summaries
* Does not crash on meeting the spam filter, but reports the problem and goes
on with the next page
* replace.py -fix:interproject has been deprecated
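
To make the difference between a single pass, a recursive replacement and a
case-insensitive one concrete, here is a small sketch using Python's re
module. The helper names and the max_passes safety limit are assumptions made
for this example; it does not reproduce how replace.py implements -recursive
or -nocase internally.

```python
import re


def replace_once(text, pattern, repl, nocase=False):
    """One ordinary pass: non-overlapping matches, left to right."""
    flags = re.IGNORECASE if nocase else 0
    return re.sub(pattern, repl, text, flags=flags)


def replace_recursive(text, pattern, repl, nocase=False, max_passes=100):
    """Repeat the pass until the text stops changing (or the limit is hit)."""
    for _ in range(max_passes):
        new_text = replace_once(text, pattern, repl, nocase)
        if new_text == text:
            break
        text = new_text
    return text


# Collapsing double spaces: one pass leaves two spaces, recursion leaves one.
print(repr(replace_once("a    b", "  ", " ")))       # 'a  b'
print(repr(replace_recursive("a    b", "  ", " ")))  # 'a b'
print(replace_once("Foo foo", "foo", "bar", nocase=True))  # bar bar
```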
selflink.py:
* New option -xml to work from an XML dump
solve_disambiguation.py:
* The -primary option now works when the page is a disambiguation page (for
cases where [[X]] is a redirect to [[X (subject)]] with a disambiguation
page at [[X (disambiguation)]])
* More ignored pages for nl:
template.py:
* Now works on templates that have brackets in their name
touch.py:
* Does not do cosmetic changes, even when the bot normally would
upload.py:
* Is better (though possibly not yet perfect) at checking whether the upload
succeeded
weblinkchecker.py:
* Fakes being Firefox because some websites block unknown user agents
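
For readers unfamiliar with the technique, sending a browser-like user agent
amounts to setting one request header. The sketch below uses the standard
library of current Python; the user agent string and the function name are
placeholders chosen for the example, not what weblinkchecker.py actually
sends.

```python
from urllib.request import Request

# Placeholder value; any browser-like string would serve for the illustration.
BROWSER_USER_AGENT = (
    "Mozilla/5.0 (X11; Linux x86_64; rv:109.0) Gecko/20100101 Firefox/115.0"
)


def build_request(url, user_agent=BROWSER_USER_AGENT):
    """Build a request object that identifies itself with the given agent."""
    return Request(url, headers={"User-Agent": user_agent})


request = build_request("http://example.com/")
# urllib stores header names capitalized as "User-agent".
print(request.get_header("User-agent"))
```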
for programmers:
* Page objects now have a protect() method
* New method setUserAgent(). The same user agent is now always used.
* pagegenerators.py has a class CommandLineGenerator to automagically add
generator options to a bot
* Page.getReferences() gets a parameter
general:
* if a certain MediaWiki message is not found in the list the bot has, it
will first try reloading the messages before raising an error
* many bugfixes
* large code refactoring in interwiki.py, pagegenerators.py
* namespace names updated
* more localization of edit summaries
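
The "reload before raising an error" behaviour for MediaWiki messages
follows a common pattern, sketched below. The class MessageCache and its
loader callback are hypothetical names for this example; the framework's own
message handling is organised differently.

```python
class MessageCache(object):
    """Keeps a dict of messages and reloads it once on a cache miss."""

    def __init__(self, loader):
        # 'loader' is any callable returning a fresh {key: text} dict.
        self._loader = loader
        self._messages = loader()

    def get(self, key):
        if key not in self._messages:
            # The local list may be stale, so refresh it before giving up.
            self._messages = self._loader()
        if key not in self._messages:
            raise KeyError("MediaWiki message %r not found after reload" % key)
        return self._messages[key]
```

Only a second miss, after the reload, is treated as a real error.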
--
Andre Engels, andreengels(a)gmail.com
ICQ: 6260644 -- Skype: a_engels
There is something wrong: when I give a hint as ksh:User:xxx, the user page
is not found. The same happens with interwiki: there is Meedmacher:xxx, but
it should be Meetmacher:xxx; meedmacher doesn't work.
cs:User:JAn Dudík
Hello all,
I am running a bot on Tamil Wikipedia (ta.wikipedia.org). Every get() call
raises a wikipedia.NoPage error and returns no text.
For example,
    self.stattext = ""
    self.statpage = wikipedia.Page(site=self.site,
                                   title="User:Ganeshbot/Created2")
    try:
        self.stattext = self.statpage.get()
    except:
        print "Unexpected error:", sys.exc_info()[0]
        print "stat page does not exist"
The page User:Ganeshbot/Created2 does exist, but the code still ends up in
the except branch and reports NoPage.
I am using Windows XP sp2, IE7, Python 2.5. Please help.
Regards, Ganesh
There is a new version available for download of the Python Wikipediabot
framework. I would advise most people who use the framework on Wikimedia wikis
(and do not get it through CVS) to update, because the old version will
not be able to read the allpages special page correctly. Also, those who use
it on a Windows system are advised to download version 2.5 of Python (if
they are still using an older version), because colours are now working for
Windows, but only under Python 2.5. Downloading 2.5 is also advised for
anyone (on any system) who wants to use xml dumps with the bot.
To save space, the current distribution does not include the wordlists,
because they are very large files. Those who want to use
spellcheck.py should download them from
http://pywikipediabot.cvs.sourceforge.net/pywikipediabot/pywikipedia/spelli….
Wordlists exist for Dutch and English; wordlists in other
languages are still welcome.
Recent changes in the bot (from the last month; only the most important are
given):
==new bots==
* commonslink.py. This searches for a page with the same name on Commons,
and links to it. Currently only works on the English and Portuguese Wikipedias.
* delete.py. Deletes a group of pages.
==important bugfixes==
* Because of some changes in the wikimedia code, the bot wasn't able to read
[[Special:Allpages]] any more.
* Bot messed up categories or interwikis within <noinclude> tags (the > of
the noinclude was moved to after the categories/interwikis)
==user interface==
* More bots use colour highlighting
* Colours can now be used on Windows systems as well. This requires Python
2.5.
* Transliterated text is now coloured yellow; the stars to denote that
something has been transliterated are only used when using a Windows version
that does not support colours.
==interwiki==
* New option -subcat: When pages are found using -cat, pages from
subcategories are included as well.
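
What including subcategories means can be sketched as a recursive walk over
the category tree. The dictionary layout and the function name below are
assumptions made for this example; the real bot reads category members from
the wiki rather than from a dict.

```python
def pages_in_category(tree, category, recurse=False, seen=None):
    """Yield page titles of a category, optionally descending into subcats.

    'tree' maps a category name to a (pages, subcategories) pair.
    'seen' guards against categories that contain each other.
    """
    seen = set() if seen is None else seen
    if category in seen:
        return
    seen.add(category)
    pages, subcategories = tree.get(category, ([], []))
    for title in pages:
        yield title
    if recurse:
        for sub in subcategories:
            for title in pages_in_category(tree, sub, True, seen):
                yield title


tree = {
    "Rivers": (["Danube"], ["Rivers of Europe"]),
    "Rivers of Europe": (["Elbe", "Rhine"], ["Rivers"]),  # cycle on purpose
}
print(list(pages_in_category(tree, "Rivers")))
print(list(pages_in_category(tree, "Rivers", recurse=True)))
```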
==weblinkchecker==
* No longer hangs when hit by the spam filter
* Gives fewer false positives
==various==
* cosmetic_changes.py removes misplaced and extraneous spaces in
wiki-linking syntax
==aids for bot writers==
* in pagegenerators there is now PrefixingPageGenerator, yielding all pages
whose title starts with a given text.
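
The idea behind a prefix-based generator can be shown in a few lines. The
sketch below filters a plain list of titles and uses a made-up function name;
PrefixingPageGenerator itself works on Page objects fetched from the wiki and
its signature is not shown here.

```python
def titles_with_prefix(titles, prefix):
    """Lazily yield only those titles that start with the given text."""
    for title in titles:
        if title.startswith(prefix):
            yield title


all_titles = ["List of rivers", "List of lakes", "Lisbon", "River"]
for title in titles_with_prefix(all_titles, "List of"):
    print(title)
```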
--
Andre Engels, andreengels(a)gmail.com
ICQ: 6260644 -- Skype: a_engels
Users who run Pywikipediabot under Windows are advised to download the new
version of Python, version 2.5. The reason is that Daniel recently made
colours in the interface work under Windows as well, but this
only works with Python 2.5, not with older versions.
P.S.: I find his default colour worse than Windows' own. If
you agree with me, you can set it back by putting:
    defaultcolor = 7
in your user-config.py
--
Andre Engels, andreengels(a)gmail.com
ICQ: 6260644 -- Skype: a_engels