Feature Requests item #1262584, was opened at 2005-08-17 22:10
Message generated for change (Settings changed) made by purodha
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1262584&group_…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: Interface Improvements
Group: None
>Status: Closed
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
>Assigned to: Purodha B Blissenbach (purodha)
Summary: Special:Random
Initial Comment:
It would be nice to get a Page - constructor that reads
a random page with Special:Random - it's very useful
for sampling pages!
----------------------------------------------------------------------
>Comment By: Purodha B Blissenbach (purodha)
Date: 2009-02-13 13:01
Message:
Solved with Revision 6346
----------------------------------------------------------------------
Comment By: Leonardo Gregianin (leogregianin)
Date: 2007-07-04 11:55
Message:
Logged In: YES
user_id=1136737
Originator: NO
what they think of a randon page generator?
----------------------------------------------------------------------
Comment By: siebrand (siebrand)
Date: 2007-04-26 19:27
Message:
Logged In: YES
user_id=1107255
Originator: NO
Please let us know if this feature request is still applicable to the
current code. If no response is given, the feature request will be denied
and the issue will be closed one month from now. This message was added in
an effort to reduce the number of open issues on this project. Siebrand
----------------------------------------------------------------------
Comment By: Lenny (lenny222)
Date: 2005-12-17 13:37
Message:
Logged In: YES
user_id=1233097
That would be interesting for me as well.
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1262584&group_…
Feature Requests item #1911836, was opened at 2008-03-11 12:54
Message generated for change (Comment added) made by purodha
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1911836&group_…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: interwiki
Group: None
>Status: Closed
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
Assigned to: Purodha B Blissenbach (purodha)
Summary: -end: option for interwiki.py
Initial Comment:
can you please add -end: option to complement -start: option in interwiki.py to work, say, on numeric pages or on a specific letter only?
----------------------------------------------------------------------
>Comment By: Purodha B Blissenbach (purodha)
Date: 2009-02-12 16:24
Message:
Done with Revision 6344.
----------------------------------------------------------------------
Comment By: Purodha B Blissenbach (purodha)
Date: 2009-02-12 13:10
Message:
There is already a similar option -number:#, which specifies that the robot
should process that amount of pages and then stop. This is only useful in
combination with -start. The default is not to stop.
I suggest to name the new option -until:page and make the robot stop when
it has passed this page in the alphabet.
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1911836&group_…
Revision: 6343
Author: purodha
Date: 2009-02-12 16:20:23 +0000 (Thu, 12 Feb 2009)
Log Message:
-----------
Few better messages and comments.
Modified Paths:
--------------
trunk/pywikipedia/pagegenerators.py
trunk/pywikipedia/redirect.py
Modified: trunk/pywikipedia/pagegenerators.py
===================================================================
--- trunk/pywikipedia/pagegenerators.py 2009-02-12 15:51:12 UTC (rev 6342)
+++ trunk/pywikipedia/pagegenerators.py 2009-02-12 16:20:23 UTC (rev 6343)
@@ -692,8 +692,8 @@
def RegexFilterPageGenerator(generator, regex):
"""
- Wraps around another generator. Yields only thos pages, which titles are
- positively matched to regex.
+ Wraps around another generator. Yields only those pages, the titles of
+ which are positively matched to regex.
"""
reg = re.compile(regex, re.I)
@@ -798,7 +798,7 @@
class GeneratorFactory:
"""
This factory is responsible for processing command line arguments
- that are used by many scripts and that determine on which pages
+ that are used by many scripts and that determine which pages
to work on.
"""
def __init__(self):
Modified: trunk/pywikipedia/redirect.py
===================================================================
--- trunk/pywikipedia/redirect.py 2009-02-12 15:51:12 UTC (rev 6342)
+++ trunk/pywikipedia/redirect.py 2009-02-12 16:20:23 UTC (rev 6343)
@@ -66,7 +66,7 @@
'ka': u'რობოტი: ორმაგი გადამისამართების გასწორება',
'ko': u'로봇: 이중 넘겨주기 수정',
'kk': u'Бот: Шынжырлы айдатуды түзетті',
- 'ksh':u'Bot: Dubbel Ömlëijdong fottjemaat',
+ 'ksh':u'Bot: [[special:doubleredirects|Dubbel Ömlëijdong]] fottjemaat',
'lb': u'Bot: Duebel Viruleedung gefléckt',
'lt': u'robotas: Taisomas dvigubas peradresavimas',
'nds':u'Bot: Dubbelte Wiederleiden rutmakt',
Feature Requests item #1911836, was opened at 2008-03-11 12:54
Message generated for change (Settings changed) made by purodha
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1911836&group_…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
>Category: interwiki
Group: None
>Status: Pending
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
>Assigned to: Purodha B Blissenbach (purodha)
Summary: -end: option for interwiki.py
Initial Comment:
can you please add -end: option to complement -start: option in interwiki.py to work, say, on numeric pages or on a specific letter only?
----------------------------------------------------------------------
>Comment By: Purodha B Blissenbach (purodha)
Date: 2009-02-12 13:10
Message:
There is already a similar option -number:#, which specifies that the robot
should process that amount of pages and then stop. This is only useful in
combination with -start. The default is not to stop.
I suggest to name the new option -until:page and make the robot stop when
it has passed this page in the alphabet.
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1911836&group_…
Feature Requests item #1912001, was opened at 2008-03-11 16:29
Message generated for change (Comment added) made by purodha
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1912001&group_…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: interwiki
Group: None
>Status: Pending
Priority: 5
Private: No
Submitted By: Marc-Etienne Vargenau (vargenau)
>Assigned to: Purodha B Blissenbach (purodha)
Summary: Add a new option to interwiki.py
Initial Comment:
Hello,
When working on years or birth categories for instance, I give a huge number of hints.
And I now these hint are right (when they exist).
If there is a wrong link somewhere (e.g. en:category:2000_deaths linking to fr:category:naissance_en_2000), I get a huge number of questions to which I always answer "1" (because the hint is proposed first).
So I propose to create a new option "-hintisright" (or something).
In that case, if there is a conflict between a hint and other page(s), the hint is selected automatically.
----------------------------------------------------------------------
>Comment By: Purodha B Blissenbach (purodha)
Date: 2009-02-12 12:50
Message:
I gave it a try with -hintsareright
If used, and a hint to an existing page is given, any interlanguage links
to the site of that hint are ignored, and a note "extra interwiki on hinted
site ignored" with the appropriate names is output to the console.
Hints can be given on the command line, in a file, etc. and interactively.
Before interactive hints are knows, i.e. concerning links in the starting
page, -hintsareright has no effect.
It also does not work on hints to sites where there are no pages found. If
you need that, it is an extra feature which, I believe, is a bit more
complicated to implement.
Not using -hintsareright should not make a difference to what we had
before, so I just submitted the change to svn as r 6341 for you to try it
out. Let me know if it fits your needs.
If so, this bug can be closed.
----------------------------------------------------------------------
Comment By: Marc-Etienne Vargenau (vargenau)
Date: 2009-02-03 13:36
Message:
Hello,
Thank you for your comment.
But I do not think the -localright parameter does what I need.
I do not want to modify the local page because I do not want to add links
that do not exist (yet).
I am 100% sure the hint "fr:category:décès_en_2000" (for
"en:category:2000_deaths") is right if it exists, but it might not exist
yet.
Since the code for -localright exists, maybe the code for -hintisright can
be easily implemented.
Best regards,
----------------------------------------------------------------------
Comment By: Purodha B Blissenbach (purodha)
Date: 2009-01-23 17:34
Message:
Hint: You can also solve the problem by inserting all the hints in the
starting page, and use the -localright parameter. If you are uncertain that
you got all, use -confirm and you will be able to interrupt the bot when it
comes accross a missing link which it tries to delete.
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1912001&group_…
Revision: 6341
Author: purodha
Date: 2009-02-12 12:45:09 +0000 (Thu, 12 Feb 2009)
Log Message:
-----------
Add -hintsareright commandline parameter that exclude interwiki links from being
followed and processed that go to sites for which hints on existing pages are
given via any of the ways the command line has to offer, or interactively.
This, hopefully, solves an issue raised by vargenau as tracker item 1912001, see:
https://sourceforge.net/tracker2/index.php?func=detail&aid=1912001&group_id…
Modified Paths:
--------------
trunk/pywikipedia/interwiki.py
Modified: trunk/pywikipedia/interwiki.py
===================================================================
--- trunk/pywikipedia/interwiki.py 2009-02-10 16:02:13 UTC (rev 6340)
+++ trunk/pywikipedia/interwiki.py 2009-02-12 12:45:09 UTC (rev 6341)
@@ -185,6 +185,15 @@
you are sure you have first gotten the interwiki on the
starting page exactly right).
(note: without ending colon)
+
+ -hintsareright do not follow interwiki links to sites for which hints
+ on existing pages are given. Note that, hints given
+ interactively, via the -askhint command line option,
+ are only effective once they have been entered, thus
+ interwiki links on the starting page are followed
+ regardess of hints given when prompted.
+ (Warning! Should be used with caution!)
+ (note: without ending colon)
-back only work on pages that have no backlink from any other
language; if a backlink is found, all work on the page
@@ -485,6 +494,7 @@
followinterwiki = True
minsubjects = config.interwiki_min_subjects
nobackonly = False
+ hintsareright = False
class Subject(object):
"""
@@ -511,7 +521,10 @@
# This is a list of all pages that are currently scheduled for
# download.
self.pending = []
- self.translate(hints)
+ if globalvar.hintsareright:
+ # This is a set of sites that we got hits to
+ self.hintedsites = set()
+ self.translate(hints, globalvar.hintsareright)
self.confirm = globalvar.confirm
self.problemfound = False
self.untranslated = None
@@ -558,7 +571,7 @@
return page
return None
- def translate(self, hints = None):
+ def translate(self, hints = None, keephintedsites = False):
"""Add the given translation hints to the todo list"""
if globalvar.same:
if hints:
@@ -573,6 +586,8 @@
for page in pages:
self.todo.append(page)
self.foundIn[page] = [None]
+ if keephintedsites:
+ self.hintedsites.add(page.site)
def openSites(self, allowdoubles = False):
"""Return a list of sites for all things we still need to do"""
@@ -781,6 +796,8 @@
= globalvar.hintnobracket)
for page in pages:
self.addIfNew(page, counter, None)
+ if globalvar.hintsareright:
+ self.hintedsites.add(page.site)
def workDone(self, counter):
"""
@@ -888,6 +905,10 @@
self.done.remove(page)
iw = ()
for linkedPage in iw:
+ if globalvar.hintsareright:
+ if linkedPage.site in self.hintedsites:
+ wikipedia.output(u"NOTE: %s: %s extra interwiki on hinted site ignored %s" % (self.originPage.aslink(), page.aslink(True), linkedPage.aslink(True)))
+ break
if not (self.isIgnored(linkedPage) or self.namespaceMismatch(page, linkedPage, counter) or self.wiktionaryMismatch(linkedPage)):
if globalvar.followinterwiki or page == self.originPage:
if self.addIfNew(linkedPage, counter, page):
@@ -1738,6 +1759,8 @@
globalvar.bracketonly = True
elif arg == '-localright':
globalvar.followinterwiki = False
+ elif arg == '-hintsareright':
+ globalvar.hintsareright = True
elif arg.startswith('-array:'):
globalvar.minsubjects = int(arg[7:])
elif arg.startswith('-query:'):
Bugs item #2484627, was opened at 2009-01-03 22:54
Message generated for change (Comment added) made by nobody
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=2484627&group_…
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: interwiki
Group: None
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
Assigned to: Nobody/Anonymous (nobody)
Summary: removing sr interwiki
Initial Comment:
There is problem with sr.wikipedia - latin characters in URL or in interwiki are automatically translated to cyrilic, but bot does not recognize it ant wants to remove this valid interwiki
see
http://cs.wikipedia.org/w/index.php?title=Nikdo_v%C3%A1s_nesm%C3%AD_b%C3%AD…
----------------------------------------------------------------------
Comment By: Nobody/Anonymous (nobody)
Date: 2009-02-12 08:28
Message:
I've found the same problem here, but it could solve it founding a cyrilic
link. Otherwise the sr-link would be deleted:
[[Dušan Simović]]: [[de:Dušan Simović]] gives new interwiki
[[no:Dušan Simović]]
[[Dušan Simović]]: [[de:Dušan Simović]] gives new interwiki
[[sr:Душан Симовић]]
[[Dušan Simović]]: [[de:Dušan Simović]] gives new interwiki
[[en:Dušan Simović]]
[[Dušan Simović]]: [[de:Dušan Simović]] gives new interwiki
[[fr:Dušan Simović]]
[[Dušan Simović]]: [[de:Dušan Simović]] gives new interwiki
[[pl:Dušan Simović]]
Getting 1 pages from wikipedia:no...
NOTE: [[Dušan Simović]]: [[no:Dušan Simović]] gives duplicate
interwiki on same site [[de:Dušan Simović]]
Getting 1 pages from wikipedia:de...
Getting 1 pages from wikipedia:sr...
Getting 1 pages from wikipedia:en...
NOTE: [[Dušan Simović]]: [[en:Dušan Simović]] gives duplicate
interwiki on same site [[sr:Dušan Simović]]
Getting 1 pages from wikipedia:sr...
NOTE: [[sr:Dušan Simović]] does not exist
----------------------------------------------------------------------
Comment By: Nobody/Anonymous (nobody)
Date: 2009-01-27 06:25
Message:
See also this edit:
http://cs.wikipedia.org/w/index.php?title=Vidovdansk%C3%A1_%C3%BAstava&diff…
interwiki.py types:
Getting 1 pages from wikipedia:sr...
NOTE: [[sr:Vidovdanski ustav]] does not exist
=====Post-processing [[Videovdanská ústava]]=====
WARNING: [[sr:Vidovdanski ustav]] is either deleted or has mismatching
disambiguation state
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603138&aid=2484627&group_…