[Pywikipedia-l] [ pywikipediabot-Feature Requests-1993062 ] Use API module 'parse' for retrieving interwiki links

SourceForge.net noreply at sourceforge.net
Thu Nov 13 11:46:38 UTC 2008


Feature Requests item #1993062, was opened at 2008-06-13 16:47
Message generated for change (Comment added) made by multichill
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1993062&group_id=93107

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: interwiki
Group: None
Status: Open
Priority: 7
Private: No
Submitted By: Melancholie (melancholie)
Assigned to: Nobody/Anonymous (nobody)
Summary: Use API module 'parse' for retrieving interwiki links

Initial Comment:
Currently pages are retrieved in a batch by using Special:Export.
Although being fast (as only one request is done), there is a huge data overhead with this method!

Why not use the API with its 'parse' module? Only interwiki links can be fetched with that, reducing traffic (overhead) a lot!

See:
http://de.wikipedia.org/w/api.php?action=parse&format=xml&page=Test&prop=langlinks

Outputs could be downloaded in parallel to virtualize a batch (faster).

----
At least make this method optional (config.py) for being able of reducing data traffic, if wanted. API is just more efficient.

----------------------------------------------------------------------

>Comment By: Multichill (multichill)
Date: 2008-11-13 12:46

Message:
We are working on a rewrite. The rewrite uses the api as much as possible.

----------------------------------------------------------------------

Comment By: Melancholie (melancholie)
Date: 2008-06-15 01:27

Message:
Logged In: YES 
user_id=2089773
Originator: YES

See http://meta.wikimedia.org/wiki/Interwiki_bot_access_protocol
concerning disambiguations and redirects:

http://de.wikipedia.org/w/api.php?action=parse&format=xml&text={{:Main_Page}}{{:Bot}}&prop=langlinks|templates

----------------------------------------------------------------------

Comment By: Melancholie (melancholie)
Date: 2008-06-14 16:38

Message:
Logged In: YES 
user_id=2089773
Originator: YES

Backwards compatibility?

That's no reason for not making software more efficient, where possible
;-)
That's also why I wrote something about "optional", too.
Because for current MediaWiki wikis there is a much more efficient way of
retrieving (only) certain contents (langlinks, categories), there should be
a method of using that advantage! Will reduce load (bot owner's and
server's)...

----------------------------------------------------------------------

Comment By: Bryan (btongminh)
Date: 2008-06-13 20:44

Message:
Logged In: YES 
user_id=1806226
Originator: NO

Backwards compatibility with non Wikimedia wikis?

----------------------------------------------------------------------

Comment By: Melancholie (melancholie)
Date: 2008-06-13 17:20

Message:
Logged In: YES 
user_id=2089773
Originator: YES

For not being misusable of confusing bots, the yet to be set up MediaWiki
message could contain [[foreigncode:{{CURRENTTIMESTAMP}}]] (cache issue?)

(sorry for spamming with this request ;-)

----------------------------------------------------------------------

Comment By: Melancholie (melancholie)
Date: 2008-06-13 17:08

Message:
Logged In: YES 
user_id=2089773
Originator: YES

Important note for getting pages' interwikis in a batch:
http://de.wikipedia.org/w/api.php?action=parse&text={{:Test}}{{:Bot}}{{:Haus}}&prop=langlinks

Either the bot could figure out what interwikis belong together then, or

maybe a marker could placed in between:
http://de.wikipedia.org/w/api.php?action=parse&text={{:Test}}{{MediaWiki:Iwmarker}}{{:Bot}}{{MediaWiki:Iwmarker}}{{:Haus}}&prop=langlinks

[[MediaWiki:Iwmarker]] (or 'Llmarker'?) would have to be set up by the
MediaWiki developers with [[en:/de:Abuse-save-mark]] as content (but this
is potentially misusable).

----------------------------------------------------------------------

Comment By: Melancholie (melancholie)
Date: 2008-06-13 16:51

Message:
Logged In: YES 
user_id=2089773
Originator: YES

Note: Maybe combine it with 'generator'.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1993062&group_id=93107



More information about the Pywikipedia-l mailing list