Feature Requests item #1993062, was opened at 2008-06-13 16:47 Message generated for change (Settings changed) made by melancholie You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1993062...
Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: interwiki Group: None Status: Open
Priority: 7
Private: No Submitted By: Melancholie (melancholie) Assigned to: Nobody/Anonymous (nobody) Summary: Use API module parse for retrieving interwiki links
Initial Comment: Currently pages are retrieved in a batch by using Special:Export. Although being fast (as only one request is done), there is a huge data overhead with this method!
Why not use the API with its 'parse' module? Only interwiki links can be fetched with that, reducing traffic (overhead) a lot!
See: http://de.wikipedia.org/w/api.php?action=parse&format=xml&page=Test&...
Outputs could be downloaded in parallel to virtualize a batch (faster).
---- At least make this method optional (config.py) for being able of reducing data traffic, if wanted. API is just more efficient.
----------------------------------------------------------------------
You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1993062...