Bugs item #3607815, was opened at 2013-03-12 14:11 Message generated for change (Comment added) made by binbot You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=603138&aid=3607815...
Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: https://www.google.com/accounts () Assigned to: Nobody/Anonymous (nobody) Summary: Optimize multiple replace.py instances
Initial Comment: Since the biggest part of the time and I/O resources, in a replace.py based on a xml file, is reading and uncompressing the xml file itself. When this action is done, execute 10 or 50 regex is very similar. I think that the replace.py should be able to apply more -fix in the same time. Also, if the -save parameter is active, each -fix should go in a different file.
I think that passing an array to -fix and -save to be able to do this. This implementation is only an idea, obviusly ;)
python version.py Pywikipedia trunk/pywikipedia/ (r11200, 2013/03/09, 11:43:52, OUTDATED) Python 2.7.3 (default, Sep 26 2012, 21:51:14) [GCC 4.7.2] config-settings: use_api = True use_api_login = True unicode test: ok
----------------------------------------------------------------------
Comment By: BinĂ¡ris (binbot)
Date: 2013-03-15 00:56
Message: The idea is good, but much more command-line parameters are involved, practically each except -xml and -xmlstart. Do you have any measurements on time consumption?
----------------------------------------------------------------------
You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=603138&aid=3607815...
pywikipedia-bugs@lists.wikimedia.org