Feature Requests item #1429447, was opened at 2006-02-11 00:12
Message generated for change (Comment added) made by xqt
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=603141&aid=1429447...
Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.
Category: None
Group: None
Status: Closed
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
Assigned to: Nobody/Anonymous (nobody)
Summary: "safe" replace.py
Initial Comment:
I may be approaching this problem incorrectly, but here goes.
[[en:wikt:User:Connel MacKenzie]]
On the English Wiktionary, I do considerable off-line analysis of the XML dumps. I'd like to be able to replace whole entries. But ONLY if they have not changed in the intervening time. (If any edits to the entry have been made since the last dump, skip that entry and reformat it next time.)
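The "only if unchanged" rule above could be sketched roughly as follows. This is a minimal illustration, not existing replace.py behaviour: `safe_replace`, `fetch` and `save` are hypothetical names, and the two callables stand in for whatever page read and write calls the bot framework provides.

```python
from typing import Callable


def safe_replace(title: str,
                 dump_text: str,
                 new_text: str,
                 fetch: Callable[[str], str],
                 save: Callable[[str, str], None]) -> bool:
    """Overwrite `title` with `new_text` only if the live page is unchanged.

    `fetch` and `save` are hypothetical callables (assumptions for this
    sketch) that read and write the live wikitext of a page.
    Returns True if the entry was handled, False if it was skipped.
    """
    live_text = fetch(title)
    if live_text != dump_text:
        # Someone edited the entry after the dump was taken: skip it and
        # leave it for the next dump cycle.
        return False
    if new_text != live_text:
        save(title, new_text)
    return True


# Usage with an in-memory dict standing in for the wiki.
wiki = {"apple": "old", "pear": "edited since dump"}
safe_replace("apple", "old", "new", wiki.__getitem__, wiki.__setitem__)
safe_replace("pear", "old", "new", wiki.__getitem__, wiki.__setitem__)
```

Comparing full text is the simplest check; comparing the dump's revision id or timestamp against the live one would work as well if both are available. Either way, an edit can still land between the fetch and the save, so a real bot would also need to handle edit conflicts reported by the server.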
The English Wiktionary has quite specific formatting rules (heading levels at ==two==, ===three===, ====four====, etc.; which headings are subordinate to others; ordering of headings; formatting of examples, etymology and inflection lines; valid parts of speech; valid "other" headers; translation header nesting; multiple etymology rearrangement; etc.) that do not lend themselves well to the current replacement robot code. (That is, how do you reorder whole sections? How do you correct section levels?)
I can format the input file for this bot in any format needed. I assume a structure similar to the XML dump would be best, with <text-to-replace> and <replace-text-with> sections.
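As an illustration of what such an input file could look like, here is a small sketch that reads one. Only the `<text-to-replace>` and `<replace-text-with>` element names come from the description above; the `<entries>` wrapper, the `<entry title="...">` element and the function name `read_replacements` are assumptions made up for this example.

```python
import xml.etree.ElementTree as ET
from typing import Iterator, Tuple


def read_replacements(xml_source: str) -> Iterator[Tuple[str, str, str]]:
    """Yield (title, text_to_replace, replacement_text) for each entry.

    Assumed layout (hypothetical, apart from the two inner element names):
      <entries>
        <entry title="...">
          <text-to-replace>...</text-to-replace>
          <replace-text-with>...</replace-text-with>
        </entry>
      </entries>
    """
    root = ET.fromstring(xml_source)
    for entry in root.iter("entry"):
        title = entry.get("title")
        old = entry.findtext("text-to-replace")
        new = entry.findtext("replace-text-with")
        if title is None or old is None or new is None:
            continue  # skip incomplete entries rather than guessing
        yield title, old, new


sample = (
    '<entries>'
    '<entry title="apple">'
    '<text-to-replace>old text</text-to-replace>'
    '<replace-text-with>new text</replace-text-with>'
    '</entry>'
    '</entries>'
)
for title, old, new in read_replacements(sample):
    print(title, repr(old), repr(new))
```

Wikitext stored this way has to be XML-escaped (`<`, `>` and `&` in particular), the same as in the dumps.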
It's hard to believe this hasn't been done before, but looking at these tools I don't see a better way to approach this. Ideas anyone?
----------------------------------------------------------------------
Comment By: xqt (xqt)
Date: 2009-08-20 08:17
Message: out of date
----------------------------------------------------------------------
Comment By: siebrand (siebrand)
Date: 2007-04-26 21:27
Message:
Logged In: YES
user_id=1107255
Originator: NO
Please let us know if this feature request is still applicable to the current code. If no response is given, the feature request will be denied and the issue will be closed one month from now. This message was added in an effort to reduce the number of open issues on this project.
Siebrand
----------------------------------------------------------------------
Comment By: Rob W.W. Hooft (hooft)
Date: 2006-12-26 13:51
Message:
Logged In: YES
user_id=47476
Originator: NO
Your description is fairly general. It looks like this is best handled by a special robot. Since you are the one that knows exactly what you need, you would be the best one to program this!
Actually this is quite close to the first ever robot I wrote for the nl: wikipedia, to reorganize and standardize the date pages, except that it did not use the dumps....
----------------------------------------------------------------------
pywikipedia-bugs@lists.wikimedia.org