Patches item #3017517, was opened at 2010-06-17 13:38 Message generated for change (Comment added) made by lsundar You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=603140&aid=3017517...
Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: BalaSundaraRaman L (lsundar) Assigned to: Nobody/Anonymous (nobody) Summary: cosmetic_changes.py to remove bad wikilinks
Initial Comment: Translated articles created using http://translate.google.com/toolkit?hl=en suffer from one complex issue. It creates links to impossible pages in the target wiki. Let's take the example below: ( Excerpt from http://en.wikipedia.org/wiki/Corporate_governance ) A related but separate thread of discussions focuses on the impact of a corporate governance system in [[economic efficiency]], with a strong emphasis on shareholders' welfare.
This when translated to Tamil, for example, will have a single word for "in economic efficiency" and the tool wrongly links to that phrase. Since article title can't be of the form "in economic efficiency", it'll remain a red link forever. Since articles are littered with such red links, it's hard to read.
In view of the large-scale http://wikimania2010.wikimedia.org/wiki/Submissions/Google_translation project and the problems we faced ( http://wikimania2010.wikimedia.org/wiki/Submissions/A_Review_of_Google_Trans... ), I've developed a patch for cosmetic_changes.py which'll remove red links of the form [[some phrase]] leaving out cases where the label is different from the target. I've attached the patch as well. The changes by my bot running the modified code is at http://ta.wikipedia.org/wiki/Special:Contributions/SundarBot
If approved, I can give it to a dedicated bot operator with the translation team.
----------------------------------------------------------------------
Comment By: BalaSundaraRaman L (lsundar)
Date: 2010-06-18 10:00
Message: The changes will be visible when run in the following manner: python cosmetic_changes.py -fewerlinks -keepblue -file:listofarticles.txt
For a diff of its changes, please check http://is.gd/cTJWr
----------------------------------------------------------------------
You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=603140&aid=3017517...
pywikipedia-bugs@lists.wikimedia.org