[patches:#454] cosmetic_changes.py to remove bad wikilinks

Status: closed-rejected
Created: Thu Jun 17, 2010 09:38 AM UTC by BalaSundaraRaman L
Last Updated: Thu Jun 17, 2010 09:38 AM UTC
Owner: nobody

Translated articles created using http://translate.google.com/toolkit?hl=en suffer from one complex issue. It creates links to impossible pages in the target wiki. Let's take the example below:
( Excerpt from http://en.wikipedia.org/wiki/Corporate_governance )
A related but separate thread of discussions focuses on the impact of a corporate governance system in [[economic efficiency]], with a strong emphasis on shareholders' welfare.

This when translated to Tamil, for example, will have a single word for "in economic efficiency" and the tool wrongly links to that phrase. Since article title can't be of the form "in economic efficiency", it'll remain a red link forever. Since articles are littered with such red links, it's hard to read.

In view of the large-scale http://wikimania2010.wikimedia.org/wiki/Submissions/Google_translation project and the problems we faced ( http://wikimania2010.wikimedia.org/wiki/Submissions/A_Review_of_Google_Translation_project_in_Tamil_Wikipedia:_Role_of_voluntarism,_free_and_organically_evolved_community_in_ensuring_quality_of_Wikipedia ), I've developed a patch for cosmetic_changes.py which'll remove red links of the form [[some phrase]] leaving out cases where the label is different from the target. I've attached the patch as well. The changes by my bot running the modified code is at http://ta.wikipedia.org/wiki/Special:Contributions/SundarBot

If approved, I can give it to a dedicated bot operator with the translation team.


Sent from sourceforge.net because Pywikipedia-bugs@lists.wikimedia.org is subscribed to https://sourceforge.net/p/pywikipediabot/patches/

To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/pywikipediabot/admin/patches/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.