On Tuesday 06 February 2007 02:50:47 Simetrical wrote:
I suggest you familiarize yourself with sed. A while back I was trying to do something pretty similar to this. After a couple dozen languages I was ready to go insane and left off for a few months. While it took several hours to figure out how sed worked and run it, it was a hell of a lot quicker than it would've been for me to finish it all by hand.
Of course, your job may be a little trickier, since you'll probably need to run one sed for each specific link that needs replacing, but something along the lines of
sed -R s/('anontalkpagetext'.*)[[([^]]*IP[^]]*)]]/\1\2/ Messages*.php
should be a lot faster than manually replacing same in every file.
It is not that easy. It would work like that provided all messages would be similar to the english one. But they aren't. Many "translated" messages don't have any common content with their english counterpart (and sadly some are also looking to me as if that person never did check the own translation). In most cases the strings contain more links and different links than the english ones. So I have to find out manually by content (via Wikipedia interwiki links for example) where a link is pointing at (which I need to know in order to replace it.
Furthermore there are quite some other problems like article namespace links, interwiki/crosswiki links into Wikipedia, translated namespace names in messages (MessagesCy.php even did have a typo in local namespace name in a message; did the translator really check his work?).
So I fear this is pure handwork. At least if I want to avoid creating even a bigger mess.
That's why I call you all for your help. Together this can be solved ways faster and better and I am sure most of you speak at least one language different than English and there are more than 150 people subscribed to that list...
Arnomane