[Foundation-l] Please delete mo. wikipedia

Marcus Buck me at marcusbuck.org
Tue Oct 5 12:33:54 UTC 2010


  Have a look at <http://www.marcusbuck.org/ro/>. It's a quick demo of 
ro.wp content converted to Cyrillic. It's just a tiny extract of about 
50 ro.wp articles (I wanted to import the full dump, but I have a 
limited bandwidth connection and the dump upload failed at 90% of the 
1GB file). The conversion isn't perfect yet, some special cases are 
missing, but nothing that cannot be fixed relatively easily. It took me 
about 30 min to get this result.

The demo doesn't support Commons images, interwiki links, templates etc. 
but all this would work on a real Wikimedia wiki.

Things that won't work without syntactical support in the ro.wp source 
(and ro.wp won't agree to put -{...}- syntactical markers into their 
articles):
- foreign names will be converted even when inappropiate
- Roman numbers will be converted (a conversion exception could be added 
for Roman numbers, but that can also affect strings that just look like 
Roman numbers)

Apart from the mentioned issues most of the converted articles look okay 
to me. I wish to emphasize the word "look". I don't speak a word 
Romanian and even less so when it's written in Cyrillic.

So if Wikimedia wanted to support a read-only Romanian in Cyrillic wiki 
at ro-cyrl.wikipedia.org it could easily go live in one day. From a 
technical point of view it's not hard.

Marcus Buck
User:Slomox



More information about the foundation-l mailing list