Hoi, After analysing how to parse the text version of a GEMET list, I decided to also have a look at the HTML version. The reason is that in the text version the Russian, Bulgarian and Greek characters had become unreadable. The HTML version is readable, because there the characters are written as codes in the pre-UTF format (e.g. &#1099;&#1096; for ыш). It can therefore be parsed, and eventually I could upload it to Wiktionary. The question is: how do I convert it to UTF-8?
A related question about the UTF-8 conversion: is it possible to have a bot convert the non-UTF-8 content on en:wiktionary to UTF-8?
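
For the conversion itself, here is a minimal Python sketch of what I have in mind. It assumes the codes in the HTML are numeric character references of the form `&#NNNN;` or `&#xHHHH;`; the function name `decode_numeric_refs` is made up for illustration and is not part of any existing bot. It replaces only the numeric codes and leaves named entities such as `&amp;` untouched, so the surrounding markup is not changed.

```python
import re

# Matches decimal (&#1099;) and hexadecimal (&#x44B;) character references.
NUMERIC_REF = re.compile(r"&#(?:[xX]([0-9a-fA-F]+)|([0-9]+));")


def decode_numeric_refs(text: str) -> str:
    """Replace numeric character references with the characters they stand for."""

    def replace(match: re.Match) -> str:
        hex_digits, dec_digits = match.group(1), match.group(2)
        code_point = int(hex_digits, 16) if hex_digits else int(dec_digits)
        try:
            return chr(code_point)
        except (ValueError, OverflowError):
            # Not a valid Unicode code point: keep the reference as it was.
            return match.group(0)

    return NUMERIC_REF.sub(replace, text)


if __name__ == "__main__":
    sample = "&#1099;&#1096;"
    decoded = decode_numeric_refs(sample)
    print(decoded)
    # Encoding the decoded string gives the UTF-8 bytes to store or upload.
    utf8_bytes = decoded.encode("utf-8")
    print(len(utf8_bytes), "bytes in UTF-8")
```

To save the result, the decoded string can be written to a file opened with `encoding="utf-8"`. A bot would presumably run the same replacement over the page text before saving, but that part is not sketched here.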
Thanks, GerardM