That sounds to me as very useful indeed, Amir. I'm a newby to wikidata but as soon as I am back from my holiday (as of 5/5) I'd like to help. I know how to transliterate Armenian, Georgian, Ukrainian and Russian to internationally standardized (ISO-standards) forms (and to English and maybe French and German too). Problem could be that these standardized forms lead to diacritic characters (e.g. hacek on 'c' & 's'; Чайковский ---> Čajkovskij and Щедрин ---> Ščedrin). Is wikidata able to deal with these?
best regards, Eric van Balkum ('Eric de Muziekbibliothecaris') (former) Music Librarian, managing authority database at www.mcomb.nl _'Music was my first love, and it will be my last'_
Amir Ladsgroup schreef op 2015-04-22 14:48:
Hello, I started bot of auto-transliterating names of humans, initially with Persian and English (as a pair) since I know both and I can debug. After some modifications, In the last check, In more than several hundreds of edits I checked, I couldn't find any errors, I want to expand this bot for other languages but before, I need opinions of people who know rules of transliterating names in these languages, I tried to realize rules and this is my result but I need someone familiar to confirm *Chinese: Instead of space it uses "·" character (it's not dot) but order is the same. e.g Alan Turing is: "艾伦·图灵" [2] which "艾伦" means Alan and "图灵" means Turing *Japanese: it's the same but different separator: "・", e.g. "アラン・チューリング" [3] *Russian: The separator is space character but order is like "FamilyName, GivenName" e.g. "Тьюринг, Алан" [4] is "Turing, Alan". Handling names with more than two words would be pretty complicated (I skip them) *I checked Hebrew and Greek and both are simple languages like Persian, same order, space as separator.
If you can help me, it would make a great difference in number of labels in your language. Things you can help are: 1- Confirm or correct rules of these languages and add other rules if needed. 2- Suggest more languages. I thought about Sanskrit, Hindi, and Telugu but I don't know anyone who can check the rules, if you do, please help me. 3- For any language I will do an initial run just to test, if you can check edits of the bot (which is pretty easy, e.g. see this [5]) it would be awesome.
Thanks, Best
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l [1]
Links: ------ [1] https://lists.wikimedia.org/mailman/listinfo/wikidata-l [2] https://zh.wikipedia.org/wiki/%E8%89%BE%E4%BC%A6%C2%B7%E5%9B%BE%E7%81%B5 [3] https://ja.wikipedia.org/wiki/%E3%82%A2%E3%83%A9%E3%83%B3%E3%83%BB%E3%83%81%... [4] https://ru.wikipedia.org/wiki/%D0%A2%D1%8C%D1%8E%D1%80%D0%B8%D0%BD%D0%B3,_%D... [5] https://www.wikidata.org/w/index.php?title=Special:Contributions/Dexbot&...