Hi,
I added support for the German Wiktionary; it is available in the newest version. There is a quick test script that should get you 300k+ translations from the German Wiktionary in less than 15 minutes.
The dictionaries in 50 languages built using wikt2dict and other resources (parallel and comparable corpora) are available here: http://hlt.sztaki.hu/resources/index.html Please let me know if you find parsing errors.
I understand that DBPedia Wiktionary does a lot more than wikt2dict, and I do not plan to compete with it. However, adding 35+ Wiktionaries would have been near impossible for me. This is a quick (and dirty) way to extract the translations.
Cheers, Judit
2013/7/12 Judit, Ács acs.judit@sztaki.hu
Hi All,
I created a tool to extract translations from different editions of Wiktionary. Right now it supports 39 different Wiktionaries. It only extracts translations and ignores the rest.
Supported Wiktionaries: Azerbaijani, Bulgarian, Catalan, Czech, Danish, Greek, English, Esperanto, Spanish, Estonian, Basque, Finnish, French, Galician, Hebrew, Croatian, Hungarian, Indonesian, Italian, Georgian, Latin, Lithuanian, Malagasy, Dutch, Norwegian, Occitan, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Serbian, Swedish, Swahili, Turkish, Ukrainian, Vietnamese and Chinese.
Adding a new Wiktionary is done via a configuration file.
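To give a rough idea of what configuration-driven extraction can look like, here is a minimal sketch. It is not wikt2dict's actual code or configuration format: the names `EDITION_CONFIG` and `extract_translations` are made up for illustration, and the pattern only covers the simple `{{t|...}}` / `{{t+|...}}` translation templates used on the English Wiktionary.

```python
import re

# Hypothetical per-edition settings (an assumption for illustration,
# NOT the real wikt2dict configuration format).
EDITION_CONFIG = {
    "en": {
        # English Wiktionary marks translations with templates such as
        # {{t+|de|Hund|m}} or {{t|hu|kutya}}.
        "translation_re": re.compile(
            r"\{\{t[+\-]?\|(?P<lang>[^|}]+)\|(?P<word>[^|}]+)"
        ),
    },
}


def extract_translations(edition, title, wikitext, config=EDITION_CONFIG):
    """Return (source_lang, title, target_lang, translation) tuples."""
    pattern = config[edition]["translation_re"]
    return [
        (edition, title, m.group("lang").strip(), m.group("word").strip())
        for m in pattern.finditer(wikitext)
    ]


sample = "* German: {{t+|de|Hund|m}}\n* Hungarian: {{t|hu|kutya}}"
pairs = extract_translations("en", "dog", sample)
for pair in pairs:
    print("\t".join(pair))
```

In a setup like this, supporting another edition would only mean adding an entry with that edition's own translation pattern, while the extraction function stays unchanged.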
Right now the beta version is available for download at: https://github.com/juditacs/wikt2dict
Documentation is in progress; until then, the README should be enough to get started.
Please test it and send me your feedback and bug reports.
Thanks, Judit Ács