Dear Wiktionary Community,
We have been working on a triangulation method to expand existing dictionaries in many languages. We were able to parse translations from 40 Wiktionary editions and using these as seed dictionaries (appr. 3.6M translation pairs), we created an additional 16M pairs in 50 languages. It is possible to extend the number of languages.
While the automatically generated dictionary is not a 100% correct, with correct filtering, 90%+ can be reached.
One version of the parsed Wiktionaries and the generated pairs can be found here: https://www.dropbox.com/sh/r95tdr52o5rzzrw/a54Y66YGOJ We used dumps from August to create these. The software used to build dictionaries: https://github.com/juditacs/wikt2dict
Do you think there is a way to contribute this dictionary back to Wiktionary?
Best, Judit Ács