First, I would like to congrat for all your marvellous work. I need some
help with the wiktionary extraction framework. I tried to run the
wiktionary extraction framework for the greek language as per your
instructions here http://wiki.dbpedia.org/Wiktionary, updated java jdk to
1.7, etc. I have changed the configuration in config.preperties to language
el, config.xml to language el, and made a file config-el.xml with all the
appropriate mappings, based on the config-en.xml. I downloaded the dump
is the newest elwiktionary dump. I put the file unziped in the folder
wiktionaryDumpa as metioned in the instructions, and after prompt during
the compilation of the code, i moved it to
wiktionaryDump/elwitkionary/20121029/ . The program seems to run, but after
5(!!!) days, even if during mvn scala:run showed the triples on the screen
it didn't outputed any file!!! So where is the triples?Or, is there any
spesific configuration i mistook and should change? I also tried the binary
provided on the project homepage but nothing seems to be happen even for
the default configuration and test dump.
Thank you in advance. Looking forward for your reply.
M. Sc. Web Science
Faculty of Mathematics
Aristotle University of Thessaloniki
Some solid CNI work.
---------- Forwarded message ----------
From: "Chris Leonard" <cjlhomeaddress(a)gmail.com>
Date: Oct 19, 2012 7:30 PM
Subject: [support-gang] Help needed - Amazonian Peru
To: "Community Support Volunteers -- who help respond to "help AT laptop.org<
Dear Support Gang,
I need some "mechanical turk" style help from the Support Gang to
create a usable dictionary resource for the children of Amazonian Peru
who speak Asháninka (lang-cni). I have a large (435 page) image
document that I need to process page-by-page into digitized text
All of the files and the necessary steps are laid out on this wiki
page. If you have an Internet connection and a mouse, you can help
out with this important task.
Please go to this page and read the step-by-step directions.
It really takes no particular computer skills to help out with this,
it is just repetitive and a bit tedious. The other limiting factor is
that the preferred OCR web-site will only process 15 pages per user
per hour (for free), so having as many users as possible working on
If twenty people stepped up, we could have this done tonight and I
could move on to the next steps (copy-editing and formatting),
preparation for distribution in various forms, etc.
Thanks to anyone out there that will help me out with this.
Sugar Labs Translation Team Coordinattor
support-gang mailing list