Mon, 21 Feb 2011 11:23:18 +0100, Alex Brollo alex.brollo@gmail.com wrote:
I took a look at that very interesting page and added some preliminary comments. The field is a large and promising one! Perhaps a specific, dedicated space is needed to share ideas and scripts! Some users are working on this here and there, but perhaps a meeting point is needed.
Perhaps we can open a page/space on meta or wikisource.org about research and tools around Wikisource and OCR (or perhaps one already exists).
http://wikisource.org/wiki/Wikisource:Tools ? (not created)
PS: in our it.wiki discussions, we use "Wikisource djvu" for the same idea that you call "Reverse_OCR". :-)
I worked on a Python implementation 3-4 months ago, but image processing is not really advanced, particularly the creation of images of words. I began to write a FreeType wrapper (more complete than the existing one), but it was taking quite long and I'm not a professional developer. I also had to write a particle filter in Python (not really complicated for me, since it's my thesis research topic, but...)
I then switched to a C++ implementation so that I could use FreeType directly; a particle filter is available via the links on the English WP. But I have had no time for the last 1-2 months. I should share my code on the toolserver SVN to show what I've done.
Sébastien