Mon, 21 Feb 2011 11:23:18 +0100, Alex Brollo <alex.brollo(a)gmail.com> wrote:
I took a look at that mainly interesting page and I
added some
preliminary
comments. The field is a large, and promising one! Perhaps a specific,
dedicated space is needed to share ideas and scripts! Some user is
working
about here and there, but perhaps a meeting point is needed.
Perhaps we can open a
page/space on meta or
wikisource.org about research
and tools around Wikisource and OCRs (or perhaps it is already existing).
http://wikisource.org/wiki/Wikisource:Tools ? (not created)
PS: in our it.wiki talks, we call "Wikisource
djvu" the same idea that
you
call "Reverse_OCR". :-)
I worked on a Python implementation 3-4 months
ago but image processing is
not really advanced (particularly creation of images of words, I began to
write a wrapper of FreeType (more complete than the existing one) but it
was quite long and I'm not a professionnal developer) and I had to create
a particle filter in Python (not really complicated for me (it's my thesis
research topic), but...)
I switched then to a C++ implementation to use directly FreeType and a
particle filter is available on the English WP links. But I have no more
time since about 1-2 months, I should share my code(s) on the toolserver
SVN to show what I've done.
Sébastien