Hi,

Google OCR for Devnagari (Languages like Hindi, Marathi etc) is used by many. Accuracy is about 85-90% which is pretty good.
There is a paid OCR software available that gives better results. Can share the name if you are interested.

I doubt if there is in-wikisource support for OCR in Indic languages.

If you can specify a bit more on your requirements, people can suggest a possible solution.

Regards
-Sudhanwa


On Thu, Aug 11, 2016 at 3:08 AM, Lane Rasberry <lane@bluerasberry.com> wrote:
Hello,

Can anyone here refer me to someone who is active in making Hindi-language contributions to Wikisource? I wish to meet someone with experience in that language and project. Otherwise, can anyone suggest to me which Indic languages in Wikisource seem to be most active?

Is anyone able to make a recommendation for any OCR software for converting scanned Hindi language documents to digital text? Does anyone know anything about in-Wikisource support for OCR in Hindi language? Does it exist? Is there documentation?

Thanks for anything anyone can share.

yours,

--
Lane Rasberry
user:bluerasberry on Wikipedia

_______________________________________________
Wikimediaindia-l mailing list
Wikimediaindia-l@lists.wikimedia.org
To unsubscribe from the list / change mailing preferences visit https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l




--

~!~!~!~!~!~!~!~!~!~!~!~!~!~!~!~!~!~!~!~!~!~!~!~!~!
web: www.sudhanwa.com  blog: www.sudhanwa.in
Twitter: sudhanwa Check on FB, Linkedin for more.