Hello,
Can anyone here refer me to someone who is active in making Hindi-language contributions to Wikisource? I wish to meet someone with experience in that language and project. Otherwise, can anyone suggest to me which Indic languages in Wikisource seem to be most active?
Can anyone recommend OCR software for converting scanned Hindi-language documents to digital text? Does anyone know of in-Wikisource support for OCR in Hindi? Does it exist? Is there documentation?
Thanks for anything anyone can share.
yours,
Hi,
Google OCR for Devanagari (languages such as Hindi and Marathi) is used by many. Its accuracy is about 85-90%, which is pretty good. There is also paid OCR software that gives better results; I can share the name if you are interested.
I doubt there is in-Wikisource support for OCR in Indic languages.
If you can say a bit more about your requirements, people can suggest a possible solution.
Regards -Sudhanwa
On Thu, Aug 11, 2016 at 3:08 AM, Lane Rasberry lane@bluerasberry.com wrote:
Hello,
Can anyone here refer me to someone who is active in making Hindi-language contributions to Wikisource? I wish to meet someone with experience in that language and project. Otherwise, can anyone suggest to me which Indic languages in Wikisource seem to be most active?
Can anyone recommend OCR software for converting scanned Hindi-language documents to digital text? Does anyone know of in-Wikisource support for OCR in Hindi? Does it exist? Is there documentation?
Thanks for anything anyone can share.
yours,
-- Lane Rasberry user:bluerasberry on Wikipedia 206.801.0814 lane@bluerasberry.com
Wikimediaindia-l mailing list Wikimediaindia-l@lists.wikimedia.org To unsubscribe from the list / change mailing preferences visit https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
Hi,
I wrote this script to connect Google OCR and Wikisource.
Hindi is supported well.
https://github.com/tshrinivasan/OCR4wikisource
Check the README and INSTALL files for setup instructions on Ubuntu or any other Linux distribution.
Ping me or ask here for any assistance.
It is used heavily by the Tamil and Bengali Wikisource communities, who have OCRed more than 2000 books with it.
Hi Lane,
You've got to try this. It works great :)
https://github.com/tshrinivasan/OCR4wikisource
-Sibi
Hi,
I am a long-term Wikisource contributor and I speak Hindi. I don't contribute much in Hindi, as it is not my first language, but I am willing to help.
Regards,
Yann