Google's Optical Character Recognition software now works with all South Asian languages - WikimediaIndia-l

29 Aug 2015

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Google's OCR which apparently is most accurate OCR
we have seen so far, works really good for all the major South Asian
scripts:
http://globalvoicesonline.org/2015/08/29/googles-optical-character-recog
nition-software-now-works-with-all-south-asian-languages
Here are test cases of many Indian scripts: https://goo.gl/3X75iR.
Except Gurmukhi most scripts are working really good.

This could be really useful for Indian language Wikimedians and will
come handy for digitization of printed and scanned text.  Here is an
animated tutorial for Wikimedians to use this tool for
Wikisource/Wikipedia:
https://commons.wikimedia.org/wiki/File:Tutorial_to_use_Google_Optical_C
haracter_Recognition.gif

Please write to me if anyone wants to localize this tutorial in your
language.

- -- 
Best!
Subhashish Panigrahi
Programme Officer, Access To Knowledge
Centre for Internet and Society
@subhapa / https://cis-india.org
...PGP SIGNATURE...
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2

iQIcBAEBCAAGBQJV4YD0AAoJEHThehXZGxGO9ywP/RcJOXB3tFHJNF03X23x1jkY
vffu+1Iob6kLMZt/JD3nTmpXasXDlme6pbGzaT7/YZsC0VouN+4NE9HoEmZAksJF
3nn7HoEive4mDalXH5qyATOilezqIEYOG2c32LVYHnX6Co+fXPVa5WqsHn5js957
OionIc5t0V9zlGB6e5RLOacPWXsAhXyVunaeY6Ma33cOWHFdVnu1XpUGphJ+miVj
EWszTzjDOPlFiMsSsVonjWHvuz7hYPKXxvVXViXY1QAsoOT7wztvOepzM/hAPmYM
kGiODSaN8fU/e/2l4xdnMRymAt8hsz61hdye2UYx7xRjlda/23BKNZz0hiuWiqgO
FBntHycaHyqR8+fUK5EPE0vnqLp/7XdtRtQkRficuEDYlHz4PlMW8oiVEGhSZOaG
fdpgg02sojU1iMOGOs3h/ODWxkRrE3qpG+eT8n1mWJp6Tq7ZLEaQGxW1P6ytlPFF
qOz8JKl94D/MI7ybAtp+IsuUQk160H9wUPmaLxgemDRom7220xV6BysbmaMEWwww
hgO4fBNG6dPUMp825pTSxx18rY/Kw53sgHmUasixCL6Zv6xnM3rRuTxjZh8j77TR
gq2sKgoU+JkYt9eBpVRjrFO90xS5MxPrvL/lGH6P1smAODPull3o0tR681+NGKRp
C8vU5vJOlmL+HlNXBSh9
=lwbI
-----END PGP SIGNATURE-----