Thank you for this report Jayanta, that's interesting. Would you expect
more impact Marathi, Assamese, Sanskrit, Odia, and Kannada following the
OCR avaibility? I mean, to my mind Sanskrit is somewhat like Latin in
occidental countries, so I'm wouldn't expect a very large frenzy when a
new handy tool come to help with transcription. For other languages, I
don't know their vernacular popularity, I'll check their Wikipedia
article later. Would you expect more possible contributors for them?
Le 06/12/2016 à 18:56, Jayanta Nath a écrit :
We've just published the November 2016 Indic Wikisource statistics.
After implementing Google OCR script to our all Indic Wikisource ,
they are growing rapidly.
Here is the few stats ans their top three rank...
As per Number of article
1. Sanskrit Wikisource ( 15692 pages) - supported by 0.05% scan pages.
2. Telugu Wikisource ( 11752 pages) - supported by 24.3% scan pages.
3. Kannada Wikisource ( 8414 pages) - supported by 0.99% scan pages.
As per Number of page Validation
1. Telugu Wikisource ( 18192 pages)
2. Tamil Wikisource (5179 pages)
3. Gujarati Wikisource ( 4064 pages)
As per Number of page Proofread
1. Telugu Wikisource ( 22565 pages)
2. Malayalam Wikisource ( 8065 pages)
3. Tamil Wikisource ( 7876 pages)
As per percentage supported by scan pages.
1. Bengali Wikisource (27.16%)
2. Telugu Wikisource ( 24.56%)
3. Gujarati Wikisource (17.51%)
I want to specially mention that there are no visible improvement on
proofreading works and page content supported by scan at
Marathi,Assamese, Sanskrit, Odia, and Kannada Wikisource.
Full Indic Wikisource stats here
Indic Wikisource Community
Wikisource-l mailing list