is very important that you keep track of this progress.
Have you talked with Sam Wilson about this?
There could be many ways in which the WMF can help you
analyze this important moment of the Indic community,
and it's also very important to them (and their donors)
to understand how they have an impact.
Google OCR is a "simple thing", but we ("Western" wikisources) learned very
OCR was not available in many Indic languages.
I have shown many people in the WMF the stats about Telugu Wikisource (the
peak in the chart)
and it's crucial that many other people inside the WMF are aware of that.
The Indic Wikisource community can show that there are very "cheap" things
the WMF can do to help their communities thrive. The Indic Wikisource
community thus has a big responsibility ;-)
On Wed, Nov 2, 2016 at 7:02 PM, Jayanta Nath <jayantanth(a)gmail.com> wrote:
We've just published the November 2016 Indic Wikisource statistics. After
implementing the Google OCR script on all our Indic Wikisources, they are
Here are a few stats and the top three ranks...
By number of articles
1. Sanskrit Wikisource (15445 pages) - 0.05% supported by scanned pages.
2. Telugu Wikisource (11707 pages) - 24.3% supported by scanned pages.
3. Kannada Wikisource (7864 pages) - 0.99% supported by scanned pages.
By number of pages validated
1. Telugu Wikisource (18142 pages)
2. Tamil Wikisource (5167 pages)
3. Gujarati Wikisource (3729 pages)
By number of pages proofread
1. Telugu Wikisource (20213 pages)
2. Malayalam Wikisource (8065 pages)
3. Tamil Wikisource (7737 pages)
By percentage of pages supported by scanned pages
1. Bengali Wikisource (25.90%)
2. Telugu Wikisource (24.30%)
3. Gujarati Wikisource (17.51%)
I want to specially mention that there is no visible improvement at the
Marathi and Assamese Wikisources.
The Sanskrit and Kannada Wikisources need to explore moving their
proofread text towards scanned page support.
Full Indic Wikisource stats here
Indic Wikisource Community
Wikisource-l mailing list