a reminder that today is Wikisource 13th birthday!
How many communities are running a "proofreading contest"?
Italian Wikisource is doing it, any other one?
Thanks to Cristian Consonni, this year the "counting" should be easier, the
code is all online:
I made a draft on a proposal here:
Please edit, it's a draft and I think it's a problem many of us know from a
The idea is that, if we had a system for MediaWiki to know what a book is,
* better metadata management system, thus a better integration with Wikidata
* we could develop better tools, for example create automatic Indexes and
navigation templates, which would maybe enable us to "import" EPUBs
directly on Wikisource
* have a better workflow, making it easier to new users to use us
* have even better statistics and analytics: which are the *books* (not the
pages!) which are read o WIkisource?
IMHO, the Community Wishlist is a great opportunity to push for a systemic
development of Wikisource: I'm convinced we don't need little tools here
and there, but a systematic overview of what Wikisource framework,
workflows and software.
I'm open to discuss the merging of different proposals in the Wishlist.
The Wikimedia Foundation Community Tech team has launched a new "Community Wishlist Survey".
Last year survey allowed us to get WMF staff time to work on using Google OCR in Wikisource that allowed some Indian languages Wikisources to raise and on VisualEditor support.
Please, take time to submit new wishes and comment them. It could be simple things (e.g. a new gadget for a specific workflow) or very complicated ones (e.g. native TEI support).
Début du message réexpédié :
De: Johan Jönsson <jjonsson(a)wikimedia.org<mailto:firstname.lastname@example.org>>
Objet: [Wikitech-ambassadors] Your help needed: Community Wishlist Survey 2016
Date: 7 novembre 2016 à 20:26:21 UTC+1
À: Wikitech Ambassadors <wikitech-ambassadors(a)lists.wikimedia.org<mailto:email@example.com>>
Répondre à: Coordination of technology deployments across languages/projects <wikitech-ambassadors(a)lists.wikimedia.org<mailto:firstname.lastname@example.org>>
Last year, the Community Tech team did a survey for a community wishlist to decide what we shoudl be working on throughout the year. Since it's useful to have a list of tasks from the Wikimedia communities, it's also been used by other developers, been the focus of Wikimedia hackathons and so on. In short, I think it matters.
Now we're doing the process again.
If you'd feel like spreading this in your communities, it would be much appreciated.
*) This is when you can suggest things. This phase will last from 7 November to 20 November.
*) Editors who are not comfortable writing in English can write proposals in their language.
*) Voting will take place 28 November to 12 December.
Wikitech-ambassadors mailing list
We've just published the November 2016 Indic Wikisource statistics. After
implementing Google OCR script to our all Indic Wikisource , they are
Here is the few stats ans their top three rank...
As per Number of article
1. Sanskrit Wikisource ( 15445 pages) - supported by 0.05% scan pages.
2. Telugu Wikisource ( 11707 pages) - supported by 24.3% scan pages.
3. Kannada Wikisource ( 7864 pages) - supported by 0.99% scan pages.
As per Number of page Validation
1. Telugu Wikisource ( 18142 pages)
2. Tamil Wikisource ( 5167 pages)
3. Gujarati Wikisource ( 3729 pages)
As per Number of page Proofread
1. Telugu Wikisource ( 20213 pages)
2. Malayalam Wikisource ( 8065 pages)
3. Tamil Wikisource ( 7737 pages)
As per percentage supported by scan pages.
1. Bengali Wikisource (25.90%)
2. Telugu Wikisource ( 24.30%)
3. Gujarati Wikisource (17.51%)
I want to specially mention that there are no visible improvement at
Marathi and Assamese Wikisource.
For Sanskrit and Kannada Wikisource, they need to exploring their work of
proofreaded text towards scan page support.
Full Indic Wikisource stats here
Indic Wikisource Community
I was wondering if someone made some script for "importing epubs"
into Wikisource, meaning that
* there is a "translation" from HTML to wikitext
* there is a translation of the Index
It's not a very big deal to do this by hand, but sometimes it would be very
cool and helpful for those texts/books which are maybe present in Gutenberg
prject (or similar) and are not available in scans...