Dear All,
I represented the Kannada Wikisource community at the Indic Wikisource community consultation held in Kolkata on November 24th and 25th. Jayantanth gave a brief introduction, after which the following were discussed:
- Stats tool (https://tools.wmflabs.org/shrinitools/indic_ws_stats/): shows the number of validated, proofread, and without-text pages, among other counts.
- Wsexport (https://tools.wmflabs.org/wsexport/tool/book.php): allows you to download books in different formats.
- Indic OCR tool (https://tools.wmflabs.org/indic-ocr): allows us to run OCR on individual pages in the Page namespace.
- IA upload tool (https://tools.wmflabs.org/ia-upload/): helps us upload books from the Internet Archive to Commons.
- URL2commons (https://tools.wmflabs.org/url2commons/index.html): helps us upload books from any website.
- Vicuna Uploader (http://yarl.github.io/vicuna/): helps us upload books, images, and videos to Commons in bulk.
- Fill Index Gadget (https://en.wikisource.org/wiki/MediaWiki:Gadget-Fill_Index.js): this gadget fills in the index page on Wikisource.
- Hathi downloader (https://www.hathitrust.org/): helps to download books in bulk.
- Fatkun Batch Download Image (https://chrome.google.com/webstore/detail/fatkun-batch-download-ima/nnjjahlikiabnchcpehcpkdeckfgnohf/related?hl=en): a Chrome add-on that helps you download images in bulk.
- ABBYY FineReader (https://finereaderonline.com/en-us): helps you run OCR locally.
- Other tools such as Briss, pdfshuffle, Gscan2pdf, and Scantailor were discussed; these help to process books after scanning.
- The Wikisource workflow was discussed.
- Cleanup js (https://en.wikisource.org/wiki/User:Bodhisattwa/cleanup.js): helps to clean up text in bulk after OCR.
- The OTRS process (https://commons.wikimedia.org/wiki/Commons:OTRS) was discussed.
- A hands-on session on OCR.
- A discussion of the tools the Indic community needs in order to work efficiently on Wikisource.
- A short presentation on transclusion of books.
Thanks and Regards,
*ANANTH SUBRAY P V*
Programme Associate
Access to Knowledge program
The Centre for Internet & Society
+91-9739811664