Hello,
This message is to announce the availability of the "Match and Split" tool on all Wikisources that have a "Page" namespace.
"Match and Split" is a semi-automatic procedure that transfers text from a page of your Wikisource to the corresponding pages of the "Page" namespace. It finds the boundaries between physical pages in the text by comparing the text to the raw OCR text layer of a DjVu file.
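To give a rough idea of how such a comparison can locate page boundaries, here is a minimal Python sketch. It is an illustration only, not the code the tool actually runs: the function name split_by_ocr and the parameter ocr_pages are hypothetical, and it assumes the OCR layer is already available as one string per page. It aligns the concatenated OCR text with the proofread text using difflib from the standard library, then maps each OCR page boundary through the alignment.

```python
from difflib import SequenceMatcher


def split_by_ocr(text, ocr_pages):
    """Split `text` into one chunk per OCR page (illustrative sketch only).

    `text` is the proofread text; `ocr_pages` is a list holding the raw
    OCR text of each physical page, in order. Both names are hypothetical.
    """
    ocr_all = "".join(ocr_pages)
    # Character-level alignment between the OCR text and the clean text.
    matcher = SequenceMatcher(None, ocr_all, text, autojunk=False)
    # Triples (a, b, size): ocr_all[a:a+size] == text[b:b+size].
    # The last triple is a sentinel with size 0.
    blocks = matcher.get_matching_blocks()

    cuts = []
    offset = 0
    for page in ocr_pages[:-1]:
        offset += len(page)  # position of the page boundary in the OCR text
        cut = 0
        for a, b, size in blocks:
            if a > offset:
                break
            # Map the boundary through this block, clamped to the block end.
            cut = b + min(offset - a, size)
        cuts.append(cut)

    bounds = [0] + cuts + [len(text)]
    return [text[i:j] for i, j in zip(bounds, bounds[1:])]
```

For example, calling split_by_ocr with a clean sentence and two noisy OCR pages returns two chunks of the clean sentence, cut at the position that best corresponds to the OCR page break. A real OCR layer is much noisier than this, which is one reason the procedure is only semi-automatic and its results should be checked.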
Over the last few weeks the tool has been tested successfully on the French, English and German Wikisources. At fr.ws, more than 1000 texts were processed and converted to the Page format in less than a month (see the statistics here: http://toolserver.org/~thomasv/transclusions.html).
*How to install*
To install the tool on your wiki, you need to install the following code:
http://wikisource.org/wiki/MediaWiki:MatchSplit.js
To benefit from code updates, you may include it directly, as in this gadget:
http://fr.wikisource.org/wiki/MediaWiki:Gadget-robot.js
In addition, you need to give User:ThomasBot the robot flag.
*How to use the tool*
Help in English can be found here:
http://en.wikisource.org/wiki/Help:Match_and_Split
In addition, the job queue is visible at:
http://toolserver.org/~thomasv/robot.php
Thomas
wikisource-l@lists.wikimedia.org