This message is to announce the availability of the "Match and Split"
tool on all Wikisources that have a "Page" namespace.
"Match and Split" is a semi-automatic procedure that transfers text
from a page of your Wikisource to the corresponding pages of the
"Page" namespace. It finds the boundaries between physical pages in
the text by comparing the text to the raw OCR text layer of a DjVu file.
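
The matching idea can be illustrated with a short Python sketch. This
is not the tool's actual code: the function name match_and_split, the
word-level alignment with difflib, and the sample strings are all
assumptions made for illustration. It aligns the proofread text with
the concatenated OCR pages and cuts the text where each OCR page ends.

```python
import difflib


def match_and_split(text, ocr_pages):
    """Split `text` into one chunk per OCR page (hypothetical helper)."""
    text_words = text.split()

    # Concatenate the OCR words and remember where each page ends.
    ocr_words, page_ends = [], []
    for page in ocr_pages:
        ocr_words.extend(page.split())
        page_ends.append(len(ocr_words))

    # Align the two word sequences; OCR errors simply show up as gaps.
    matcher = difflib.SequenceMatcher(
        None, ocr_words, text_words, autojunk=False
    )
    blocks = matcher.get_matching_blocks()

    def to_text_index(ocr_index):
        # Translate a position in the OCR words to a position in the text.
        # A position inside a gap is moved to the start of the next match.
        for a, b, size in blocks:
            if ocr_index <= a + size:
                return b + max(0, ocr_index - a)
        return len(text_words)

    # The last page always receives whatever text remains.
    cuts = [to_text_index(end) for end in page_ends[:-1]]
    cuts.append(len(text_words))

    pages, start = [], 0
    for cut in cuts:
        pages.append(" ".join(text_words[start:cut]))
        start = cut
    return pages


# Sample input with typical OCR misreadings ("tbe", "tirnes,").
clean_text = (
    "It was the best of times, it was the worst of times, "
    "it was the age of wisdom"
)
ocr_layer = [
    "It was tbe best of times, it was",
    "the worst of tirnes, it was the age of wisdom",
]
for number, page_text in enumerate(match_and_split(clean_text, ocr_layer), 1):
    print(number, page_text)
```

A word-level alignment is used here because it tolerates isolated OCR
errors. A real implementation would also have to handle hyphenation
at page breaks, headers and footnotes, which this sketch ignores.
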
Over the last few weeks, the tool has been successfully tested on the
French, English and German Wikisources. At fr.ws, more than 1000 texts
were processed and converted to the Page format in less than a month
(see the statistics here:
*How to install*
In order to install the tool on your wiki, you need to install the
To benefit from code updates, you may include it directly,
as in this gadget:
In addition, you need to give User:ThomasBot the robot flag.
*How to use the tool*
Help in English can be found here:
In addition, the job queue is visible at: