[Wikisource-l] On linking Wikisource with page images

Ray Saintonge saintonge at telus.net
Tue Jan 22 07:25:05 UTC 2008


Jesse Martin (Pathoschild) wrote:
> That's a good point. How about a much cleaner syntax that can be used
> to generate the OCR markup? With your example text:
> {{ocr line| The first experiments were made on the absorption of carbonic }}
> {{ocr line| acid gas by water: and here a singular disagreement was observed }}
> {{ocr line| in the first trials made under exactly the same circumstances. It }}
>
> This is much easier to read, you know where the line breaks go, and
> it's immediately clear even to someone stumbling across the text that
> we're specifically keeping track of lines (so they don't helpfully
> remove unneeded line breaks). Since single line breaks are ignored by
> MediaWiki, we can just use the same line width so the template syntax
> lines up for easier ignoring.
I'm still skeptical about what this will accomplish, but will address 
that later.  The above does not address the treatment of hyphens.  When 
MediaWiki wraps single line breaks it ignores the hyphens that break up 
a word at the end of the line, and treats the word as though it were two.

Ec



More information about the Wikisource-l mailing list