@Alex: one thing I always would have liked to have is an /all page that would display all the ocr text and that would replace it by proofread text as soon as each page would be created.
It would be the equivalent of the text dump by internet archive and it would allow search engines to find text in the books even if they haven't been proofread yet.
Automatic creation of page transclusion is nice but also dangerous... too many structures to have an easy solution. It could be easier to have a parser that would read the proofread pages, detect "section" marks and create pages accordingly. IIRC, Phe did already something like that, perhaps you could port it to Lua?

By the way, do you think it would make sense that the Modulo:Dati is generated automatically by the Index: page? Maybe each time that it is saved?

Micru

On Thu, Jun 6, 2013 at 6:30 PM, Alex Brollo <alex.brollo@gmail.com> wrote:
I found a limit at approx 4000 calls of template|Pg; so I splitted the whole page  http://it.wikisource.org/wiki/Annali_del_Principato_ecclesiastico_di_Trento_dal_1022_al_1540/Indice into smaller subpages A....Z and now the script runs happily.

In the meantime, I'm translating js script which builds Modulo:Dati/.... in python, so that Alebot (running into TS willow and listening #it.wikisource) could write/update Modulo:Dati/... page as soon as Index:page has been edited. Index page edits are not so frequent IMHO, they are relatively "stable" pages. Global number of source edits would not been flooded IMHO by such additive edits of Modulo:Dati subpages. 

I'm thinking too about loading other useful book-related data which could be uploaded into Modulo:Dati to be used into any nsPage and  ns0 page related with rootIndex: page. I'm dreaming about an almost integral automation of ns0 transclusion.

But I absolutely need Aubrey's suggestions.....  :-)

Alex




2013/6/5 Alex Brollo <alex.brollo@gmail.com>
I'm testing tl|Pg into a "hard case", transcluded page will contain more than 8000 Pg template and Pg module calls; no expensive parser function is added by Lua. 


Alex


2013/6/3 Alex Brollo <alex.brollo@gmail.com>
Done Lars; I edited Modulo:Pg to keep Template:Pg simpler.

Alex




2013/6/3 Alex Brollo <alex.brollo@gmail.com>
Thanks Lars: OK. :-)


2013/6/3 Lars Aronsson <lars@aronsson.se>
On 06/03/2013 12:10 AM, Alex Brollo wrote:
Take a look to this page <http://it.wikisource.org/wiki/Pagina:Manuale_di_economia_politica_con_una_introduzione_alla_scienza_sociale.djvu/583>. As you see, page numbers are simply wrapped into a tl|Pg. No other parameters. Nevertheless, they are transformed into active links do right djvu pages.

Now, go here <http://it.wikisource.org/wiki/Manuale_di_economia_politica_con_una_introduzione_alla_scienza_sociale/Indice_dei_nomi_di_autori#pagename583> where the same page is transcluded in ns0. Page numbers now are active links to the right subpage/chapter.

This trick uses: Template:Pg <http://it.wikisource.org/wiki/Template:Pg>, Modulo:Pg <http://it.wikisource.org/wiki/Modulo:Pg>, and this Modulo:Dati/Manuale.... <http://it.wikisource.org/wiki/Modulo:Dati/Manuale_di_economia_politica_con_una_introduzione_alla_scienza_sociale.djvu>; nothing more than this, no javascript and no AJAX, so I presume (I didn't test by now) that links are running too into any html-based export of ns0 pages as server builds them.

Excellent! However, on the transcluded page, the link goes only to the
chapter URL. It should also add #pag137 or #pagename147, since
all page links in the margin of the transcluded chapter are marked
like this:... id="pag137"><span id="pagename147">...


--
  Lars Aronsson (lars@aronsson.se)
  Project Runeberg - free Nordic literature - http://runeberg.org/




_______________________________________________
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l





_______________________________________________
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l




--
Etiamsi omnes, ego non