Is nsPage merely a proofreading tool?

Alex Brollo

25 Aug 2013

7:46 a.m.

In a recent discussion at the en.source Scriptorium, it was said that nsPage can be viewed merely as a proofreading tool, with the ns0 transclusion/text being the real core of source content.

I have a different opinion: I see the nsPage code as the real core of source content, and ns0 as merely derived content. It could be generated fully automatically from a set of structural data wrapped in a Lua/Scribunto module (holding whatever data the header template and the pages tag need), so that any ns0 page/subpage could be obtained with a template call such as {{Derive|index base page name}}.
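To make the idea of deriving ns0 from structural data concrete, here is a minimal sketch in Python rather than Lua/Scribunto. It is not the proposed {{Derive}} template; the `Chapter` record, the `derive_ns0` function, the file name `Example.djvu` and the header parameters shown are all assumptions chosen for illustration. It only shows that, once a chapter's page range is stored as data, the wikitext of the ns0 subpage (a header plus a `<pages>` tag) can be generated mechanically.

```python
from dataclasses import dataclass


@dataclass
class Chapter:
    """Hypothetical structural record for one ns0 subpage."""
    title: str       # name of the ns0 subpage, e.g. "Chapter 1"
    first_page: int  # first scan page (Page: namespace position)
    last_page: int   # last scan page (Page: namespace position)


def derive_ns0(index_file: str, work_title: str, chapter: Chapter) -> str:
    """Build the wikitext of one ns0 subpage from structural data alone."""
    # Header parameters are illustrative; a real header template may need more.
    header = (
        "{{header\n"
        f" | title   = [[../|{work_title}]]\n"
        f" | section = {chapter.title}\n"
        "}}"
    )
    # The <pages> tag transcludes the given range of Page: pages.
    pages = (
        f'<pages index="{index_file}" '
        f"from={chapter.first_page} to={chapter.last_page} />"
    )
    return header + "\n" + pages


if __name__ == "__main__":
    print(derive_ns0("Example.djvu", "Example Work", Chapter("Chapter 1", 11, 24)))
```

In this sketch the ns0 page holds no text of its own: everything it shows comes from the page range and the stored metadata, which is the sense in which ns0 would be "derived".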

Giving nsPage such a core content role would make it much simpler to wrap TEI data into it, and any POV related to different styles of chapter/section structure or naming could be avoided. The HTML rendering would be unchanged, so IMHO conversion to ePub would not be affected.

What do you think?

Alex brollo

billinghurst

26 Aug 2013

2:54 a.m.

New subject: [Wikisource-l] Is nsPage merely a proofreading tool?

I am fairly certain that 95% of our transcribers would have little or no concept about which you are talking, and I am not certain that I do either. Once we get out of the scope of the obvious, further suggestions start to be difficult.

The concept that we utilise at enWS is that
* Page: ns is a working, non-presentation area. It is a means for formatting text for transclusion to the main ns (for straight transcription) and for translation (for WS sourced translations).
* Main ns is the presentation layer of the work produced by the author.

We are not into the slavish concept of "the page" as produced by the printer as its own entity beyond it being a carriage for the text. I would think that any further interpretation about structural data is getting too weighed down in other considerations, not the concept of the capturing of the words of an author.

Regards, Billinghurst

Alex Brollo

7:41 a.m.

Thanks for clearly illustrating this point of view.

Nevertheless: are we digitizing works or are we digitizing books? The two are different.

Apart from the theoretical difference between a work and a book, there are also some practical consequences. If nsPage is central, as it is in my opinion, then all data, both the author's words and anything else useful for using them, quoting them, and assembling them with illustrations, references, the logical structure of chapters and so on, should be contained in nsPage. At present such data are split across many different containers (nsIndex, nsPage, ns0 and their infoboxes). I vaguely feel that there is something to fix in this approach; and when I recently discovered TEI and went a little deeper into how OCR text is actually represented, i.e. into abbyy.xml files, that feeling became stronger.

In brief, my proposal is: can we consider bringing into nsPage any structural/logical data needed to build any possible non-paged representation of a work?

Alex

Andrea Zanni

28 Aug 2013

1:15 p.m.

I'm not sure if I understand Alex's idea, but I understand the issue of having 2 different namespaces, one for editing and one for reading.

I think it's a great idea, but we should greatly improve its usability and ease navigation from one ns to the other. For example, we should allow a reader who discovers a typo in ns0 to go and fix it with one click, and then come back with a single click. Could we have an [edit] or [fix] label near the SAL icon in ns0? Do we have any idea/tool for passing from nsPage to the right page/chapter in ns0 in a single click?

> Nevertheless: are we digitizing works or are we digitizing books? The two are different.

I really don't have an easy answer for this.

My feeling is that *our software is not ready* for this kind of thing. For example, I don't think we should allow TEI within the wikitext. We will, maybe, when the visual editor is ready. Or, maybe better, we should have a dedicated namespace (like nsTEI), which would grab the text from nsPage or ns0 and allow TEI use.

> In brief, my proposal is: can we consider bringing into nsPage any structural/logical data needed to build any possible non-paged representation of a work?

Again, I don't have a clear answer. It's true that sometimes we need a place for structural/logical data. I saw your experiments with Lua modules in some work subpages, and I can even imagine a Wikibase extension that allows Wikisource to store, for example, SAL data there. But it's complex, and I'd like a much broader discussion about these (visionary and very cool) ideas.

Aubrey

Alex Brollo

3:46 p.m.

Just a practical example. I like that all the page numbers listed in the word index at the end of a book, even when there are hundreds on a single page, are converted into working links that point to nsPage (when viewed in nsPage) and to ns0 (when transcluded into ns0). The link to nsPage is safe and can be automated, since a "page" is a clear-cut entity. By contrast, the same book page number has no safe relationship with a section/chapter, since a single page can contain (usually in part) one, two or sometimes three different chapters/sections. Automation is sometimes possible (through complex analysis of the page text) but often fails.
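The asymmetry described here can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names `page_link` and `chapters_on_page`, the file name `Example.djvu`, the chapter ranges, and the idea that a scan position equals the printed page number plus a constant offset are all hypothetical simplifications, not how any existing Wikisource tool works.

```python
# Hypothetical chapter ranges, as printed page numbers (inclusive).
# Chapter 1 ends and Chapter 2 begins on the same printed page, 14.
CHAPTERS = [
    ("Chapter 1", 1, 14),
    ("Chapter 2", 14, 30),
]


def page_link(index_file: str, book_page: int, offset: int) -> str:
    """Wikilink from a printed page number to its Page: scan.

    Assumes scan position = printed page number + a constant offset.
    """
    return f"[[Page:{index_file}/{book_page + offset}|{book_page}]]"


def chapters_on_page(book_page: int, chapters=CHAPTERS) -> list[str]:
    """Return every chapter whose printed page range includes book_page."""
    return [title for title, first, last in chapters if first <= book_page <= last]


if __name__ == "__main__":
    # The Page: target is always unique for a given page number.
    print(page_link("Example.djvu", 14, 8))
    # The chapter target may not be: this prints two candidates.
    print(chapters_on_page(14))
```

The page link needs only arithmetic, whereas choosing the right ns0 chapter for page 14 needs extra information about where on the page the indexed word falls, which is the part that often fails when automated.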

Alex
