Hello all,
one early question we are currently debating is how to store Wikitext documentation alongside with the structured data?
So, the label of the page and aliases and the actual content object are stored as JSON, but then we would like to have the documentation be more or less normal wikitext.
So in this mockup
https://meta.wikimedia.org/wiki/Abstract_Wikipedia/Early_mockups#/media/File...
The text "en:Multiplication is a mathematical operation that...", that's just wikitext. And it is different per language. Obviously, it would be great to represent that as abstract content, but we are not there yet. Until we get to that level of inception, the question is - where and how do we store that text and how is it combined with the structured data about the object on the page.
James wrote up a task with an overview of the options.
https://phabricator.wikimedia.org/T258953
That's a question that would benefit from input.
Thank you, Denny
On Tue, 28 Jul 2020, 23:27 Denny Vrandečić, dvrandecic@wikimedia.org wrote:
Hello all,
one early question we are currently debating is how to store Wikitext documentation alongside with the structured data?
So, the label of the page and aliases and the actual content object are stored as JSON, but then we would like to have the documentation be more or less normal wikitext. (...) The text "en:Multiplication is a mathematical operation that...", that's just wikitext. And it is different per language.
(...)
So, if I get it right, that wikitext is basically the equivalent of a Wikidata item description. Allow me to play dumb ("play"... heh), and ask: do we really need that description to be wikitext or do we want it to be similar to a Wikidata item description (i.e. similar to a caption)?
L.
On Wed, 29 Jul 2020 at 11:11, Luca Martinelli martinelliluca@gmail.com wrote:
On Tue, 28 Jul 2020, 23:27 Denny Vrandečić, dvrandecic@wikimedia.org wrote:
Hello all,
one early question we are currently debating is how to store Wikitext documentation alongside with the structured data?
So, the label of the page and aliases and the actual content object are stored as JSON, but then we would like to have the documentation be more or less normal wikitext. (...) The text "en:Multiplication is a mathematical operation that...", that's just wikitext. And it is different per language.
(...)
So, if I get it right, that wikitext is basically the equivalent of a Wikidata item description.
Not quite. The current design calls for there to be three different forms of human-readable text, with different availability depending on the type of ZObject:
All ZObjects will have a "*label*", which will be a set of short textual identifiers, up to one per supported language, shown as the "title" of the page and used in most places aimed at humans using them.
On https://meta.wikimedia.org/wiki/File:Wikilambda_early_mockup_multiply_functi... this is "multiply" or "multiplizieren" or whatever depending on your view language, as shown at the top of the content, ahead of the ZID (Z2303 in the example).
In combination with the 'type' of the ZObject, this is enforced to be unique in a given language (so you can have a 'multiply' function defined that works on two integers and responds with an integer, and another that works on hypercomplex numbers (or matrices or Game of Life implementations or…), but you can't have two that work on the exact same input/output types.
(Note: The current design doesn't have alias support, which Wikidata does.)
ZObjects which are function definitions (ZFunctions), and so may need a lot of documentation and information sharing for people to know how to use it, to expand it, to add new tests, or to add new implementations, will have a fully-featured *wikitext description* blob/set of blobs, for detailed documentation. These will be the main human starting points and work co-ordination places, so need a lot of the things our communities have come to expect – links, templates, categories, *etc.*.
On https://meta.wikimedia.org/wiki/File:Wikilambda_early_mockup_multiply_functi... this is "en:Multiplication is a mathematical…", as shown below the calculated type.
ZObjects which don't have wikitext description instead have a short, non-wikitext *description* field, again at a maximum of one per supported language, like how Wikidata works.
On https://meta.wikimedia.org/wiki/File:Wikilambda_early_mockup_multiply_recurs... this is "Calculates the sum of multiplicand …", as shown below the calculated type.
Allow me to play dumb ("play"... heh), and ask: do we really need that description to be wikitext or do we want it to be similar to a Wikidata item description (i.e. similar to a caption)?
I don't know. Maybe everything could just be done through talk pages?
J.
On Wed, Jul 29, 2020 at 6:41 AM James Forrester jforrester@wikimedia.org wrote:
[...]
(Note: The current design doesn't have alias support, which Wikidata does.)
At least in the github version of Denny's abstracttext, Z1K5 is for aliases, so it DOES have alias support... But maybe you're talking about the mockup design?
ZObjects which are function definitions (ZFunctions), and so may need a lot of documentation and information sharing for people to know how to use it, to expand it, to add new tests, or to add new implementations, will have a fully-featured *wikitext description* blob/set of blobs, for detailed documentation. These will be the main human starting points and work co-ordination places, so need a lot of the things our communities have come to expect – links, templates, categories, *etc.*.
On Wikidata, Wikidata: and other namespace pages have wikitext that can be translated, with all the translated versions linked together so they have the same sections which should be correlated - I think there's a MediaWiki extension for it though I don't know much about it. While we obviously can't go straight to the ultimate "abstract" representation, I think rather than independent wikitext descriptions in each language you want something that is linked together in this way. Otherwise how do you prevent the wikitext description of a function in Chinese from being totally different from its description in English?
Arthur
בתאריך יום ד׳, 29 ביולי 2020 ב-13:11 מאת Luca Martinelli < martinelliluca@gmail.com>:
On Tue, 28 Jul 2020, 23:27 Denny Vrandečić, dvrandecic@wikimedia.org wrote:
Hello all,
one early question we are currently debating is how to store Wikitext documentation alongside with the structured data?
So, the label of the page and aliases and the actual content object are stored as JSON, but then we would like to have the documentation be more or less normal wikitext. (...) The text "en:Multiplication is a mathematical operation that...", that's just wikitext. And it is different per language.
(...)
So, if I get it right, that wikitext is basically the equivalent of a Wikidata item description. Allow me to play dumb ("play"... heh), and ask: do we really need that description to be wikitext or do we want it to be similar to a Wikidata item description (i.e. similar to a caption)?
"Description" in Wikidata is a misnomer. It should have never been called "description". It should have been called a "disambiguator", which is its real purpose. The name "description" caused developers of several apps and extensions to actually use them as summaries in search results, which in turn caused the Wikidata-suspicious English Wikipedia community to ask for the weird {{short description}} feature to override the Wikidata "description" (and to top things off, some people are now thinking of undeploying the {{short description}} feature). It also caused some Wikidatans to start thinking of some very technical syntax for writing descriptions. I don't remember particular examples, but I do remember debates about writing very technical, tight descriptions for biological species. Which is supposed to be very pointless, because, um, Wikidata has properties for structured data.
Denny can correct me, but my memory tells me that Wikidata descriptions were made just for disambiguation, like the parentheses part in "Georgia (U.S. state)" and "Georgia (country)" in the English Wikipedia. They were made for items that have a label that is identical to the label of another item. They have always been supposed to be super-simple, optional and not even necessary in most items.
Documentation for code is completely different. It is needed more or less always, it can be long, and it often needs rich formatting. So it should probably allow real wikitext, as Denny suggests.
If I understand correctly, Wikilambda function are kind of similar to templates and modules. Simpler templates have their documentation in the templates page itself, and the more complicated ones have it in a /doc supbage. Templates can also have a bit of documentation in the TemplateData description, but this is not supposed to be comprehensive (and if I understand correctly, functions probably don't need something like TemplateData because according to the current plan, Z Objects will provide similar functionality in a more structured way).
Scribunto Modules are not wikitext, and they always have their documentation in a /doc subpage.
If functions do something similar on a /doc subpage, it will be totally fine, at least as a start. Occasionally there are suggestions to move TemplateData and template and module documentation to MCR slots, and this is probably fine, but optional.
L.
Abstract-Wikipedia mailing list Abstract-Wikipedia@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/abstract-wikipedia
On 29 July 2020 at 11:54 "Amir E. Aharoni" amir.aharoni@mail.huji.ac.il wrote:
It also caused some Wikidatans to start thinking of some very technical syntax for writing descriptions. I don't remember particular examples, but I do remember debates about writing very technical, tight descriptions for biological species. Which is supposed to be very pointless, because, um, Wikidata has properties for structured data.
This is too sweeping for the biomedical area, really, where the terminology problems are horrendous. But carry on, everyone.
Charles
Amir,
“some people are now thinking of undeploying the {{short description}} feature”
Who? Where is this being discussed? I am not sure what you mean by “undeploying the feature”
Agree that it was a badly handled problem and massive time-sink, but would prefer that the baby is not thrown out with the bathwater.
Cheers,
Peter
From: Abstract-Wikipedia [mailto:abstract-wikipedia-bounces@lists.wikimedia.org] On Behalf Of Amir E. Aharoni Sent: 29 July 2020 12:54 To: General public mailing list for the discussion of Abstract Wikipedia (aka Wikilambda) Subject: Re: [Abstract-wikipedia] How to store wikitext along the structured content?
בתאריך יום ד׳, 29 ביולי 2020 ב-13:11 מאת Luca Martinelli <martinelliluca@gmail.com>:
On Tue, 28 Jul 2020, 23:27 Denny Vrandečić, dvrandecic@wikimedia.org wrote:
Hello all,
one early question we are currently debating is how to store Wikitext documentation alongside with the structured data?
So, the label of the page and aliases and the actual content object are stored as JSON, but then we would like to have the documentation be more or less normal wikitext.
(...)
The text "en:Multiplication is a mathematical operation that...", that's just wikitext. And it is different per language.
(...)
So, if I get it right, that wikitext is basically the equivalent of a Wikidata item description. Allow me to play dumb ("play"... heh), and ask: do we really need that description to be wikitext or do we want it to be similar to a Wikidata item description (i.e. similar to a caption)?
"Description" in Wikidata is a misnomer. It should have never been called "description". It should have been called a "disambiguator", which is its real purpose. The name "description" caused developers of several apps and extensions to actually use them as summaries in search results, which in turn caused the Wikidata-suspicious English Wikipedia community to ask for the weird {{short description}} feature to override the Wikidata "description" (and to top things off, some people are now thinking of undeploying the {{short description}} feature). It also caused some Wikidatans to start thinking of some very technical syntax for writing descriptions. I don't remember particular examples, but I do remember debates about writing very technical, tight descriptions for biological species. Which is supposed to be very pointless, because, um, Wikidata has properties for structured data.
Denny can correct me, but my memory tells me that Wikidata descriptions were made just for disambiguation, like the parentheses part in "Georgia (U.S. state)" and "Georgia (country)" in the English Wikipedia. They were made for items that have a label that is identical to the label of another item. They have always been supposed to be super-simple, optional and not even necessary in most items.
Documentation for code is completely different. It is needed more or less always, it can be long, and it often needs rich formatting. So it should probably allow real wikitext, as Denny suggests.
If I understand correctly, Wikilambda function are kind of similar to templates and modules. Simpler templates have their documentation in the templates page itself, and the more complicated ones have it in a /doc supbage. Templates can also have a bit of documentation in the TemplateData description, but this is not supposed to be comprehensive (and if I understand correctly, functions probably don't need something like TemplateData because according to the current plan, Z Objects will provide similar functionality in a more structured way).
Scribunto Modules are not wikitext, and they always have their documentation in a /doc subpage.
If functions do something similar on a /doc subpage, it will be totally fine, at least as a start. Occasionally there are suggestions to move TemplateData and template and module documentation to MCR slots, and this is probably fine, but optional.
L.
_______________________________________________ Abstract-Wikipedia mailing list Abstract-Wikipedia@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/abstract-wikipedia
Virus-free. http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=emailclient www.avg.com
בתאריך יום ה׳, 30 ביולי 2020 ב-13:08 מאת Peter Southwood < peter.southwood@telkomsa.net>:
Amir,
“some people are now thinking of undeploying the {{short description}} feature”
Who? Where is this being discussed? I am not sure what you mean by “undeploying the feature”
Agree that it was a badly handled problem and massive time-sink, but would prefer that the baby is not thrown out with the bathwater.
Cheers,
Peter
On https://en.wikipedia.org/wiki/Wikipedia:Village_pump_(WMF) now, and a lot of different and contradicting opinions are thrown around there.
On 30 July 2020 at 11:22 "Amir E. Aharoni" amir.aharoni@mail.huji.ac.il wrote:
בתאריך יום ה׳, 30 ביולי 2020 ב-13:08 מאת Peter Southwood < peter.southwood@telkomsa.net mailto:peter.southwood@telkomsa.net >: > >
Amir, “some people are now thinking of undeploying the {{short description}} feature” Who? Where is this being discussed? I am not sure what you mean by “undeploying the feature” Agree that it was a badly handled problem and massive time-sink, but would prefer that the baby is not thrown out with the bathwater. Cheers, Peter >
On https://en.wikipedia.org/wiki/Wikipedia:Village_pump_(WMF) now, and a lot of different and contradicting opinions are thrown around there.
Clearly, if there is a definite demand for scraps of human-readable text attached to things, some people will spend time arguing that the consumer is always wrong in these situations. Just as, if you point out that it provides another way to search, they will argue that people don't, can't or shouldn't search in such ways.
I feel there are fallacies lurking here, or anti-patterns, and would be happy to have names for them.
Charles
Charles,
There may well be fallacies. Perhaps you could point out one or two of them. There is also evasion of responsibility, and delaying of what should have been done already by an ethical and competent organisation, and as usual, a lot of commenting, queries and demands from people who do not know the background. We are accustomed to this.
Cheers,
Peter
From: Abstract-Wikipedia [mailto:abstract-wikipedia-bounces@lists.wikimedia.org] On Behalf Of Charles Matthews via Abstract-Wikipedia Sent: 30 July 2020 12:31 To: General public mailing list for the discussion of Abstract Wikipedia (aka Wikilambda) Subject: Re: [Abstract-wikipedia] Off-topic, was How to store wikitext along the structured content?
On 30 July 2020 at 11:22 "Amir E. Aharoni" amir.aharoni@mail.huji.ac.il wrote:
בתאריך יום ה׳, 30 ביולי 2020 ב-13:08 מאת Peter Southwood < peter.southwood@telkomsa.net>:
Amir,
“some people are now thinking of undeploying the {{short description}} feature”
Who? Where is this being discussed? I am not sure what you mean by “undeploying the feature”
Agree that it was a badly handled problem and massive time-sink, but would prefer that the baby is not thrown out with the bathwater.
Cheers,
Peter
On https://en.wikipedia.org/wiki/Wikipedia:Village_pump_(WMF) now, and a lot of different and contradicting opinions are thrown around there.
Clearly, if there is a definite demand for scraps of human-readable text attached to things, some people will spend time arguing that the consumer is always wrong in these situations. Just as, if you point out that it provides another way to search, they will argue that people don't, can't or shouldn't search in such ways.
I feel there are fallacies lurking here, or anti-patterns, and would be happy to have names for them.
Charles
Virus-free. http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=emailclient www.avg.com
On 30 July 2020 at 12:54 Peter Southwood peter.southwood@telkomsa.net wrote:
Charles, There may well be fallacies. Perhaps you could point out one or two of them. There is also evasion of responsibility, and delaying of what should have been done already by an ethical and competent organisation, and as usual, a lot of commenting, queries and demands from people who do not know the background. We are accustomed to this.
Since my considered response to the over-aggressive rationalisation inherent in tech planning involves an appeal to the principles of Marie Kondo sorting, I fear I may be misunderstood. It would not be the first time.
Charles
Amir, That discussion is not so much about undeploying the {{short description}} feature, it is about WMF following up on what they undertook to do to disable display of unsourced and unchecked Wikidata content in a way that suggests it is Wikipedia content once Wikipedia met the WMFs unilaterally imposed conditions against Wikipedia project consensus. I would have expected the closing to have been worked out ahead of time, and a mere formality, but it seems that interminable squabbling is possible even in as straight forward appearing a situation as this.
Cheers,
Peter
From: Abstract-Wikipedia [mailto:abstract-wikipedia-bounces@lists.wikimedia.org] On Behalf Of Amir E. Aharoni Sent: 30 July 2020 12:22 To: General public mailing list for the discussion of Abstract Wikipedia (aka Wikilambda) Subject: Re: [Abstract-wikipedia] How to store wikitext along the structured content?
בתאריך יום ה׳, 30 ביולי 2020 ב-13:08 מאת Peter Southwood <peter.southwood@telkomsa.net>:
Amir,
“some people are now thinking of undeploying the {{short description}} feature”
Who? Where is this being discussed? I am not sure what you mean by “undeploying the feature”
Agree that it was a badly handled problem and massive time-sink, but would prefer that the baby is not thrown out with the bathwater.
Cheers,
Peter
On https://en.wikipedia.org/wiki/Wikipedia:Village_pump_(WMF) now, and a lot of different and contradicting opinions are thrown around there.
Virus-free. http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=emailclient www.avg.com
abstract-wikipedia@lists.wikimedia.org