The Wikipedia article about Wangerooge describes an island and municipality.
https://en.wikipedia.org/wiki/Wangerooge
These two concepts, island and municipality, have discrete items.
Municipality: https://www.wikidata.org/wiki/Q25135 Island: https://www.wikidata.org/wiki/Q17081143
I would like Q17081143 to point to the Wangerooge Wikipedia article, but adding the link gives this error:
The link enwiki:Wangerooge is already used by item Q25135. You may remove it from Q25135 if it does not belong there or merge the items if they are about the exact same topic.
How can we fix this?
Disambiguation? In Openstreetmap it makes total sense to have 2 items for these very different concepts.
Jo
2014-09-08 23:04 GMT+02:00 Edward Betts edward@4angle.com:
The Wikipedia article about Wangerooge describes an island and municipality.
https://en.wikipedia.org/wiki/Wangerooge
These two concepts, island and municipality, have discrete items.
Municipality: https://www.wikidata.org/wiki/Q25135 Island: https://www.wikidata.org/wiki/Q17081143
I would like Q17081143 to point to the Wangerooge Wikipedia article, but adding the link gives this error:
The link enwiki:Wangerooge is already used by item Q25135. You may
remove it
from Q25135 if it does not belong there or merge the items if they are
about
the exact same topic.
How can we fix this?
Edward.
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Create a third item in Wikidata, and use that for the language links. Any Wikipedia that has two separate articles can link to the separate items, any Wikipedia that has only one article can link to the single item.
On Mon, Sep 8, 2014 at 2:43 PM, Jo winfixit@gmail.com wrote:
Disambiguation? In Openstreetmap it makes total sense to have 2 items for these very different concepts.
Jo
2014-09-08 23:04 GMT+02:00 Edward Betts edward@4angle.com:
The Wikipedia article about Wangerooge describes an island and municipality.
https://en.wikipedia.org/wiki/Wangerooge
These two concepts, island and municipality, have discrete items.
Municipality: https://www.wikidata.org/wiki/Q25135 Island: https://www.wikidata.org/wiki/Q17081143
I would like Q17081143 to point to the Wangerooge Wikipedia article, but adding the link gives this error:
The link enwiki:Wangerooge is already used by item Q25135. You may
remove it
from Q25135 if it does not belong there or merge the items if they are
about
the exact same topic.
How can we fix this?
Edward.
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Am 09.09.2014 01:40, schrieb Denny Vrandečić:
Create a third item in Wikidata, and use that for the language links. Any Wikipedia that has two separate articles can link to the separate items, any Wikipedia that has only one article can link to the single item.
That's a nice solution for the language link problem, but modelling the relationship of these three items on wikidata is kind of annoying/tricky. How would you do that?
-- daniel
The composite item seems to be a sort of composite geographical/human system, like an ecosystem (community of living organisms together with the nonliving components of their environment) https://www.wikidata.org/wiki/Q37813 a special kind of ecosystem maybe ...
2014-09-09 11:33 GMT+02:00 Daniel Kinzler daniel.kinzler@wikimedia.de:
Am 09.09.2014 01:40, schrieb Denny Vrandečić:
Create a third item in Wikidata, and use that for the language links. Any Wikipedia that has two separate articles can link to the separate items,
any
Wikipedia that has only one article can link to the single item.
That's a nice solution for the language link problem, but modelling the relationship of these three items on wikidata is kind of annoying/tricky. How would you do that?
-- daniel
-- Daniel Kinzler Senior Software Developer
Wikimedia Deutschland Gesellschaft zur Förderung Freien Wissens e.V.
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
On 09.09.2014 11:47, Thomas Douillard wrote:
The composite item seems to be a sort of composite geographical/human system, like an ecosystem (community of living organisms together with the nonliving components of their environment) https://www.wikidata.org/wiki/Q37813 a special kind of ecosystem maybe ...
There are certainly cases where such an interpretation is possible. But there are also cases where the composite is just a decision to describe two related topics in a single Wikipedia article, without this having any deeper meaning. Normally it is easy to tell from the Wikipedia article: for example, if it starts with "Wangeroog is a composite socio-geographic system" then this is what we should adhere to ;-). If not, then we should not interpret it in this way. The Wikipedia article should be our "grounding" that fully explains what an item is about. We should not invent new meanings in the transition from Wikipedia to Wikidata.
Cheers,
Markus
On 09.09.2014 11:33, Daniel Kinzler wrote:
Am 09.09.2014 01:40, schrieb Denny Vrandečić:
Create a third item in Wikidata, and use that for the language links. Any Wikipedia that has two separate articles can link to the separate items, any Wikipedia that has only one article can link to the single item.
That's a nice solution for the language link problem, but modelling the relationship of these three items on wikidata is kind of annoying/tricky. How would you do that?
Before the "how?" should come the "why?". The modelling should be chosen so that it best suits a given purpose (the purpose is the benchmark for deciding if a particular modelling approach is "good" or not). I guess the main thing we want to achieve here is to link the combined item to and from the single items. If this is true, then the "how?" question is basically a "which property to use?" question.
For this we should look more closely at the nature of the combined item. Let's distinguish "combined items" that are natural and meaningful concepts from those that are just different topics combined for editorial reasons in one article. The first kind of item involves things like bands (who have members, possibly with individual articles, but which are still meaningful concepts by themselves). The second kind of item involves the Wangerooge hybrid, but also many other things (e.g., plane crashes and the planes themselves; or people and events the people where involved in).
The problem with these second type of complex item is that it does not give you a good basis for adding data (you can't say properly which aspects of the thing you are talking about). It is also problematic since these things are not natural concepts that can be considered to have certain "joint properties". Rather, they are a kind of editorial trick to organise information in Wikipedia articles. For this reason, I would suggest to have a property for making links in this case that clearly refers to Wikipedia. Like "is a list of" or the item for "Wikipedia disambiguation page". I would avoid using properties that are normally used with real concepts, such as "has part" (which would make sense for "bands" -> "band member", but not for "Wangerooge hybrid" -> "Wangerooge island" (it's not a part since the former is not a proper concept to start with).
With such a editorial property, one could then also create items for parts that don't have Wikipedia article so as to be able to add data that would be confusing/wrong in the combined item.
Markus
My proposal became more clear to me over lunch:
For articles that are really about multiple different things that cannot be reconciled in a single natural concept:
* State "intance of:Wikipedia article with multiple topics" (we already have several other classes of Wikipedia articles). * Use some property, say "has topic", to link to items about the individual topics. * Optionally: use a property like "subject of" (P805) to link back from the individual items to the multi-topic pages.
The main proposal here is to treat these things like Wikipedia disambiguation pages: we have items, but the items are mainly about the page, not about any real-world concept we care about.
Example: https://en.wikipedia.org/wiki/Samoan_Clipper
It says "Samoan Clipper was one of ten Pan American Airways Sikorsky S-42 flying boats" but it includes an infobox that lists "fatalities". So the article describes both a specific airplane (the flying boat) and an event (crash of that plane). We should not try to invent a new concept of "machine-event system" to capture this, but have two items for the two things we have here.
We will have many cases where this is not necessary if we can find a natural "composite concept" that it makes sense to talk about. In these case, we will use different properties for the links (for example, a country article may sometimes be used to describe all the federal states of that country, yet we have a good way of linking individual state items to the country). As usual, there will be corner cases where it is not clear what to do; then we need specific discussions on these cases.
Cheers,
Markus
On 09.09.2014 11:57, Markus Krötzsch wrote:
On 09.09.2014 11:33, Daniel Kinzler wrote:
Am 09.09.2014 01:40, schrieb Denny Vrandečić:
Create a third item in Wikidata, and use that for the language links. Any Wikipedia that has two separate articles can link to the separate items, any Wikipedia that has only one article can link to the single item.
That's a nice solution for the language link problem, but modelling the relationship of these three items on wikidata is kind of annoying/tricky. How would you do that?
Before the "how?" should come the "why?". The modelling should be chosen so that it best suits a given purpose (the purpose is the benchmark for deciding if a particular modelling approach is "good" or not). I guess the main thing we want to achieve here is to link the combined item to and from the single items. If this is true, then the "how?" question is basically a "which property to use?" question.
For this we should look more closely at the nature of the combined item. Let's distinguish "combined items" that are natural and meaningful concepts from those that are just different topics combined for editorial reasons in one article. The first kind of item involves things like bands (who have members, possibly with individual articles, but which are still meaningful concepts by themselves). The second kind of item involves the Wangerooge hybrid, but also many other things (e.g., plane crashes and the planes themselves; or people and events the people where involved in).
The problem with these second type of complex item is that it does not give you a good basis for adding data (you can't say properly which aspects of the thing you are talking about). It is also problematic since these things are not natural concepts that can be considered to have certain "joint properties". Rather, they are a kind of editorial trick to organise information in Wikipedia articles. For this reason, I would suggest to have a property for making links in this case that clearly refers to Wikipedia. Like "is a list of" or the item for "Wikipedia disambiguation page". I would avoid using properties that are normally used with real concepts, such as "has part" (which would make sense for "bands" -> "band member", but not for "Wangerooge hybrid" -> "Wangerooge island" (it's not a part since the former is not a proper concept to start with).
With such a editorial property, one could then also create items for parts that don't have Wikipedia article so as to be able to add data that would be confusing/wrong in the combined item.
Markus
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Am 09.09.2014 13:36, schrieb Markus Krötzsch:
My proposal became more clear to me over lunch:
For articles that are really about multiple different things that cannot be reconciled in a single natural concept:
- State "intance of:Wikipedia article with multiple topics" (we already have
several other classes of Wikipedia articles).
- Use some property, say "has topic", to link to items about the individual topics.
- Optionally: use a property like "subject of" (P805) to link back from the
individual items to the multi-topic pages.
The main proposal here is to treat these things like Wikipedia disambiguation pages: we have items, but the items are mainly about the page, not about any real-world concept we care about.
Thanks Markus, I like that a lot :)
-- daniel
Dear all,
A notable side effect :
At the moment, a page can contain multiple topic in few languages only At the moment, those page are "adopted" by one of the topic
It means that from now on, some pages that contains multiple topics in some languages will be attached to new Q** and it will appear in the language bar as if there is no equivalent in those languages
We should probably think about the long story too
Mohamed
On Tue, Sep 9, 2014 at 1:50 PM, Daniel Kinzler daniel.kinzler@wikimedia.de wrote:
Am 09.09.2014 13:36, schrieb Markus Krötzsch:
My proposal became more clear to me over lunch:
For articles that are really about multiple different things that cannot
be
reconciled in a single natural concept:
- State "intance of:Wikipedia article with multiple topics" (we already
have
several other classes of Wikipedia articles).
- Use some property, say "has topic", to link to items about the
individual topics.
- Optionally: use a property like "subject of" (P805) to link back from
the
individual items to the multi-topic pages.
The main proposal here is to treat these things like Wikipedia
disambiguation
pages: we have items, but the items are mainly about the page, not about
any
real-world concept we care about.
Thanks Markus, I like that a lot :)
-- daniel
-- Daniel Kinzler Senior Software Developer
Wikimedia Deutschland Gesellschaft zur Förderung Freien Wissens e.V.
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
The language links should then go to either the composite item, or the specitic item, if only one exits for a language.
What, if there are both? If the page itself is specific, choose specific one, else use the composite one.
That may intrduce some foggyness because links are unequal and you don#t know beforehand to which class the taget belngs, but in the end we should be as precise as we can get and having as many generally usaeful links as possible would benefot users most, imho.
Purodha
"Innovimax SARL" innovimax@gmail.com writes:
Dear all,
A notable side effect : At the moment, a page can contain multiple topic in few languages onlyAt the moment, those page are "adopted" by one of the topic
It means that from now on, some pages that contains multiple topics in some languages will be attached to new Q** and it will appear in the language bar as if there is no equivalent in those languages We should probably think about the long story too Mohamed On Tue, Sep 9, 2014 at 1:50 PM, Daniel Kinzler daniel.kinzler@wikimedia.de wrote:Am 09.09.2014 13:36, schrieb Markus Krötzsch:
My proposal became more clear to me over lunch:
For articles that are really about multiple different things that cannot be reconciled in a single natural concept:
- State "intance of:Wikipedia article with multiple topics" (we already have
several other classes of Wikipedia articles).
- Use some property, say "has topic", to link to items about the individual topics.
- Optionally: use a property like "subject of" (P805) to link back from the
individual items to the multi-topic pages.
The main proposal here is to treat these things like Wikipedia disambiguation pages: we have items, but the items are mainly about the page, not about any real-world concept we care about.
Thanks Markus, I like that a lot :)
-- daniel
-- Daniel Kinzler Senior Software Developer
Wikimedia Deutschland Gesellschaft zur Förderung Freien Wissens e.V.
_______________________________________________ Wikidata-l mailing list Wikidata-l@lists.wikimedia.org[Wikidata-l@lists.wikimedia.org] https://lists.wikimedia.org/mailman/listinfo/wikidata-l
-- Innovimax SARL Consulting, Training & XML Development 9, impasse des Orteaux 75020 Paris Tel : +33 9 52 475787 Fax : +33 1 4356 1746 http://www.innovimax.fr%5Bhttp://www.innovimax.fr] RCS Paris 488.018.631 SARL au capital de 10.000 €_______________________________________________ Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l%5Bhttps://lists.wiki...]
Me again. After some coffee and digesting the edit that Eric made to address the Samoan Clipper issue, I can see several (better?) alternatives to my first proposal. This also takes into account some comments of James, Mohamed, and Purodha.
I can see three patterns to solve such issues:
== (1) Main concept + sub-concept ==
Situation: There is a "main" concept, but it is closely connected to another concept (such as a particularly notable event). Most Wikipedias have both described in one article.
Example: The plane "Samoan Clipper" and the crash of that plane.
Solution: Keep all Wikipedias connected to the "main" concept (the plane) and create a new items for the subconcept (the crash) that is not linked to any Wikipedia. It can be somewhat arbitrary what we pick as the main concept in cases like this. This is what Emw did on https://www.wikidata.org/wiki/Q7409943.
== (2) Two first-class concepts ==
Situtation: There are several distinct concepts, even if some Wikipedias only have one article that includes a section on the other.
Example: "The Beatles" and "Paul McCartney".
Solution: Keep distinct items with distinct language links. Try to decide for each Wikipedia with only one article whether it is more about the one or about the other concept.
== (3) Several first-class concepts + multi-topic page(s) ==
Situation: Several Wikipedias have distinct articles for two distinct concepts, but in some Wikipedias there is only one article that combines the two.
Example: Wangerooge (island?, municipality?, both?)
Solution 1: Select a "main concept" for each article involved and link it only to this -> pattern (2).
Solution 2: Create a third item in the way that I suggested earlier (a page for the combined item that is linked to but distinct from the individual items).
== What's the best pattern now? ==
Of the above patterns, (1) has the most interlanguage connections. (2) has a medium amount of connections, and (3) has the least amount of connections. There is a kind of natural transition from (1) (split only inside Wikidata) to (2) (split visible in Wikipedias) to (3) (split and combinations visible in Wikipedias). It's the natural transition from coarse-grained/unified to fine-grained/diverse presentation in Wikipedia(s).
Considering all this, I would use my earlier suggestion ((3) with solution 2) only in exceptional cases. Indeed, Edward asked his original question since he saw a problem when trying to add *more* language links in a situation that applied pattern (2) so far. My suggestion would not fix this issue at all but rather reduce the linking further. I think (3) really only makes sense in cases where any assignment of a "main concept" would seem unacceptable for some article.
The main thing we have to take care about when splitting into several items is to ensure that we don't import statements from Wikipedia into the wrong item. Despite the obvious concern that an item should not be a plane and a crash at the same time, it would also be really bad to have two items about the same crash (one specific on the crash only and one combined with crash and plane) -- even counting the plane crashes on Wikidata would lead to misleading results then.
== What to do with Wangerooge? ==
It seems to me that (2) is already optimal in this situation from a Wikidata viewpoint. To get additional language links displayed in English, you could manually add them to the article (yes, I agree that this has other issues). Also, I don't know if you can have language links to multiple pages on the same other Wikipedia using manual links (which seems what Edward was trying to do by connecting the same enwiki article to multiple items).
Cheers,
Markus
On 09.09.2014 13:50, Daniel Kinzler wrote:
Am 09.09.2014 13:36, schrieb Markus Krötzsch:
My proposal became more clear to me over lunch:
For articles that are really about multiple different things that cannot be reconciled in a single natural concept:
- State "intance of:Wikipedia article with multiple topics" (we already have
several other classes of Wikipedia articles).
- Use some property, say "has topic", to link to items about the individual topics.
- Optionally: use a property like "subject of" (P805) to link back from the
individual items to the multi-topic pages.
The main proposal here is to treat these things like Wikipedia disambiguation pages: we have items, but the items are mainly about the page, not about any real-world concept we care about.
Thanks Markus, I like that a lot :)
-- daniel
While on this concept of modelling Wikidata items for multi-concept Wikipedia pages, I would like to remind you of another case, in which cross-project spam is deleted in only a subset of language Wikipedias, leaving a few links or just the Wikidata item. I would like there to be some trace of the previously existing Wikipedia links, that each link to the deletion discussion in the associated Wikipedia. Is this possible?
On Tue, Sep 9, 2014 at 5:13 PM, Markus Krötzsch < markus@semantic-mediawiki.org> wrote:
Me again. After some coffee and digesting the edit that Eric made to address the Samoan Clipper issue, I can see several (better?) alternatives to my first proposal. This also takes into account some comments of James, Mohamed, and Purodha.
I can see three patterns to solve such issues:
== (1) Main concept + sub-concept ==
Situation: There is a "main" concept, but it is closely connected to another concept (such as a particularly notable event). Most Wikipedias have both described in one article.
Example: The plane "Samoan Clipper" and the crash of that plane.
Solution: Keep all Wikipedias connected to the "main" concept (the plane) and create a new items for the subconcept (the crash) that is not linked to any Wikipedia. It can be somewhat arbitrary what we pick as the main concept in cases like this. This is what Emw did on https://www.wikidata.org/wiki/Q7409943.
== (2) Two first-class concepts ==
Situtation: There are several distinct concepts, even if some Wikipedias only have one article that includes a section on the other.
Example: "The Beatles" and "Paul McCartney".
Solution: Keep distinct items with distinct language links. Try to decide for each Wikipedia with only one article whether it is more about the one or about the other concept.
== (3) Several first-class concepts + multi-topic page(s) ==
Situation: Several Wikipedias have distinct articles for two distinct concepts, but in some Wikipedias there is only one article that combines the two.
Example: Wangerooge (island?, municipality?, both?)
Solution 1: Select a "main concept" for each article involved and link it only to this -> pattern (2).
Solution 2: Create a third item in the way that I suggested earlier (a page for the combined item that is linked to but distinct from the individual items).
== What's the best pattern now? ==
Of the above patterns, (1) has the most interlanguage connections. (2) has a medium amount of connections, and (3) has the least amount of connections. There is a kind of natural transition from (1) (split only inside Wikidata) to (2) (split visible in Wikipedias) to (3) (split and combinations visible in Wikipedias). It's the natural transition from coarse-grained/unified to fine-grained/diverse presentation in Wikipedia(s).
Considering all this, I would use my earlier suggestion ((3) with solution 2) only in exceptional cases. Indeed, Edward asked his original question since he saw a problem when trying to add *more* language links in a situation that applied pattern (2) so far. My suggestion would not fix this issue at all but rather reduce the linking further. I think (3) really only makes sense in cases where any assignment of a "main concept" would seem unacceptable for some article.
The main thing we have to take care about when splitting into several items is to ensure that we don't import statements from Wikipedia into the wrong item. Despite the obvious concern that an item should not be a plane and a crash at the same time, it would also be really bad to have two items about the same crash (one specific on the crash only and one combined with crash and plane) -- even counting the plane crashes on Wikidata would lead to misleading results then.
== What to do with Wangerooge? ==
It seems to me that (2) is already optimal in this situation from a Wikidata viewpoint. To get additional language links displayed in English, you could manually add them to the article (yes, I agree that this has other issues). Also, I don't know if you can have language links to multiple pages on the same other Wikipedia using manual links (which seems what Edward was trying to do by connecting the same enwiki article to multiple items).
Cheers,
Markus
On 09.09.2014 13:50, Daniel Kinzler wrote:
Am 09.09.2014 13:36, schrieb Markus Krötzsch:
My proposal became more clear to me over lunch:
For articles that are really about multiple different things that cannot be reconciled in a single natural concept:
- State "intance of:Wikipedia article with multiple topics" (we already
have several other classes of Wikipedia articles).
- Use some property, say "has topic", to link to items about the
individual topics.
- Optionally: use a property like "subject of" (P805) to link back from
the individual items to the multi-topic pages.
The main proposal here is to treat these things like Wikipedia disambiguation pages: we have items, but the items are mainly about the page, not about any real-world concept we care about.
Thanks Markus, I like that a lot :)
-- daniel
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
2014-09-09 13:36 GMT+02:00 Markus Krötzsch markus@semantic-mediawiki.org:
My proposal became more clear to me over lunch:
*** cut ***
Markus, "yuo are of genius!" (cit.) :)
/me deletes straight away his own proposal
L.
For articles that are really about multiple different things that cannot be reconciled in a single natural concept:
- State "intance of:Wikipedia article with multiple topics" (we already
have several other classes of Wikipedia articles).
- Use some property, say "has topic", to link to items about the
individual topics.
- Optionally: use a property like "subject of" (P805) to link back from
the individual items to the multi-topic pages.
Can we make do without annotation statements like "instance of: Wikipedia page with multiple topics"? In my opinion, such statements would unnecessarily clutter a significant portion of our items and would be better inferred by the presence of *subject of* (P805) claims. I think it's better to reserve *instance of* for talk about the essence of the subject itself.
The closest inverse property for* subject of* is probably *facet of* (P1269).
Example: https://en.wikipedia.org/wiki/Samoan_Clipper
See https://www.wikidata.org/wiki/Q7409943 for an initial pass at modelling that.
Note how that Wikipedia page says "The aircraft developed an engine problem (caused by an oil leak)", which ultimately caused the in-flight explosion. We currently have no generic way to model causes. Coincidentally enough, I just posted a detailed/long-winded proposal to address that. Please see https://www.wikidata.org/wiki/Property_talk:P828#A_better_way_to_model_causa... and give any feedback there!
Cheers, Eric
On Tue, Sep 9, 2014 at 7:36 AM, Markus Krötzsch < markus@semantic-mediawiki.org> wrote:
My proposal became more clear to me over lunch:
For articles that are really about multiple different things that cannot be reconciled in a single natural concept:
- State "intance of:Wikipedia article with multiple topics" (we already
have several other classes of Wikipedia articles).
- Use some property, say "has topic", to link to items about the
individual topics.
- Optionally: use a property like "subject of" (P805) to link back from
the individual items to the multi-topic pages.
The main proposal here is to treat these things like Wikipedia disambiguation pages: we have items, but the items are mainly about the page, not about any real-world concept we care about.
Example: https://en.wikipedia.org/wiki/Samoan_Clipper
It says "Samoan Clipper was one of ten Pan American Airways Sikorsky S-42 flying boats" but it includes an infobox that lists "fatalities". So the article describes both a specific airplane (the flying boat) and an event (crash of that plane). We should not try to invent a new concept of "machine-event system" to capture this, but have two items for the two things we have here.
We will have many cases where this is not necessary if we can find a natural "composite concept" that it makes sense to talk about. In these case, we will use different properties for the links (for example, a country article may sometimes be used to describe all the federal states of that country, yet we have a good way of linking individual state items to the country). As usual, there will be corner cases where it is not clear what to do; then we need specific discussions on these cases.
Cheers,
Markus
On 09.09.2014 11:57, Markus Krötzsch wrote:
On 09.09.2014 11:33, Daniel Kinzler wrote:
Am 09.09.2014 01:40, schrieb Denny Vrandečić:
Create a third item in Wikidata, and use that for the language links. Any Wikipedia that has two separate articles can link to the separate items, any Wikipedia that has only one article can link to the single item.
That's a nice solution for the language link problem, but modelling the relationship of these three items on wikidata is kind of annoying/tricky. How would you do that?
Before the "how?" should come the "why?". The modelling should be chosen so that it best suits a given purpose (the purpose is the benchmark for deciding if a particular modelling approach is "good" or not). I guess the main thing we want to achieve here is to link the combined item to and from the single items. If this is true, then the "how?" question is basically a "which property to use?" question.
For this we should look more closely at the nature of the combined item. Let's distinguish "combined items" that are natural and meaningful concepts from those that are just different topics combined for editorial reasons in one article. The first kind of item involves things like bands (who have members, possibly with individual articles, but which are still meaningful concepts by themselves). The second kind of item involves the Wangerooge hybrid, but also many other things (e.g., plane crashes and the planes themselves; or people and events the people where involved in).
The problem with these second type of complex item is that it does not give you a good basis for adding data (you can't say properly which aspects of the thing you are talking about). It is also problematic since these things are not natural concepts that can be considered to have certain "joint properties". Rather, they are a kind of editorial trick to organise information in Wikipedia articles. For this reason, I would suggest to have a property for making links in this case that clearly refers to Wikipedia. Like "is a list of" or the item for "Wikipedia disambiguation page". I would avoid using properties that are normally used with real concepts, such as "has part" (which would make sense for "bands" -> "band member", but not for "Wangerooge hybrid" -> "Wangerooge island" (it's not a part since the former is not a proper concept to start with).
With such a editorial property, one could then also create items for parts that don't have Wikipedia article so as to be able to add data that would be confusing/wrong in the combined item.
Markus
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
On 09.09.2014 14:23, Emw wrote: ...
Example: https://en.wikipedia.org/wiki/Samoan_Clipper
See https://www.wikidata.org/wiki/Q7409943 for an initial pass at modelling that.
Thanks, that's a huge improvement over the previous state (=me looking at the article and giving up on adding anything to Wikidata ;-).
Note how that Wikipedia page says "The aircraft developed an engine problem (caused by an oil leak)", which ultimately caused the in-flight explosion. We currently have no generic way to model causes. Coincidentally enough, I just posted a detailed/long-winded proposal to address that. Please see https://www.wikidata.org/wiki/Property_talk:P828#A_better_way_to_model_causa... and give any feedback there!
Another challenging topic. Can't offer any meaningful comment now.
Markus
What are the implications of this for sidebar links ?
IMO it's a good thing if the wikidata items become more fine-grained and more conceptually precise.
But wouldn't this mean we would be losing (some) sidebar links, so people wouldn't necessarily know any more that some of the information they're reading might be available in a language they might prefer.
A classic case is where some wikis have separate "concept" and "list" articles for a thing, whereas others have "list with concept" articles. This often gives rise to disputes as to what should be interwiki'd to what.
(OT: should a "list with concept"-style item ever be acceptable as a target of P301 'category's main topic' ?)
So: should the sidebar of the wiki page in a language that has a separate article for the concept also include a new section "related article" that could eg present "list with concept" articles from languages with no direct concept <-> concept equivalents.
Similarly one wiki might include an article on an individual artist; but in other languages the best information on that artist might be presented in an article on the family of the artist; or a small tight-knit group of related artists.
Would a mechanism that could offer such articles from languages where there was no direct equivalence be worth considering, if we start plumbing in properties like "has topic" ?
-- James.
On 09/09/2014 12:36, Markus Krötzsch wrote:
My proposal became more clear to me over lunch:
For articles that are really about multiple different things that cannot be reconciled in a single natural concept:
- State "intance of:Wikipedia article with multiple topics" (we already
have several other classes of Wikipedia articles).
- Use some property, say "has topic", to link to items about the
individual topics.
- Optionally: use a property like "subject of" (P805) to link back from
the individual items to the multi-topic pages.
The main proposal here is to treat these things like Wikipedia disambiguation pages: we have items, but the items are mainly about the page, not about any real-world concept we care about.
Example: https://en.wikipedia.org/wiki/Samoan_Clipper
It says "Samoan Clipper was one of ten Pan American Airways Sikorsky S-42 flying boats" but it includes an infobox that lists "fatalities". So the article describes both a specific airplane (the flying boat) and an event (crash of that plane). We should not try to invent a new concept of "machine-event system" to capture this, but have two items for the two things we have here.
We will have many cases where this is not necessary if we can find a natural "composite concept" that it makes sense to talk about. In these case, we will use different properties for the links (for example, a country article may sometimes be used to describe all the federal states of that country, yet we have a good way of linking individual state items to the country). As usual, there will be corner cases where it is not clear what to do; then we need specific discussions on these cases.
Cheers,
Markus
On 09.09.2014 11:57, Markus Krötzsch wrote:
On 09.09.2014 11:33, Daniel Kinzler wrote:
Am 09.09.2014 01:40, schrieb Denny Vrandečić:
Create a third item in Wikidata, and use that for the language links. Any Wikipedia that has two separate articles can link to the separate items, any Wikipedia that has only one article can link to the single item.
That's a nice solution for the language link problem, but modelling the relationship of these three items on wikidata is kind of annoying/tricky. How would you do that?
Before the "how?" should come the "why?". The modelling should be chosen so that it best suits a given purpose (the purpose is the benchmark for deciding if a particular modelling approach is "good" or not). I guess the main thing we want to achieve here is to link the combined item to and from the single items. If this is true, then the "how?" question is basically a "which property to use?" question.
For this we should look more closely at the nature of the combined item. Let's distinguish "combined items" that are natural and meaningful concepts from those that are just different topics combined for editorial reasons in one article. The first kind of item involves things like bands (who have members, possibly with individual articles, but which are still meaningful concepts by themselves). The second kind of item involves the Wangerooge hybrid, but also many other things (e.g., plane crashes and the planes themselves; or people and events the people where involved in).
The problem with these second type of complex item is that it does not give you a good basis for adding data (you can't say properly which aspects of the thing you are talking about). It is also problematic since these things are not natural concepts that can be considered to have certain "joint properties". Rather, they are a kind of editorial trick to organise information in Wikipedia articles. For this reason, I would suggest to have a property for making links in this case that clearly refers to Wikipedia. Like "is a list of" or the item for "Wikipedia disambiguation page". I would avoid using properties that are normally used with real concepts, such as "has part" (which would make sense for "bands" -> "band member", but not for "Wangerooge hybrid" -> "Wangerooge island" (it's not a part since the former is not a proper concept to start with).
With such a editorial property, one could then also create items for parts that don't have Wikipedia article so as to be able to add data that would be confusing/wrong in the combined item.
Markus
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l