[Cross-posting from the Wikidata chat]
Hi everyone,
Following some feedback from Azertus (thanks!), I collected statistics on the most frequent Web domains that occur in Discogs [1] and MusicBrainz [2]. It looks like some of them may be candidates for identifier property creation, while others stem from a failed match against known properties, mainly due to inconsistencies in the URL match pattern (P8966), format as a regular expression (P1793), and formatter URL (P1630) values.
You can have a look at them here [3].
It would be great to gather thoughts on the next steps. Two main questions:
1. Should we go for a property proposal for each of the candidates?
2. What's the best way to fix URL match pattern (P8966), format as a regular expression (P1793), and formatter URL (P1630) values, so that the next time we can convert URLs to proper identifiers?
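To make the second question concrete, here is a minimal Python sketch of how a URL could be converted to an identifier with the three kinds of values, and how an inconsistency between them could be detected. The pattern, regex, and formatter strings below are hypothetical examples written for illustration, not the actual statements on any Wikidata property, and the function names are made up for this sketch rather than taken from soweego.

```python
import re
from typing import Optional

# Hypothetical values for illustration only; the real statements on Wikidata may differ.
URL_MATCH_PATTERN = r"^https?://(?:www\.)?discogs\.com/artist/([1-9][0-9]*)"  # P8966-style
FORMAT_REGEX = r"[1-9][0-9]*"  # P1793-style
FORMATTER_URL = "https://www.discogs.com/artist/$1"  # P1630-style, $1 is the identifier


def url_to_identifier(url: str) -> Optional[str]:
    """Return the identifier captured from the URL, or None if it does not match."""
    match = re.search(URL_MATCH_PATTERN, url)
    if match is None:
        return None
    identifier = match.group(1)
    # Cross-check the captured value against the format regex.
    if re.fullmatch(FORMAT_REGEX, identifier) is None:
        return None
    return identifier


def identifier_to_url(identifier: str) -> str:
    """Build a URL by substituting the identifier into the formatter URL."""
    return FORMATTER_URL.replace("$1", identifier)


def is_consistent(identifier: str) -> bool:
    """Check that formatter URL, match pattern, and format regex round-trip."""
    return url_to_identifier(identifier_to_url(identifier)) == identifier
```

Under these assumptions, a URL such as `https://www.discogs.com/artist/12345` yields the identifier `12345`, while a URL from another domain yields `None`. A round-trip check like `is_consistent` is one possible way to spot properties whose three values disagree with each other.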
Cheers,
Marco
[1] https://www.discogs.com/
[2] https://musicbrainz.org/
[3] https://meta.wikimedia.org/wiki/Grants:Project/Hjfocs/soweego_2/Timeline#July_2021
We already have properties for most of these links? I'm not sure what you're asking as I have little knowledge of the context of the situation...
lectrician1,
On Thu, Jul 29, 2021 at 8:56 AM Marco Fossati <fossati@spaziodati.eu> wrote:
Dear Seth,
The short answer is yes. For more details, you can have a look at the discussion in the Wikidata chat: https://www.wikidata.org/wiki/Wikidata:Project_chat#URLs_statistics_for_Disc...
Cheers,
Marco
On 8/1/21 3:24 AM, Seth Deegan wrote: