[Cross-posting from the Wikidata chat]
Hi everyone,
Following some feedback from Azertus (thanks!), I collected statistics on the most frequent Web domains that occur in Discogs [1] and MusicBrainz [2]. Some of them look like candidates for new identifier properties, while others appear only because the match against existing properties failed, mainly due to inconsistencies in URL match pattern (P8966), format as a regular expression (P1793), and formatter URL (P1630) values.
You can have a look at them here [3].
It would be great to gather thoughts on the next steps. Two main questions:
1. Should we go for a property proposal for each of the candidates?
2. What's the best way to fix URL match pattern (P8966), format as a regular expression (P1793), and formatter URL (P1630) values, so that next time we can convert URLs to proper identifiers?
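To make the second question more concrete, here is a minimal Python sketch of how the three values could interact when turning a URL into an identifier. Everything in it is an assumption for illustration: the example.org catalog, the three patterns, and the function names are hypothetical and are not taken from Wikidata or from soweego.

```python
import re
from typing import Optional

# Hypothetical property values, for illustration only (not copied from Wikidata).
URL_MATCH_PATTERN = r"^https?://(?:www\.)?example\.org/artist/(\d+)(?:[/?#].*)?$"  # plays the role of P8966
FORMAT_REGEX = r"[1-9]\d*"  # plays the role of P1793
FORMATTER_URL = "https://www.example.org/artist/$1"  # plays the role of P1630


def url_to_identifier(url: str) -> Optional[str]:
    """Return the identifier captured from the URL, or None if the conversion fails."""
    match = re.match(URL_MATCH_PATTERN, url, flags=re.IGNORECASE)
    if match is None:
        # The URL match pattern does not cover this URL shape.
        return None
    identifier = match.group(1)
    # The captured value must also satisfy the format regex;
    # if the two patterns disagree, the conversion fails here.
    if re.fullmatch(FORMAT_REGEX, identifier) is None:
        return None
    return identifier


def identifier_to_url(identifier: str) -> str:
    """Rebuild a canonical URL by substituting the identifier for $1."""
    return FORMATTER_URL.replace("$1", identifier)
```

In this sketch, a URL such as http://example.org/artist/12345/ converts cleanly, while https://www.example.org/artist/0 is captured by the URL pattern but rejected by the format regex. That is the kind of inconsistency I mean above.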
Cheers,
Marco
[1] https://www.discogs.com/
[2] https://musicbrainz.org/
[3] https://meta.wikimedia.org/wiki/Grants:Project/Hjfocs/soweego_2/Timeline#Jul...