[Cross-posting from the Wikidata chat]

Hi everyone,

Following some feedback by Azertus (thanks!), I collected statistics on the most frequent Web domains that occur in Discogs [1] and MusicBrainz [2]. It looks like some of them may be candidates for identifier property creation, while others stem from a failed match against known properties, mainly due to inconsistencies in URL match pattern (P8966), format as a regular expression (P1793), and formatter URL (P1630) values.

You can have a look at them here [3].

It would be great to gather thoughts on the next steps.
Two main questions:
1. should we go for a property proposal for each of the candidates?
2. what's the best way to fix URL match pattern (P8966), format as a regular expression (P1793), and formatter URL (P1630) values, so that the next time we can convert URLs to proper identifiers?

Cheers,

Marco

[1] https://www.discogs.com/
[2] https://musicbrainz.org/
[3] https://meta.wikimedia.org/wiki/Grants:Project/Hjfocs/soweego_2/Timeline#July_2021
_______________________________________________
Wikidata mailing list -- wikidata@lists.wikimedia.org
To unsubscribe send an email to wikidata-le...@lists.wikimedia.org

Reply via email to