Hi all,

I just noticed that we have a number of "orphaned items" which were created and imported from some Wikipedia article that then got deleted. The result is an item with almost no data, no sitelinks, and all references claiming "imported from X Wikipedia".

Example:
https://www.wikidata.org/wiki/Q9386774

Here is what happened:
https://www.wikidata.org/w/index.php?title=Q9386774&action=history

It would be good to have a process for dealing with such cases. I am not saying that we must delete such items immediately, but it seems obvious that they need some special attention to become self-sustaining even without Wikipedia articles associated.

Things that would be important to keep such items:
* Links to other external datasets that confirm the existence of the thing.
* Links to authoritative web sites that confirm the existence of the thing.
* Proper references for all data (we always want that, but here it's even more critical: "imported from Wikipedia" is never great, but at least it leaves some hope of finding proper references if the Wikipedia page still exists).

In cases like the above, deletion seems to be the most reasonable solution (the little data that is there can easily be added again if needed in the future). It seems that one could automatically collect such candidates for deletion (pages that are not used as property values, have no site links, have no identifier properties, were not edited since more than a month, an have less than, say, ten properties+labels+descriptions).

Regards,

Markus

_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata

Reply via email to