Wikidata:WikiProject Duplicates/VIAF members
This page contains a list of Listeria-lists of violations of distinct-values constraint (Q21502410) for some Wikidata property for authority control by VIAF member (Q55586529) and gives suggestions for the solution of these violations; it also given suggestions for future imports of Wikidata property for authority control by VIAF member (Q55586529) from Virtual International Authority File (Q54919).
Solutions for violations[edit]
Each case of violation of distinct-values constraint (Q21502410) has three possible ways of solution:
Situation | Action | Scheme | Guideline |
---|---|---|---|
duplicate items: the items clearly represent the same entity |
|
Help:Merge | |
conflated item(s): one of the items (or both) contains statements and/or IDs about more than one entity |
|
Help:Conflation | |
conflated ID(s): the items are OK, but the ID itself contains statements and/or IDs about more than one entity |
|
Wikidata:Data round-tripping |
See commons:Category:Conflations and duplications in Wikidata schemes for all the schemes (including the ones of violations of single-value constraint (Q19474404)).
Best practices for imports[edit]
This paragraphs collects from past discussions[1] some suggestions for the improvement of the quality of future imports of Wikidata property for authority control by VIAF member (Q55586529) from Virtual International Authority File (Q54919):
Action | Reason | Field |
---|---|---|
leave out VIAF clusters containing a Wikidata ID but not-linked from Wikidata through VIAF ID (P214) | if the link is absent in Wikidata, the probability of a clusterization error is higher | VIAFs to be considered |
leave out VIAF clusters which are a deprecated VIAF ID (P214) value in Wikidata | the clusterization error is nearly certain | VIAFs to be considered |
leave out non-personal VIAF clusters (i.e. corporate, geographic, works) | the clusterization errors are significantly more frequent than in personal clusters | VIAFs to be considered |
avoid adding IDs which are already present in the item (as deprecated) or in other items; instead, list all these cases in a table for manual checks | the probability of a clusterization error is significant, but these cases can also help solving previous errors made in Wikidata | IDs to be added |
compose references to the added IDs in the following way: stated in (P248)Wikidata property for authority control by VIAF member (Q55586529) + VIAF ID (P214)source cluster + retrieved (P813)date of retrieval of VIAF | this complete reference makes future checks on the evolution of VIAF clusterization much easier | IDs to be added |
use some sort of comparison between the data of the added ID (mainly name and dates) and the data in the corresponding Wikidata item (mainly label in the same language, date of birth (P569), date of death (P570)) | this check, although difficult, would significantly reduce the number of wrong IDs added due to clusterization errors | IDs to be added |
avoid using VIAF clusters as references for dates (date of birth (P569), date of death (P570)), using VIAF member IDs instead | VIAF clusters are unstable (clusterization may change), so the verifiability is difficult; moreover, in general using the direct source (the VIAF member) is always better than using a derived source (the VIAF cluster) | statements to be added |
Lists of violations[edit]
Listeria-lists of unique-value constraint violations to be emptied:
See also User:Difool/viaf already somewhere.