Wikidata:Requests for permissions/Bot/SIOAbot
The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Approved--Ymblanter (talk) 18:26, 28 October 2021 (UTC)[reply]
SIOAbot[edit]
SIOAbot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: Fuzheado (talk • contribs • logs)
Task/s: This bot is designed to add authority control/external identifiers to Wikidata items corresponding to Smithsonian Institution (US) objects namely:
Code: Bot has been implemented on PAWS/Jupyter - Public notebook code can be found here.
Function details:
Since the launch of SI's Open Access program in 2020, a new identifier scheme has been launched, and the two properties:
- Smithsonian resource ID (P7851) - ex. saam_1911.4.1
- Smithsonian ARK ID (P9473) - ex. k7d0817311-3a88-4af7-8bc5-745395507a83
The bot works by querying existing Smithsonian items in Wikidata that have identifiers specific to those holding units (eg. Smithsonian American Art Museum, Cooper-Hewitt Design Museum, et al) and crafts a new identifier for Smithsonian resource ID (P7851) by using the holding unit abbreviation and the accession number of the object, and adds that statement. It then looks up the ARK identifier from the Smithsonian database, and then adds Smithsonian ARK ID (P9473) to Wikidata. These should be rather uncontroversial as they are identifiers and do not affect any modeling or other claims in the item.
Safeguards/quality checks in place:
- The bot will check the crafted Smithsonian resource ID (P7851) identifier for validity (Example: saam_1911.4.1) against the Smithsonian API.
- The bot will verify the returned Smithsonian ARK ID (P9473) from the Smithsonian API, or visit the Smithsonian object page for finding the ARK ID.
- If either Smithsonian resource ID (P7851) or Smithsonian ARK ID (P9473) are set already in the Wikidata item, nothing will be added. This is to honor the best practice of deferring to hand-editing over bot editing if possible.
More info about the identifiers for all Smithsonian units can be found at Wikidata:GLAM/Smithsonian_Institution/Structure
Examples and tests:
- An example set of edits to a single item can be seen at this diff
- A set of 50 items (100 edits) can be seen at this list
--Fuzheado (talk) 17:12, 3 August 2021 (UTC)[reply]
- FYI the code link is slightly broken. It's [1]. I skimmed the code and it seems reasonable but I don't think I have enough contextual knowledge of the properties/APIs to comment on this task. BrokenSegue (talk) 18:18, 9 August 2021 (UTC)[reply]
- Thanks for that! Fixed. - Fuzheado (talk) 19:03, 18 August 2021 (UTC)[reply]