Property talk:P8590
Documentation
article identifier in online version of 1906 Jewish Encyclopedia
List of violations of this constraint: Database reports/Constraint violations/P8590#Format, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P8590#Single value, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P8590#Entity types
List of violations of this constraint: Database reports/Constraint violations/P8590#Scope, SPARQL
This property is being used by: Please notify projects that use this property before big changes (renaming, deletion, merge with another property, etc.) |
Importing and matching[edit]
@LimaMario: Before Wikidata existed, there was some extensive work trying to match en-wiki articles to the JE, at pages like en:Wikipedia:Jewish_Encyclopedia_topics/A1.
Would it make sense to try to harvest these linkages somehow? As a start, Petscan could be used to gather the wikilinks from each page and find their Wikidata Q-numbers, eg [1]. These could then be matched back to each line-entry on the Wikipage. For efficency, I think one might be able to make a PagePile list of the JE tracking pages, then gather the links from all of them at once.
Some issues:
- The links to the JE are in the JE's original URL format, eg http://www.jewishencyclopedia.com/view.jsp?letter=A&artid=4 . I think that it's reasonable possible to write a script to see where the JE site now redirects these to.
- Some of the links point to redirects. I think these could be addressed by getting a list of pages that are redirects from PetScan, then looking up onward links from that list of pages.
- Some of the links will point to dab pages. This is easy enough to pick up at the Wikidata stage. Maybe then just removes the JE id from such Wikidata items, and leaves them to Mix'n'match.
- Some of the links may point to pages with a rather more general scope, eg "History of the Jews in X" -> X. I think these are probably reasonably harmless, but it's maybe a good idea to try to include a subject named as (P1810) qualifier to indicate just what title the JE article has.
Do you have any plans for something like this? Jheald (talk) 13:52, 18 September 2020 (UTC)
- @Jheald: I've done some biographies because they were easiest to match with place of birth (P19) and date of birth (P569). I also plan to do capital cities because articles follows the pattern which is quite easy to process via regex. For example:
- Budapest – The capital of Hungary.
- Copenhagen – The capital of Denmark.
- Munich – Capital of Bavaria, Germany.
- Northampton – Capital of Northamptonshire, England.
- Other topics require lot of manual work.
- @Jheald: There is a little mess in this data, e.g.
- FEZ is linked with fez (Q189862) instead of Fez (Q80985)
- LAIBACH is linked with Laibach (Q325359) instead of Ljubljana (Q437)
- QUEBEC is linked with Quebec (Q176) instead of Quebec City (Q2145)
- It takes a lot of effort to clean it up.
- United States of America-related properties
- All Properties
- Properties with external-id-datatype
- Properties used on 1000+ items
- Properties with format constraints
- Properties with single value constraints
- Properties with unique value constraints
- Properties with entity type constraints
- Properties with scope constraints