Property talk:P8590

From Wikidata
Jump to navigation Jump to search

Documentation

Jewish Encyclopedia ID
article identifier in online version of 1906 Jewish Encyclopedia
[create Create a translatable help page (preferably in English) for this property to be included here]
Format “[1-9]\d*: value must be formatted using this pattern (PCRE syntax). (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P8590#Format, SPARQL
Single value: this property generally contains a single value. (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P8590#Single value, SPARQL
Distinct values: this property likely contains a value that is different from all other items. (Help)
List of violations of this constraint: Database reports/Constraint violations/P8590#Unique value, hourly updated report, SPARQL (every item), SPARQL (by value)
Allowed entity types are Wikibase item (Q29934200): the property may only be used on a certain entity type (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P8590#Entity types
Scope is as main value (Q54828448), as reference (Q54828450): the property must be used by specified way only (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P8590#Scope, SPARQL

Please notify projects that use this property before big changes (renaming, deletion, merge with another property, etc.)

Importing and matching[edit]

@LimaMario: Before Wikidata existed, there was some extensive work trying to match en-wiki articles to the JE, at pages like en:Wikipedia:Jewish_Encyclopedia_topics/A1.

Would it make sense to try to harvest these linkages somehow? As a start, Petscan could be used to gather the wikilinks from each page and find their Wikidata Q-numbers, eg [1]. These could then be matched back to each line-entry on the Wikipage. For efficency, I think one might be able to make a PagePile list of the JE tracking pages, then gather the links from all of them at once.

Some issues:

  • The links to the JE are in the JE's original URL format, eg http://www.jewishencyclopedia.com/view.jsp?letter=A&artid=4 . I think that it's reasonable possible to write a script to see where the JE site now redirects these to.
  • Some of the links point to redirects. I think these could be addressed by getting a list of pages that are redirects from PetScan, then looking up onward links from that list of pages.
  • Some of the links will point to dab pages. This is easy enough to pick up at the Wikidata stage. Maybe then just removes the JE id from such Wikidata items, and leaves them to Mix'n'match.
  • Some of the links may point to pages with a rather more general scope, eg "History of the Jews in X" -> X. I think these are probably reasonably harmless, but it's maybe a good idea to try to include a subject named as (P1810) qualifier to indicate just what title the JE article has.

Do you have any plans for something like this? Jheald (talk) 13:52, 18 September 2020 (UTC)[reply]

@Jheald: I've done some biographies because they were easiest to match with place of birth (P19) and date of birth (P569). I also plan to do capital cities because articles follows the pattern which is quite easy to process via regex. For example:
Other topics require lot of manual work.
@Jheald: There is a little mess in this data, e.g.
It takes a lot of effort to clean it up.