Wikidata:Property proposal/TMDB company ID
Jump to navigation
Jump to search
TMDB company ID
[edit]Originally proposed at Wikidata:Property proposal/Authority control
Description | identifier for a company at The Movie Database |
---|---|
Represents | The Movie Database (Q20828898) |
Data type | External identifier |
Domain | company (Q783794) |
Allowed values | [1-9]\d* |
Example 1 | Lucasfilm (Q242446)TMDB company ID1 |
Example 2 | United Artists Corporation (Q219400)TMDB company ID60 |
Example 3 | Stanley Kramer Productions (Q17386674)TMDB company ID893 |
Source | https://www.themoviedb.org/ |
Expected completeness | always incomplete (Q21873886) |
Formatter URL | https://www.themoviedb.org/company/$1/ |
Notified participants of WikiProject Movies
Motivation
[edit]TMDb is a crowd-sourced website, whose data and API are reused by applications. This would be useful because it would open up TMDb company to Wikidata item referencing, and allow expansion of other external identifiers and metadata when researching.
Methodology
[edit]- Obtain dumps of TMDB and Wikidata production companies with their respective IDs
- For each company on TMDB and Wikidata, download a complete set of media IDs for that company
- Match the company names using a fuzzy string compare method
- Further refine matches by comparing the overlap of media IDs between TMDB and Wikidata companies
- Fuzzy label how good a match is
External links
[edit]- Example results: https://github.com/rohfle/tmdb-wikidata-company-matching/blob/main/result_2023-05-10.csv
- Source code: https://github.com/rohfle/tmdb-wikidata-company-matching
Discussion
[edit]Support
Don't ping me NMaia (talk) 20:01, 5 May 2023 (UTC)
Support Matthias M. (talk) 21:49, 13 May 2023 (UTC)
- @Rohfle, Matthias M.:
Done: TMDB company ID (P11806). --Horcrux (talk) 12:53, 2 June 2023 (UTC)