Wikidata:WikiProject Tabular data


Commons tabular data offers an alternative to Wikidata that can be especially useful for time series, numerical data, or data that are only available under a CC-BY or CC-BY-SA licence. To make these data more usable, we need to link them from Commons and standardize them using Wikidata conventions and identifiers.


Comparison between Wikidata and tabular data on Commons

Subject | Wikidata items | Tabular data
Format | Wikibase | JSON
Comment on format | Relatively complex and expressive. Each statement can have an arbitrary number of qualifiers and sources of various types. | Lightweight.
Suitable for | Can deal with complex, heterogeneous data. | Easier to deal with for harmonized time series or standardized data.
Data structure | Semantic structure: 1 concept = 1 item, with various links between items. | Organized by file. There can be several files on the same topic, with different sources, timescales, etc.
Indexing and search | Search engine on Wikidata, and powerful search functionalities using SPARQL. | Data can be hard to find when they are not linked or documented from another page. The Commons Data namespace supports neither categorization nor template-style documentation.
Use of data on wikis | Data can be retrieved using wiki mark-up, or various templates that have been locally developed. However, some statements have rather intricate or hackish data structures, giving unexpected results when no ad hoc template has been developed. | Data are simpler and easy to use, provided the data structure is known.
External use | Dumps and SPARQL endpoints allow various external reuses. | Data can be downloaded.
Human readability | Statements are multilingual. However, items can be long and messy, and the exact meaning of the various properties may not always be clear to the casual reader. | Clean, spreadsheet-like pages, but the content itself might sometimes be obscure in the absence of complete documentation.
Manual editing | Interactive editor. | Editing the JSON source code.
Multilingualism | Native multilingualism for "item"-type data. | Texts can be translated, but only file by file. Commons identifiers can be used to automate translation on the client site.
Risk of vandalism | Moderate. Edit summaries and history are precise, but the diversity of the data and the large number of edits can make real-time monitoring difficult. Risk of well-intentioned but counterproductive edits. | Probably low. The data have low visibility, and formal constraints prevent hasty edits.
Bots and tools | Large community, varied tools. | Nothing for now?
Licence | CC0 (equivalent to public domain). | CC0, CC Attribution, or CC Attribution-ShareAlike.

Properties


Up-to-date list via SPARQL: https://w.wiki/ZC9
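The exact query behind the short link is not reproduced here. As a rough sketch of what such a listing could look like, the snippet below builds a request URL for the public Wikidata query service that asks for every property whose datatype is tabular data. The query text, the `ENDPOINT` constant and the `query_url` helper are illustrative assumptions, not the project's own query.

```python
from urllib.parse import urlencode

# Public Wikidata SPARQL endpoint.
ENDPOINT = "https://query.wikidata.org/sparql"

# Assumed query: properties whose datatype is "tabular data".
QUERY = """
SELECT ?property ?propertyLabel WHERE {
  ?property wikibase:propertyType wikibase:TabularData .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" . }
}
ORDER BY ?propertyLabel
"""


def query_url(query=QUERY):
    """Return a GET URL that asks the endpoint for JSON results."""
    return ENDPOINT + "?" + urlencode({"query": query.strip(), "format": "json"})


if __name__ == "__main__":
    # Only prints the URL; fetching it is left to the reader.
    print(query_url())
```

The URL can be pasted into a browser or fetched with any HTTP client; the snippet itself makes no network request.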

Potential Wikipedia use cases


Data structure


Data linked from the same property should usually have similar data structures. When possible, the names of the fields should contain a Wikidata identifier for machine-readability.
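One possible reading of this convention is sketched below: each field name carries a Wikidata property identifier as a suffix, so that a script can recover it with a regular expression. The table content, the `year_P585` / `population_P1082` naming pattern and the `field_identifiers` helper are assumptions made for illustration (the values are made-up placeholders); the project does not prescribe this exact scheme.

```python
import re

# Hypothetical content of a Commons tabular data page.
# Field names and values are placeholders, not real data.
table = {
    "license": "CC0-1.0",
    "description": {"en": "Example population series (placeholder values)"},
    "schema": {
        "fields": [
            {"name": "year_P585", "type": "number", "title": {"en": "Year"}},
            {"name": "population_P1082", "type": "number", "title": {"en": "Population"}},
        ]
    },
    "data": [[2000, 1000], [2010, 1200]],
}

# A Wikidata property (P...) or item (Q...) identifier that is not
# glued to other letters or digits.
WIKIDATA_ID = re.compile(r"(?<![A-Za-z0-9])[PQ][1-9][0-9]*(?![A-Za-z0-9])")


def field_identifiers(table):
    """Map each field name to the Wikidata identifier it contains, if any."""
    result = {}
    for field in table["schema"]["fields"]:
        match = WIKIDATA_ID.search(field["name"])
        result[field["name"]] = match.group(0) if match else None
    return result


if __name__ == "__main__":
    print(field_identifiers(table))
```

Keeping the identifier as a suffix leaves the name readable for humans while letting tools map the column back to a Wikidata property.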

Demographic data


tabular population (P4179)
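As a minimal sketch of how a value of this property might be consumed, the snippet below turns a page title into a raw-content URL on Commons and pairs each data row with the field names declared in the schema. It assumes the property value is a page title of the form `Data:….tab`; the title `Data:Example population.tab`, the field names and the helper functions are hypothetical.

```python
import json
from urllib.parse import quote


def raw_url(page_title):
    """Build a URL returning the raw JSON source of a Commons Data: page."""
    # action=raw is the generic MediaWiki way to get a page's source text.
    title = quote(page_title.replace(" ", "_"), safe=":/")
    return "https://commons.wikimedia.org/w/index.php?action=raw&title=" + title


def rows_as_dicts(table_json):
    """Pair every data row with the field names declared in the schema."""
    table = json.loads(table_json)
    names = [field["name"] for field in table["schema"]["fields"]]
    return [dict(zip(names, row)) for row in table["data"]]


if __name__ == "__main__":
    # Placeholder document standing in for a downloaded page.
    sample = json.dumps({
        "schema": {"fields": [{"name": "year", "type": "number"},
                              {"name": "population", "type": "number"}]},
        "data": [[2000, 1000], [2010, 1200]],
    })
    print(raw_url("Data:Example population.tab"))
    print(rows_as_dicts(sample))
```

The snippet performs no network request; downloading the URL and passing the response text to `rows_as_dicts` is left to the caller.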


Participants


The participants listed below can be notified using the following template in discussions:
{{Ping project|Tabular data}}

See also
