User:Fnielsen/Autolists/Datasets
< User:Fnielsen | Autolists
Dataset used in works.
This list is periodically updated by a bot. Manual changes to the list will be removed on the next update!
WDQS | PetScan | TABernacle | Find images | Recent changes | Query:select DISTINCT ?item where { ?work wdt:P4510 ?item . ?item wdt:P31/wdt:P279* wd:Q1172284 . }
OWL ontology[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Simple Knowledge Organization System | https://www.w3.org/TR/skos-reference/ | |||||||
The Data Cube vocabulary | https://www.w3.org/TR/vocab-data-cube/ | http://purl.org/linked-data/cube |
Wiktionary language edition[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
German Wiktionary | German | https://de.wiktionary.org/ | ||||||
English Wiktionary | 2002-12-12 | English multiple languages |
Creative Commons Attribution-ShareAlike 3.0 Unported | https://en.wiktionary.org/ |
bibliographic database[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Web of Science | en:WoS | 2016 1997 |
English | Bibliographic Scan of Digital Scholarly Communication Infrastructure | https://clarivate.com/products/web-of-science/ | |||
MEDLINE | 1966 | https://www.nlm.nih.gov/medline/index.html | https://www.nlm.nih.gov/bsd/medline.html https://www.nlm.nih.gov/databases/databases_oldmedline.html | |||||
Web of Knowledge | http://wokinfo.com/ http://www.isiwebofknowledge.com/ |
|||||||
CINAHL | 1961 | https://www.ebsco.com/products/research-databases/cinahl-database | ||||||
Crossref | 2000 | Bibliographic Scan of Digital Scholarly Communication Infrastructure Open Science Thesaurus |
https://www.crossref.org/ | |||||
Embase | ||||||||
PsycINFO | http://www.apa.org/psycinfo/ | |||||||
CNKI | 1996 | https://www.cnki.net/ | ||||||
OpenAlex | 2022-01-03 | American English | Open Science Thesaurus | https://openalex.org/ | https://blog.ourresearch.org/openalex-update-june/ https://openalex.org/about |
biological database[edit]
chemical database[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
PubChem | English | PubChem in 2021: new data content and improved web interfaces The Bioregistry Nucleic Acids Research (NAR) database |
free content | http://pubchem.ncbi.nlm.nih.gov | ||||
ChEMBL | The ChEMBL database in 2017 The Bioregistry |
Creative Commons Attribution-ShareAlike 3.0 Unported | https://www.ebi.ac.uk/chembl/ http://www.ebi.ac.uk/chembl |
|||||
GNPS | English | Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking | https://gnps.ucsd.edu/ |
clinical trials registry[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
ClinicalTrials.gov | English | http://www.clinicaltrials.gov | ||||||
International Clinical Trials Registry Platform | 2005 | https://www.who.int/ictrp |
data set[edit]
database[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Cochrane Library | https://www.cochranelibrary.com/ | |||||||
OpenCitations Corpus | en:OCC | The varying openness of digital open science tools | Creative Commons CC0 License | http://opencitations.net/corpus | ||||
SciGraph | 2017 | https://scigraph.springernature.com/explorer | https://www.springernature.com/gp/researchers/scigraph | |||||
AACT Database | https://www.ctti-clinicaltrials.org/aact-database | |||||||
ClinWiki | MIT License | https://www.clinwiki.org/ | ||||||
GeoDanmark | https://www.geodanmark.dk | |||||||
National Inpatient Sample | ||||||||
Det Centrale Ordregister | da:COR | Danish | https://ordregister.dk/ |
digital library[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Wikisource | 2003-11-24 | https://wikisource.org/ | ||||||
Project Gutenberg | 1971-07-04 | multiple languages | Unlicense | https://gutenberg.org | ||||
PubMed Central | en:PMC | English | Open Science Thesaurus The varying openness of digital open science tools |
http://www.ncbi.nlm.nih.gov/pmc/ https://www.ncbi.nlm.nih.gov/pmc/ |
||||
HathiTrust | 2008 | Bibliographic Scan of Digital Scholarly Communication Infrastructure | https://www.hathitrust.org/ | https://tapor.ca/tools/1461 https://marketplace.sshopencloud.eu/tool-or-service/VUsxa0 |
free and open-source software[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Open Science Framework | en:OSF | Bibliographic Scan of Digital Scholarly Communication Infrastructure Open Science Thesaurus The varying openness of digital open science tools |
Apache License, Version 2.0 | https://osf.io | https://tapor.ca/tools/742 https://marketplace.sshopencloud.eu/tool-or-service/ROkULj | |||
QLever | QLever: A Query Engine for Efficient SPARQL+Text Search | Apache License, Version 2.0 |
free software[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
World Atlas of Language Structures | en:WALS | 2008 | Creative Commons Attribution 4.0 International | http://wals.info | ||||
Wikibase | GNU General Public License, version 2.0 or later | https://wikiba.se/ |
graph database[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Blazegraph | awesome RDF github page | GNU General Public License, version 2.0 proprietary license |
https://www.blazegraph.com/ https://blazegraph.com/ |
|||||
Stardog | awesome RDF github page OntoCommons Report D4.3 |
proprietary license | https://www.stardog.com |
image database[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
CBCL Face Database | http://cbcl.mit.edu/software-datasets/FaceData2.html | http://www.ai.mit.edu/courses/6.899/lectures/faces.tar.gz | ||||||
imSitu | Situation Recognition: Visual Semantic Role Labeling for Image Understanding | http://imsitu.org/ | https://s3.amazonaws.com/my89-frame-annotation/public/of500_images.tar |
image dataset[edit]
knowledge base[edit]
knowledge graph[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Artificial Intelligence Knowledge Graph | AI-KG: An Automatically Generated Knowledge Graph of Artificial Intelligence | |||||||
CaLiGraph | http://caligraph.org/ |
knowledge graph of science[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Microsoft Academic Graph | 2015-06-05 | English | https://www.microsoft.com/en-us/research/project/microsoft-academic-graph/ | |||||
Open Research Knowledge Graph | en:ORKG | http://orkg.org/ |
lexical database[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
WordNet | 1998 | English | WordNet: An Electronic Lexical Database WordNet: a lexical database for English |
BSD licenses | https://wordnet.princeton.edu/ | |||
FrameNet | 1997 | English | FrameNet: Theory and Practice | https://framenet.icsi.berkeley.edu/fndrupal/ | https://framenet.icsi.berkeley.edu/fndrupal/WhatIsFrameNet | |||
VerbNet | English | https://verbs.colorado.edu/verbnet/ | ||||||
NorthEuraLex | Creative Commons Attribution-ShareAlike 4.0 International | http://northeuralex.org/ |
oncology[edit]
online database[edit]
open-access repository[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
CiteSeer | http://citeseer.ist.psu.edu | |||||||
Figshare | 2011-01-12 | Bibliographic Scan of Digital Scholarly Communication Infrastructure Open Science Thesaurus The varying openness of digital open science tools Directory of Open Access Preprint Repositories |
https://figshare.com/ | https://tapor.ca/tools/1045 https://marketplace.sshopencloud.eu/tool-or-service/mdEbYT |
open-source software[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Virtuoso Universal Server | awesome RDF github page OntoCommons Report D4.3 |
GNU General Public License, version 2.0 proprietary license |
https://virtuoso.openlinksw.com/ | |||||
Apache Jena Fuseki | Apache License, Version 2.0 | https://jena.apache.org/documentation/fuseki2/index.html |
organization[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
World Register of Marine Species | en:WoRMS | 2008 | English | Creative Commons Attribution 4.0 International | https://www.marinespecies.org | |||
Orphanet | 1997 | English French Spanish German Italian Portuguese Dutch Polish |
Representation of rare diseases in health information systems: the Orphanet approach to serve a wide range of end users The Bioregistry |
Creative Commons Attribution-NoDerivs 3.0 Unported | https://orpha.net |
question-answering dataset[edit]
semantic network[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
GermaNet | German | http://www.sfs.uni-tuebingen.de/GermaNet/ | ||||||
ConceptNet | https://www.conceptnet.io/ |
software[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
BabelNet | multiple languages | https://babelnet.org/ | ||||||
BridgeDb | Providing gene-to-variant and variant-to-gene database identifier mappings to use with BridgeDb mapping services The BridgeDb framework: standardized access to gene, protein and metabolite identifier mapping services |
https://www.bridgedb.org/ https://bridgedb.github.io/ |
text corpus[edit]
trait database[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
AmphiBIO | ||||||||
TRY | 2007 | TRY - a global database of plant traits | http://www.try-db.org/ |
treebank[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Penn Treebank | English | https://catalog.ldc.upenn.edu/ldc99t42 | ||||||
Hamburg Dependency Treebank | German |
video streaming service[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
YouTube | en:YT | 2005-02-14 | multiple languages | Lentapedia Bibliographic Scan of Digital Scholarly Communication Infrastructure |
end-user license agreement | https://www.youtube.com/ | ||
PlayStation Now | 2014 | https://www.playstation.com/ps-now |
voice dataset[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
Common Voice | 2017-06-19 | multiple languages | Common Voice: A Massively-Multilingual Speech Corpus | Creative Commons CC0 License | https://commonvoice.mozilla.org/ | |||
LibriSpeech | Librispeech: An ASR corpus based on public domain audio books | Creative Commons Attribution 4.0 International | ||||||
VoxPopuli | VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation | https://github.com/facebookresearch/voxpopuli | ||||||
VoxLingua107 | VoxLingua107: a Dataset for Spoken Language Recognition | http://bark.phon.ioc.ee/voxlingua107/ |
website[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
PubMed | en:PM | 1997 | English | Nucleic Acids Research (NAR) database | https://pubmed.ncbi.nlm.nih.gov/ https://pmlegacy.ncbi.nlm.nih.gov |
|||
LibraryThing | 2005-08-29 | LibraryThing: A Review | https://librarything.com | |||||
DNA Data Bank of Japan | Creative Commons Attribution 2.1 Japan | http://www.ddbj.nig.ac.jp/ | ||||||
ScienceDirect | 1997-03 | English | The Serials Librarian The varying openness of digital open science tools |
https://www.sciencedirect.com/ | ||||
Media Cloud | 2009 | https://mediacloud.org | ||||||
Semantic Scholar | Semantic Scholar. Bibliographic Scan of Digital Scholarly Communication Infrastructure |
https://www.semanticscholar.org | ||||||
Nextstrain | https://nextstrain.org/ | |||||||
Dimensions | 2018-01-15 2014 |
English | Bibliographic Scan of Digital Scholarly Communication Infrastructure The varying openness of digital open science tools |
https://app.dimensions.ai/discover/publication https://www.dimensions.ai |
word analogy dataset[edit]
word net[edit]
artikel | short name | inception | language of work or name | described by source | copyright license | official website | URL | described at URL |
---|---|---|---|---|---|---|---|---|
plWordNet | 2005 | Polish | BSD licenses | http://plwordnet.pwr.wroc.pl | ||||
DanNet | 2009 | Danish | DanNet: the challenge of compiling a wordnet for Danish by reusing a monolingual dictionary | MIT License Creative Commons Attribution 4.0 International |
http://www.wordnet.dk/ https://cst.ku.dk/projekter/dannet/ |
http://www.wordnet.dk/owl/instance/ | ||
Arabic WordNet | 2006 | Arabic | The Use of Arabic WordNet in Arabic Information Retrieval | |||||
Chinese WordNet | da:CWN | Constructing chinese wordnet: Design principles and implementation | ||||||
MultiWordnet of Portuguese | en:MWN.PT | Portuguese | ||||||
KeNet | Turkish | Constructing a WordNet for Turkish Using Manual and Automatic Annotation | http://haydut.isikun.edu.tr/kenet.html | |||||
odenet | de:odenet | German | https://ikum.mediencampus.h-da.de/projekt/open-de-wordnet-initiative/ |
word similarity dataset[edit]
Misc[edit]
End of automatically generated list.