Wikidata:WikidataCon 2017/Notes/Wikidata + Wiktionary: lexicographical data for everyone
Title: Wikidata + Wiktionary: lexicographical data for everyone
Speaker(s)
Name or username: Lydia Pintscher
Abstract
Wikidata now covers general-purpose data about a huge number of concepts in the world in order to support Wikipedia, its sister projects and projects outside Wikimedia. Work is also underway to support data about multimedia files on Wikimedia Commons. One large field that we still want to cover is lexicographical data, that is, data about words. In this talk you'll learn about the current state of the project, the opportunities that lexicographical data in Wikidata opens up, especially for small languages, and how you can help.
Collaborative notes of the session
Wiktionary
Each language edition uses its own templates and structures
Not machine readable
Few editors
66 of 149 were not active in the last 3 months (a comparison with other sister projects would be interesting)
On Stack Overflow, plenty of people ask "how to parse Wiktionary"
Few tools available
Exchange of data on words...
Dictionary applications
dict.leo.org
Write language lessons
Research on language
http://wikidata-lexeme.wmflabs.org/index.php/Main_Page
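To illustrate the point in the notes that each Wiktionary uses its own templates and is therefore not machine readable, here is a minimal Python sketch. The two wikitext snippets are simplified examples written for this illustration, not copied from any real page, and the function names are hypothetical. The sketch only shows that an extractor written for one layout finds nothing in the other, which is the kind of problem structured lexicographical data is meant to avoid.

```python
import re

# Simplified, illustrative wikitext snippets (assumptions, not real page content).
# One edition marks the part of speech with a plain-text heading,
# the other wraps it in a template.
EN_STYLE = "==English==\n===Noun===\n{{en-noun}}\n# A domesticated feline."
FR_STYLE = "== {{langue|fr}} ==\n=== {{S|nom|fr}} ===\n# Félin domestique."


def part_of_speech_heading_style(wikitext):
    """Read the part of speech from a plain-text level-3 heading."""
    match = re.search(r"^===\s*([A-Za-z ]+?)\s*===\s*$", wikitext, re.MULTILINE)
    return match.group(1).lower() if match else None


def part_of_speech_template_style(wikitext):
    """Read the part of speech from the first argument of an S template."""
    match = re.search(r"\{\{S\|([^|}]+)\|", wikitext)
    return match.group(1) if match else None


# Each extractor works only on the layout it was written for.
print(part_of_speech_heading_style(EN_STYLE))
print(part_of_speech_heading_style(FR_STYLE))
print(part_of_speech_template_style(FR_STYLE))
```

Every additional language edition would need its own extractor of this kind, and each one breaks whenever the local templates change.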
Questions / Answers
Under which license will contributions to this project be released?
Done :)
thank you :)
Undecided question. Lydia's opinion is that Wikidata being under CC0 has brought a lot of good things: people reuse the data. If we want the data to be used, Lydia believes it should be CC0, but that needs to be discussed.
Would it be possible to explore another "ontology", based on "vocables" (concrete segments of discourse transcription) rather than "lexemes" (language elements abstracted under an inflectional paradigm), in parallel with this project?
Well, too late, but if anyone could ask the question and report the answer here, that would be very kind.