Wikidata:WikidataCon 2017/Notes/Wikidata + Wiktionary: lexicographical data for everyone

From Wikidata
Jump to navigation Jump to search

Title: Wikidata + Wiktionary: lexicographical data for everyone

Speaker(s)[edit]

Name or username: Lydia Pintscher

Abstract[edit]

Wikidata now covers general purpose data about a huge number of concepts in the world in order to support Wikipedia, its sister projects and projects outside Wikimedia. Work is also underway to support data about multimedia files on Wikimedia Commons. One large field that we still want to cover is lexicographical data - data about words. In this talk you'll learn about the current state of the project, the opportunities lexicographical data in Wikidata opens up especially for small languages and how you can help.

Collaborative notes of the session[edit]

Wiktionary

Each language using its own templates and structures

Not machine readable

Few editors

66 of 149 were not active the last 3 months (comparing with other sister projects would be interesting)

On StackOverflow, plenty of people ask « how to parse Wiktionary »

Few tools available

Exchange of data on words...

Dictionary applications

dict.leo.org

Write language lessons

Research on language

http://wikidata-lexeme.wmflabs.org/index.php/Main_Page

Questions / Answers[edit]

Under which license will be released contributions to this project?

Done :)

thank you :)

Undecided question. Lydia’s opinion is that Wikidata being under CC0 has brought a lot of good things: people reuse data. If we want data to be used, Lydia believes it should be CC0 − but that needs to be discussed. could it be possible to have an exploration of an other "ontology", based on "vocable" (concrete segments of discourse transcription) rather than "lexems" (language elements abstracted under flexionnal paradigm), in parallel with this project?

well, too late, but if anyone can go ask the question and make a report here it would be very kind