Wikidata:Property proposal/member of lexicon
member of lexicon
[edit]Originally proposed at Wikidata:Property proposal/Lexemes
Description | A property which marks a lexeme (in this case usually a single word) which is accepted as being a member of a standard lexicon, or word list. |
---|---|
Data type | Item |
Domain | lexemes (typically single words which are not proper nouns, abbreviations, acronyms, etc.) |
Example 1 | abrupts → SOWPODS (Q17082091) |
Example 2 | apple → Official Tournament and Club Word List (Q7079577) |
Example 3 | beu → L'Officiel du jeu Scrabble (Q3204447) |
Planned use | For checking whether a given word is valid in a particular lexicon, in the context of word games and word lookup software. |
Expected completeness | eventually complete (Q21873974) |
Robot and gadget jobs | If accepted, I plan to write a bot that can add the property to the appropriate lexeme entries, or add new lexemes if necessary. |
Motivation
[edit]This property will be useful in several situations, such as in word games and spell checking apps. It will enable a free content source for checking words' validity.
Unlike with dictionaries, where an ID property can be created (e.g. Oxford English Dictionary entry ID (pre-July 2023) (P5275)), word lists have no IDs to use, so simply flagging acceptance in a given lexicon is more appropriate.
Using Wikidata items as the lexicon/word list values (as shown in the examples above) ensures they are consistently named (to avoid errors during lookups and accurate stats) and have corresponding linked Wikipedia articles. Qualifiers can be used to specify the edition/year in which a word was added to a lexicon.
A bot will be written which will add the property, check its use, add/suggest new lexemes if necessary, and generate statistics, so correctness and completeness should be possible.
Sorry if there's already a property that would be appropriate for this application! Thanks - odg (talk) 01:23, 22 July 2020 (UTC)
Discussion
[edit]- Comment is the selection of this or that word not protected by database law? If it is, I think that such a property has no place here. Pamputt (talk) 05:42, 22 July 2020 (UTC)
- Membership of a lexicon is verifiable from publicly available sources, in the same way membership of various dictionaries is. Dictionary IDs are allowed on Wikidata, as they are simply a reference to an external source and no nonfree copyrighted definition is copied. Common words themselves are not copyrightable, and most already exist on Wikidata. I'm not sure if word lists are copyrightable anyway, due to not containing any definitions and thus having no originality, but that's beside the point. We wouldn't be reproducing word lists verbatim. - odg (talk) 09:33, 22 July 2020 (UTC)
- Support Assuming this is not a copyright problem. ArthurPSmith (talk) 18:57, 22 July 2020 (UTC)
- Comment Do Scrabble word lists map cleanly to lexemes or their forms/senses? While like Arthur I'd ordinarily be fine with it assuming copyright issues could be gotten past, I wonder about the placement of this property within a lexeme item (especially with regard to homograph lexemes and forms). Mahir256 (talk) 16:22, 23 July 2020 (UTC)
- Oppose Why don't you do ?apart of (P361)?c, where ?cinstance of (P31)lexicon (Q8096) or ?cinstance of (P31)dictionary (Q23622)? --Tinker Bell ★ ♥ 06:49, 15 October 2020 (UTC)
- Comment would this proposal apply also for assigning lexemes to certain "Wordschatz" level, as in, for example, "belonging to list of words, defined by Goethe Institut for B1 comprehension level" ? (see also this discussion) or would in this case suggestion by User:Tinker Bell also apply? --62mkv (talk) 12:15, 20 December 2020 (UTC)
Not done Opposition expressed and not responded to, stale. JesseW (talk) 19:09, 18 March 2021 (UTC)