Property talk:P5187

From Wikidata
Jump to navigation Jump to search

Documentation

Representsword stem (Q210523)
Data typeMonolingual text
Example
According to this template:
  • 行う (ja) → "行" (ja), "おこな" (ja-Hira)
  • 美しい (ja) → "美し" (ja), "うつくし" (ja-Hira)
  • 綺麗だ (ja) → "綺麗" (ja), "きれい" (ja-Hira)
  • 率いる (ja) → "率" (ja), "ひき" (ja-Hira)
  • する (ja)no value
  • Lexeme:L47 → aim
According to statements in the property:
no label (L830) → not applicable
no label (L11440)
おこな
no label (L5112)美し
うつくし
When possible, data should only be stored as statements
See alsoconjugation class (P5186), combines lexemes (P5238)
Lists
Proposal discussionProposal discussion
Current uses
Total145,876
Main statement145,773>99.9% of uses
Qualifier103<0.1% of uses
[create Create a translatable help page (preferably in English) for this property to be included here]
Allowed entity types are Wikibase lexeme (Q51885771): the property may only be used on a certain entity type (Help)
List of violations of this constraint: Database reports/Constraint violations/P5187#Entity types, hourly updated report
Scope is as main value (Q54828448): the property must be used by specified way only (Help)
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P5187#Scope, SPARQL
Lexical category: verb (Q24905), adjective (Q34698): this property should only be applied to lexemes with these lexical categories
Exceptions are possible as rare values may exist. Exceptions can be specified using exception to constraint (P2303).
List of violations of this constraint: Database reports/Constraint violations/P5187#lexical category
This property is being used by:

Please notify projects that use this property before big changes (renaming, deletion, merge with another property, etc.)

Inconsistency between the languages
Language of the stem should be the same as the language of the main lemma. (Help)
Violations query: SELECT * WHERE { ?item wikibase:lemma ?lemma ; wdt:P5187 ?stem . BIND ( SUBSTR(LANG(?lemma),1,2) AS ?langLemma ) BIND ( SUBSTR(LANG(?stem),1,2) AS ?langStem ) FILTER ( ( ?langLemma != ?langStem ) && ( ?langLemma != 'pn' ) && ( ?langLemma != 'pa' ) && ( ?langLemma != 'hi' ) && ( ?langLemma != 'ur' ) && ( ?langLemma != 'tg' ) && ( ?langLemma != 'fa' ) ) }
List of this constraint violations: Database reports/Complex constraint violations/P5187#Inconsistency between the languages

Format: "porta", "port-a", or "port[a]"[edit]

Which form should be used on Lexeme:L1642 (Latin portare). I think an indication of the part of the stem that is replaced in conjugation would be helpful. I used "port-a" for now.
--- Jura 11:04, 27 May 2018 (UTC)[reply]

@Jura1: first a detail but this property is limited to Japanese. I would be preferable to change that before using it on non-Japanese verb.
Then, good question. Port-a could fit. But wouldn't it be better to use multiple values? Here it would be port and porta (which are the perfect stem and the present stem, if I recall correctly my Latin, that can be indicated in qualifier).
Cdlt, VIGNERON (talk) 13:55, 2 June 2018 (UTC)[reply]

Link missing[edit]

@Jura1, VIGNERON: This property was proposed with the motivation: "needed to determine the manner of conjugation of Japanese verbs and adjectives". If we use it for word stems in general how can I create a link from Assoziationsüberschuss (Lexeme:L2169) to:

--Zitatesammler (talk) 13:28, 2 June 2018 (UTC)[reply]

@Zitatesammler: I wouldn't use it for « word stems in general », but only of « stem of a verb in general » (and it's already debatable).
Anyhow, this property is a monolingual string so it's not possible to link to lexemes. For your case, I think it's not really stems but compounds and there is a proposal for that Wikidata:Property proposal/compound of.
Cdlt, VIGNERON (talk) 13:47, 2 June 2018 (UTC)[reply]

Extension to all verbs[edit]

@Okkn, Jura1, Fnielsen, Şêr, Kareyac: (top-5 users of this property)

Any objection to extend this property to all verbs? (and maybe all lexemes?)

At least, I suggest to add non-Japanese examples (that can be taken among the WhatLinksHere).

Cheers, VIGNERON (talk) 15:24, 13 June 2018 (UTC)[reply]

@VIGNERON: I think there is no problem to use this property on non-Japanese lexemes. In addition, I have added this property not only to Japanese verbs, but also to Japanese adjectival nouns (such as Lexeme:L2454). --Okkn (talk) 15:41, 13 June 2018 (UTC)[reply]
@VIGNERON:  Support in general. Can it confict with derived from lexeme (P5191) ? - Kareyac (talk) 16:28, 13 June 2018 (UTC)[reply]

✓ Done documentation changed Special:Diff/695698956. Cdlt, VIGNERON (talk) 14:23, 14 June 2018 (UTC)[reply]

Multiple values[edit]

Hi y'all,

@Fnielsen, Okkn, Mitrolayzing, Liamjamesperritt, Circeus, So9q: @Shlomo, Adrijaned, Iniquity, Pamputt: (top-10 users of this property)

Some case are simple and there is only one stem; but what is the best way to use this property when there is multiple values?

Here some examples I found :

I came here after a discussion with Uziel302 about Lexemes like ales (L256931) where he put the information in the representation of each form. At least it specific but it's a bit strange, it doesn't feel to me to be the right place.

What do you think? any idea, comment, remark?

Cheers, VIGNERON (talk) 15:40, 1 April 2020 (UTC)[reply]

I think we should have stem both on lexeme level for the usual case and on the form level for cases of different stems for different forms. Uziel302 (talk) 16:02, 1 April 2020 (UTC)[reply]
I fixed patrino (L270299). It has only one stem "patrin" at least as I understand stems from the wikipedia page about them.--So9q (talk) 16:55, 1 April 2020 (UTC)[reply]
Hi @So9q, VIGNERON in Esperanto (and some other languages, I suppose) every part of word can be a stem (Q111029)/root (Q210523) (-in- -> ino / ina / ineto etc.). In my opinion it is worth to mentioning individual morphemes/roots (maybe there is a better tool for it?). Wiktionaries for example: eo: patrino, en: patrino, pl: patrino.
In this case, "the main root/stem" is patr- (or patrin- ?), I agree with that. However, there are also compound words like vapor-o-ŝip-o, or reĝ-id-in-o and reĝ-in-id-o. Unfortunately I don't know what to do with words like these (maybe something like: root (P5920), combines lexemes (P5238)?) --ThelmOSO (talk) 21:20, 1 April 2020 (UTC)[reply]
@ThelmOSO: yes, in almost all languages very various part of word *can be* a stem; but that doesn't mean it *actually is* When you describe a specific form of a word, there is one and only one stem (I don't know any exception) and usually a lexemes also have only one stem (but here there is exception, this is exactly what this discussion is about). Thanks @So9q: for the fix, remark: I would have used combines lexemes (P5238) and not derived from lexeme (P5191) (and no, root (P5920) is something entirely different, it's closer to root, not stem ; sadly English only has 2 words for that so it's a bit confusing, in French we have 3 words : "racine", "radical" and "thème" ).
Cdlt, VIGNERON (talk) 08:48, 2 April 2020 (UTC)[reply]
@VIGNERON: I agree :) I meant to use combines, fixed.--So9q (talk) 20:59, 7 April 2020 (UTC)[reply]
 Comment actually 3 stems in زبان تاجیکی/забони тоҷикӣ/zaboni toçikī (L230362) are not from different scripts, they are from different words in the phrase. Is it appropriate use? --Infovarius (talk) 02:26, 3 April 2020 (UTC)[reply]
@Infovarius: thanks for noticing, I look too quickly and thought تاجیکستان/Тоҷикистон/Toçikiston (L220599) and زبان تاجیکی/забони тоҷикӣ/zaboni toçikī (L230362) where in the same situation but indeed the second one is not the same. I don't know Tajik but isn't the solution to move this data to the property combines lexemes (P5238)? (a bit like for the Esperanto, qv. supra). @Mitrolayzing: could you give some clarification here? Cheers, VIGNERON (talk) 14:58, 3 April 2020 (UTC)[reply]
Agree, combines lexemes (P5238) is better. --Infovarius (talk) 22:12, 4 April 2020 (UTC)[reply]

Before I add that statement on this for equivalent property (P1628) ... I think this is the same as http://www.lexinfo.net/ontology/2.0/lexinfo#radical but just double-checking with others here since "radical" that I noticed as an alias for French, and currently "radical" is a messy linguistic term in Latin-based languages, so I'm not 100% sure of the equivalent nature until I can get someone's "yes".

@Thadguidry: sorry for the late reply. It seems that this Lexinfo property is not the equivalent of word stem (P5187) but of radical (P5280). Cheers, VIGNERON (talk) 06:34, 15 August 2022 (UTC)[reply]
@VIGNERON Ah! Thanks, I've added it to radical (P5280) Thadguidry (talk) 16:20, 15 August 2022 (UTC)[reply]