Wiktionary:Votes/2019-10/Application of idiomaticity rules to hyphenated compounds

Application of idiomaticity rules to hyphenated compounds
Voting on: Clarifying the application of CFI idiomaticity rules to hyphenated compounds.

Please vote separately on Option 1 and Option 2 in the respective sections below. Please vote on either or both. In the presumably unlikely event that both options pass, neither will be automatically implemented.

Schedule:
 * Vote starts: 00:00, 7 November 2019 (UTC)
 * Vote ends: 23:59, 6 December 2019 (UTC)
 * Vote created: Mihia (talk) 22:48, 31 October 2019 (UTC)

Discussion:
 * [[Image:Wikt rei-artur3.svg|20px]] Beer_parlour
 * [[Image:Wikt rei-artur3.svg|20px]] Wiktionary talk:Votes/2019-10/Application of idiomaticity rules to hyphenated compounds

Option 1
Insert the following words into the "Idiomaticity" section of the CFI:

Idiomaticity rules apply to hyphenated compounds in the same way as to spaced phrases. For example, wine-lover, green-haired, harsh-sounding and ex-teacher are all excluded as they mean no more than the sum of their parts, while green-fingered and good-looking are included as idiomatic.

Support

 * 1)  ←₰-→  Lingo Bingo Dingo (talk)  07:54, 7 November 2019 (UTC)
 * 2)   --Lambiam 07:46, 8 November 2019 (UTC)
 * 3)  Stelio (talk) 10:26, 8 November 2019 (UTC)
 * 4) . Canonicalization (talk) 13:17, 9 November 2019 (UTC)
 * 5)  Equinox ◑ 00:57, 10 November 2019 (UTC)
 * 6)  — SGconlaw (talk) 08:43, 11 November 2019 (UTC)
 * 7) . -  TheDaveRoss  14:28, 21 November 2019 (UTC)
 * 8)  Pppery (talk) 16:41, 23 November 2019 (UTC)
 * 9) . Andrew Sheedy (talk) 17:18, 27 November 2019 (UTC)

Oppose

 * 1) Weakly -  too strict a standard to document the natural state of the English language in a dictionary? Geographyinitiative (talk) 01:32, 8 November 2019 (UTC)
 * 2)  At the very least, wording about how WT:COALMINE applies to hyphenated phrases too should be added to the proposal. However, I think that hyphenation, at least in English orthography, can often show that collocations have came to be perceived as unitary phrases to a sufficient degree that their inclusion is justified. Of course, this doesn't stop ad-hoc hyphenations from slipping in, but this may be able to be handled (e.g. by having higher attestation requirements for hyphenated phrases) and doesn't necessarily justify reducing hyphenated collocations from being reduced to the same status as unhyphenated ones. --Hazarasp (parlement · werkis) 10:50, 9 November 2019 (UTC)
 * Not a full rebuttal of your point, but a remark: the OED stopped distinguishing hyphens versus spaces in compound terms some years ago, IIRC. If there was consistent usage once, there isn't now. Equinox ◑ 00:59, 10 November 2019 (UTC)
 * 1) . DonnanZ (talk) 10:14, 2 December 2019 (UTC)

Abstain

 * 1)  for now as proposer. Good examples, including any noun-noun examples, of what you think might be lost by implementing this are welcomed from opposers. Mihia (talk) 00:16, 8 November 2019 (UTC)
 * 2)  FYI Some words have hyphens even they are not compounds nor idiomatic such as สุไหงโก-ลก and น้ำเป-ล่า, these are for reading aid. --Octahedron80 (talk) 09:38, 11 November 2019 (UTC)
 * The issue of whether this proposal applies in the same way to all languages is one that did actually occur to me, but specifically mentioning "English or English-like languages" seemed to me to be too obtrusive in a section that seems throughout to assume an English-like concept of words, word separation, and orthography generally. General clarification of the way in which various CFI rules should or should not apply to non-English-like languages seems to me to be a separate exercise. Mihia (talk) 00:22, 12 November 2019 (UTC)
 * It would've gone a long way to exclude hyphens when they're solely used as reading aids. In Finnish for example, if the former component of a compound word ends in a vowel and the latter component begins with that same vowel, a hyphen is placed between them (like +  > ). &mdash; surjection &lang;?&rang; 10:02, 12 November 2019 (UTC)
 * Is your point that linja-auto would be excluded by the proposed wording, but it should not be excluded? Mihia (talk) 23:36, 12 November 2019 (UTC)
 * I doubt itself would be, but some other compounds that would pass without a hyphen (if they just had different sounds) might. The question of whether such compounds should be actually included willy-nilly just because they're one word is still an open matter, though. I've personally created those that I've either found in dictionaries, or are fairly common and at least somewhat opaque in meaning. &mdash; surjection &lang;?&rang; 06:38, 13 November 2019 (UTC)
 * If thought necessary, we can, if this option passes, add a disclaimer to the wording indicating that it is presently intended to apply to English and to languages that use hyphens in an English-like way, and that other languages are TBA. My slight concern about this, as I intimated, is that it seems to imply that the rest of the CFI is fully robust for all languages, whereas I think if we really picked apart the CFI against all known languages we would find a number of anomalies and lacunae. Mihia (talk) 14:14, 13 November 2019 (UTC)
 * CFI is actually more consistent for languages other than English than one would think at first. &mdash; surjection &lang;?&rang; 20:34, 19 November 2019 (UTC)
 * Getting slightly off-topic, but a significant omission, for example, is information about how idiomaticity rules apply to languages that do not use spaces. The opening sentence of the section reads:


 * An expression is idiomatic if its full meaning cannot be easily derived from the meaning of its separate components. Non-idiomatic expressions are called sum-of-parts (SOP).


 * "expression" is unhelpfully left undefined, but the only sensible way to interpret this text seems to be that "expressions" are made up of two or more separate words, and that "components" are individual words, not parts of words. Otherwise single (English) words would be subject to deletion as sum-of-parts. However, in languages that do not use spaces (e.g. Japanese) it is not necessarily clear what is and is not a "single word". Any kanji concatenation, however much sum-of-parts, may be perceived as a "single word", and yet it would clearly be undesirable (impossible) for us to include every such concatenation.
 * Also, there is no mention of languages which do use spaces, but which allow arbitrary SOP spaceless concatenations of words to arbitrary length. Mihia (talk) 18:46, 21 November 2019 (UTC)

Option 2
Insert the following words into the "Idiomaticity" section of the CFI:

Hyphenated compounds are not subject to idiomaticity rules, and are included even if they mean no more than the sum of their parts. For example, wine-lover, green-haired, harsh-sounding and ex-teacher are all eligible for inclusion.

Support

 * 1) . DonnanZ (talk) 10:15, 2 December 2019 (UTC)

Oppose

 * , quite strongly. ←₰-→ Lingo Bingo Dingo (talk)  07:54, 7 November 2019 (UTC)
 * , although with the understanding that this entire vote would only concern English or languages where the hyphenation in compounds works similarly, which isn't the case for all languages. &mdash; surjection &lang;?&rang; 09:59, 7 November 2019 (UTC)
 * 1) . Canonicalization (talk) 12:56, 7 November 2019 (UTC)
 * 2) . Mihia (talk) 00:12, 8 November 2019 (UTC)
 * 3) . Imagine what Wonderfool would do with this. Equinox ◑ 00:46, 8 November 2019 (UTC)
 * 4) .  --Lambiam 07:48, 8 November 2019 (UTC)
 * 5)  Stelio (talk) 10:26, 8 November 2019 (UTC)
 * 6)  with the proviso mentioned above by surjection. (For instance, in Scottish Gaelic "sgian-arain" is a breadknife but "sgian arain" would mean a knife made out of bread; "taigh beag" is simply an SOP meaning "little house", but "taigh-beag" means "water closet" &c&c.) -- Droigheann (talk) 09:22, 10 November 2019 (UTC)
 * 7)  --Octahedron80 (talk) 09:28, 11 November 2019 (UTC)
 * 8)  — SGconlaw (talk) 17:08, 12 November 2019 (UTC)
 * 9) . Andrew Sheedy (talk) 17:19, 12 November 2019 (UTC)
 * 10) . -  TheDaveRoss  14:27, 21 November 2019 (UTC)
 * 11)  Pppery (talk) 16:42, 23 November 2019 (UTC)

Decision
Option 1 passes 9–3–2 (75% support), and WT:CFI edited accordingly. Option 2 fails 1–13–0. —Μετάknowledge discuss/deeds 01:25, 12 December 2019 (UTC)