Template:ru-IPA/documentation

This template automatically generates pronunciation for Russian words, relying on Module:ru-pron.

'''NOTE: This documentation is incomplete. Please contact User:Benwing2 for questions.'''

Examples
Most of the time you can just use the regular spelling of the word, with appropriate accent marks. Use an acute accent for primary stress, and a grave accent for secondary stress.

Use a circumflex to indicate "tertiary stress", i.e. an unstressed syllable where the vowel isn't reduced when it normally would be.

Sometimes you will have to respell a word; this especially happens in borrowed words where written е is pronounced as э. In such cases, use phon=, which causes the respelled form to be displayed.

Respelling is automatic in genitival -ого and -его, in что and related words, and in the sequences гк and гч. A list of exceptions is maintained where -ого and -его should be pronounced as-is.

Certain short words like не, ни, за, до, по, на, etc. are automatically rendered with reduced vowels and attached to the following word, as indicated by  (similarly, certain words like -то are attached to the preceding word). This linking and vowel reduction won't happen if the word has a stress mark. You can force linking/reduction by explicitly joining words with.

Commas and dashes are rendered as IPA foot boundaries.

Gemination can be controlled explicitly using y to force gemination, n to force no gemination, or opt to force optional gemination. Otherwise, sensible defaults will be used; the exact rules are described below.

The sequences зж and жж, except at prefix boundaries, automatically trigger the generation of two pronunciations, the first one with non-palatalized [ʐː] and the second with palatalized [ʑː]. This can be overridden using n to prevent the generation of both variants, or y to force the generation of both variants even at prefix boundaries. Use ӂӂ to indicate only palatalized [ʑː] and сж to indicate only non-palatalized [ʐː]. Examples:

Parameters

 * 1
 * Word or phrase, including accent marks. If omitted, defaults to the page name (only useful when the word is one syllable or includes ё, or when phon is used). Can be either in Cyrillic or transliteration.


 * phon
 * Phonological respelling of the word or phrase; use in place of 1. If used, this should be Cyrillic. This is used, for example:
 * When a written е is pronounced э, as in many borrowed words, such as ;
 * When a written ч is pronounced ш, e.g. in ;
 * When a written в is omitted, e.g. in ;
 * When other letters are left out, as in casual pronunciations, such as of.
 * Do not use in the following situations:
 * For indicating gemination or lack thereof;
 * For indicating the pronunciation of final -е;
 * For г pronounced [ɣ];
 * For зж or жж pronounced palatalized (i.e. as [ʑː]), as in or ;
 * For indicating the pronunciation of сч, either as [ɕː] or [ɕt͡ɕ].
 * Accent marks other than primary (acute) and secondary (grave) are suppressed in the output.


 * noadj
 * Suppress the default transformation of adjectival -го to -во, сегодня to севодня and сегодняшн– to севодняшн-. (Note, there are exceptions for много, немного, лого, лего, сого, ого.)


 * noshto
 * Suppress the default transformation of что to што (and likewise for ничто, чтобы, чтоб).


 * gem
 * If, force gemination to be preserved in places it otherwise wouldn't. If  , force degemination in places it otherwise would be preserved. If  , force optional gemination, including in places it wouldn't normally be preserved. Can also include multiple values separated by slashes, one per word. See below.


 * pos
 * Part of speech code for controlling phonological outcomes that depend on the grammatical role of the word or words in question. Can also include multiple values separated by slashes, one per word. Currently used only for final unstressed -е and for words in final -ться or -дься. See below.


 * raw
 * Render the result "raw", e.g. instead of.


 * ann
 * If non-empty, display a boldfaced annotation before the pronunciation. This is intended to display the spelling of the term when there are multiple pronunciations given in the pronunciation section, based off of differently-accented forms. If equal to, display the result in phon (if specified) or 1 (although the value in phon is rarely what you want); accent marks other than primary (acute) and secondary (grave) are suppressed in the output. If any other non-empty value, display that value. Example:   displays.


 * bracket
 * Control whether brackets appear around the generated IPA. Default is yes. Use n to suppress the bracket.


 * zhpal
 * If, and force generation of both [ʐː] and [ʑː] variants of words with зж or жж in them, even across prefix boundaries. If  , generate only the [ʐː] variant in all cases.

Part of speech codes
The value of the pos parameter can be either a single part of speech code or a series of such codes separated by slashes. In the former case, the same part of speech applies to all words in a multi-word phrase. In the latter case, there should be as many codes as there are words (otherwise, an error will result, tell you the expected number of words). Rules for counting number of words:
 * Both spaces and hyphens separate words. Hence ка-гэ-бэ is three words.
 * A comma, en dash (–) or em dash (—) counts as a word.
 * Two or more words joined by the ‿ sign count as a single word.

As mentioned above, part of speech codes are currently only used in two circumstances:
 * For determining the correct pronunciation of unstressed final -е (which may be pronounced [ə], [e], [ɪ] or [ɨ], depending on part of speech and the preceding letters).
 * For determining the correct pronunciation of final -ться (which is pronounced unpalatalized in infinitives but palatalized in imperatives).

The possible part of speech codes for final -е are:
 * omitted or  = Default ([ə] in the ending -ое, else same as  )
 * or  = Neuter noun in the nominative/accusative  (but not ending in adjectival -ое or -ее; those should be considered as adjectives)
 * = prepositional case singular
 * = dative case singular (treated same as prepositional)
 * = vocative case (currently treated as )
 * = noun nominative plural in -е (гра́ждане, боя́ре, армя́не); not adjectival plurals in -ие or -ые, including adjectival nouns (да́нные, а́вторские)
 * = invariable noun or other word (currently treated as )
 * or  = adjective or adjectival noun (typically either neuter in -ое or -ее, or plural in -ие, -ые, or -ье, or short neuter in unpaired sibilant + -е)
 * or  = comparative (typically either in -ее or sibilant + -е)
 * = adverb
 * = preposition (treated same as adverb)
 * ,,   = verbal ending (usually 2nd-plural in -те); not participle forms, which should be treated as adjectives
 * = pronoun (кое-, какие-, ваше, сколькие)
 * = number (двое, трое, обе, четыре; currently treated as )
 * = prefix (treated as  because integral part of word)
 * or  = force high values ([ɪ] or [ɨ])
 * = force mid values ([e] or [ɨ])
 * ,,   = force low/schwa value ([ə])

The possible part of speech codes for final -ться are:
 * (none) = infinitive, pronounced as [t͡sːə] or [t͡sə]
 * = imperative, pronounced as [tʲ͡sʲsʲə]

Accent marks

 * Indicate primary stress with an acute accent, e.g. е́ or ы́.
 * Indicate secondary stress with a grave accent, e.g. ѐ or ы̀.
 * Indicate "tertiary" stress with a circumflex or double grave accent, e.g. е̂ or ы̂ or е̏ or ы̏ (i.e. where a vowel is to be treated as stressed but should not be marked as possessing either primary or secondary stress).
 * Force the reduced pronunciation of a vowel that would otherwise be stressed (e.g. in a single-syllable word) with a dot above, e.g. е̇ or ы̇.
 * Force a semi-reduced pronunciation ([ə] or [ɐ], instead of [ɪ]) of я or palatal а with a dot below, i.e. я̣ or а̣. This should be used in noun and verb endings, i.e. third-plural -ят, dative plural -ям, instrumental plural -ями, prepositional plural -ях.

Special consonants

 * Use h or ɣ to indicate [ɣ] (the voiced fricative pronunciation of the letter г).
 * Use ӂ to indicate [ʑ] (the soft pronunciation of ж, as in words like, or ).
 * Use щч or ɕч to force the pronunciation [ɕt͡ɕ]. By default, some instances of сч are pronounced [ɕt͡ɕ] and some are pronounced [ɕː]; see below.

Special punctuation

 * Use / to force a syllable boundary before a consonant. This also forces the "long" pronunciation [t͡ss] of тс, and similarly [d͡zz] for дз, and is useful particularly when another consonant follows, because otherwise the "short" pronunciations [t͡s] and [d͡z] would be used. An example is ветслу́жба; if written вѐтслу́жба, the result is, but if written вѐт/слу́жба, the result is . See below for the exact rules of when short and long pronunciations of тс and дз are used.
 * Use _ between consonants to prevent all assimilations between consonants, including voicing assimilation, palatalization assimilation, consonant reductions (e.g. лнц to нц, стл to сл), etc.
 * Use (ь) after a consonant to indicate optional palatalization of the consonant.
 * Use ‿ to force close juncture between words (treat the combination as a single word). Examples: becomes  and  becomes.
 * Comma and em/en dashes (– and —) are converted to foot boundaries ( | ).

Phonological rules
The following sections detail some of the phonological rules used when generating pronunciation.

Gemination
Geminated consonants (including cases like зж that become geminated after assimilation) may or may not be rendered with gemination. Use gem to override the defaults, e.g.: As an exception, щ is always rendered [ɕː], and ӂӂ is always rendered [ʑː], regardless of gem. The same thing applies to combinations that are pronounced as щ (e.g. some сч, see above), regardless of gem.
 * 1)  becomes
 * 2)  becomes
 * 3)  becomes

Optional gemination can also be forced by writing C(C), e.g. becomes.

By default, gemination is normally preserved:
 * When occurring directly after the stress and between vowels, except for лл.
 * In the combinations йя, йе, йи, йьо, йю when occurring directly after the stress and between vowels; these are treated as containing /jj/.
 * When occurring at the beginning of a word in вв-, сс-, жж-, зз-, шш- and combinations pronounced the same (e.g. сз-).
 * Whenever the pronunciation [ɕː] (e.g. from щ and initial сч-) and [ʑː] (from ӂӂ) occurs. As mentioned above, this cannot be overridden by gem.
 * In the combinations жж, зж, сж, шж when occurring between vowels or word initially.
 * In the combination нн when occurring between vowels before the stress.
 * In the combination сск when occurring directly after the stress.
 * At boundaries between certain prefixes and roots (e.g. рас-, раз-, ис-, из-, под-, от-, в-, с-, нис-, низ-, контр-; also the same when additionally prefixed by не-).

By default, gemination is normally rendered as optional:
 * In the combination нн when occurring between vowels after but not directly after the stress: becomes.

By default, gemination is normally not preserved in all other circumstances. Note that this includes most occurrences of gemination not between vowels, e.g. becomes.

For multiword expressions, the value of gem can include multiple codes, separated by slashes. For example,  produces. There must either be only one code and no slashes (in which case that code applies to all words), or as many codes as there are words. If the default behavior is desired for a particular word, use a blank code, as in the example just given.

Pronunciation of сч
сч is sometimes rendered as [ɕt͡ɕ] and sometimes as [ɕː], according to the following rules: As mentioned above, this can be overridden using щч to force the pronunciation [ɕt͡ɕ], and щ to force the pronunciation [ɕː]. Hence счи́тывать must be written щчи́тывать, and несча́стье must be written неща́стье.
 * 1) [зж]ч and [сзшж]щ are pronounced [ɕː], like щ.
 * 2) сч at the beginning of a word and in the sequences счёт and счит is pronounced [ɕː]: hence сча́стье, просчёт , подсчёт , пересчи́тывать.
 * 3) Other сч are pronounced [ɕt͡ɕ].

Pronunciation of тс, дз, тш, дж
тс and дз are sometimes rendered with the "short" pronunciations [t͡s] and [d͡z], and sometimes with the "long" pronunciations [t͡ss] and [d͡zz]. тш and дж always use "long" variants. Specifically:
 * 1) [дт] с and [дт] з across a word boundary use the long pronunciation, as do [дт]/с and [дт]/з with explicitly written slash.
 * 2) [дт]с + vowel and [дт]з + vowel use the long pronunciation.
 * 3) [дт]ьс and [дт]ьз use the long pronunciation, with palatalized [tʲ͡sʲ] and [dʲ͡zʲ].
 * 4) Word-initial отс-, подс- use the long pronunciation, because there is a morpheme boundary involved.
 * 5) Other [дт]с and [дт]з use the short pronunciation.
 * 6) [дт]ш and [дт]ж, including with an intervening ь, always use the long pronunciations [t͡ʂʂ] and [d͡ʐʐ] (possibly with the first part palatalized).