Wiktionary:About Marshallese

This page proposes how Marshallese word entries may be maintained on Wiktionary. To learn more about the language itself, see the article for the on Wikipedia.

Alphabet issues
There are currently display issues with five Marshallese letters:

The characters given here are only approximations of the actual characters that are used in careful typesetting, but they are conditionally used until a better solution is found. Note especially that the sequences involving the combining macron character (U+304) or the combining cedilla character (U+0327) will not display correctly in the majority of fonts.

Only the standard diacritics are used in Wiktionary entries for Marshallese. Alternative schemes (particularly the Ḷ ḷ Ṃ ṃ Ṇ ṇ Ñ ñ Ọ ọ alternatives promoted by the online version of the Marshallese-English Dictionary) are not used. Three other letters with diacritics, Ā ā Ō ō Ū ū, are already well-displayed in most modern default browser fonts; alternative forms Ã ã Ä ä Õ õ Ö ö Ũ ũ Ü ü are not used.

Compare:
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * : Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'DejaVu Sans Mono' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'DejaVu Serif' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Doulos SIL' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Gentium' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Gentium Basic' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Gentium Book Basic' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Gentium Plus' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Inconsolata' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Junicode' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Linux Libertine' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Lucida Sans Unicode' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Noto Sans' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Noto Sans Mono' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Noto Serif' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Open Sans' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Segoe UI' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Source Code Pro' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Source Sans Pro' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Source Serif Pro' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Tahoma' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
 * <span style="font-size: larger !important; font-family: 'Times New Roman' !important;">: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū

Spelling
Separate word entries may be provided in both the old orthography and the new orthography, though references are easier to provide for words spelt in the new orthography because this is what the Marshallese-English Dictionary uses, and as such Marshallese word entries on Wiktionary are expected to overwhelmingly be from the new orthography. Where more than one spelling for the same word is used, including differences reflecting old and new orthographies, their entries can cross-reference each other by placing the other spellings in "Alternative forms" section of the entry, with a qualifier of which different orthography, if any, the linked word uses.

Pronunciation template
Marshallese pronunciations are embedded using a special template,, using phonological conversion algorithms described in Module:mh-pronunc. For example, <tt> </tt> embeds this:

The template describes both :
 * MED s are a pronunciation convention originally devised by Byron W. Bender which appears alongside word entries in the Marshallese-English Dictionary, published in 1976. Words are represented with non-IPA symbols for the phonemes of the language, including just four s, and consonants spelt in a way that resembles written Marshallese , and bracketed in curly brace characters: &#123; &#125;.  Consonants each imply a certain  that directly influences the  and  of neighboring vowels, but this exact interplay is not directly indicated in the transcription other than what can be inferred for the spellings involved.
 * (IPA) phonemic transcription, bracketed in forward slash characters: . Much like MED phoneme notation it only describes individual consonant and vowel phonemes, but represented in a more IPA-friendly format.  Again, as vowel phonemes are specified only for height and not for backness or roundedness, they are given symbols corresponding to the s .  The consonants, for the most part, are specified orthogonally for primary articulation (,  and ), secondary articulation (,  and ), and basic manner of articulation (,, ,  or ), with the resulting symbols .  They are not specified for  or , as these are not phonemic in Marshallese and can occur in.
 * Either one or two phonetic transcriptions based on the Rālik (western) and Ratak (eastern) dialects of Marshallese. Where both dialects agree on phonetic reflexes for the same word, only one form is displayed.

The pronunciation template cannot rely directly on the spelling of Marshallese word entries to guess their pronunciation, as none of the common orthographies in use have a one-to-one phonemic correspondence, though the newer orthography is significantly more phonologically consistent than the older orthography. Instead, the template uses a code format which is essentially a simplified -only modification of Bender's pronunciation guide for the MED. For example, the <tt> </tt> example used earlier in this section uses the code <tt>mhahjelh</tt>, which is similar to  in the MED. The code uses a fairly strict syntax, but is case-insensitive.

The supported vowel phoneme symbols are:

The supported consonant symbols are:

The template's code syntax also supports the use of any number of plain apostrophes (<tt>'</tt>) to disambiguate symbol spellings. For example, <tt> </tt> embeds this:

...whereas <tt> </tt> embeds this instead:

The syntax also permits any number of ASCII and ASCII plain s (<tt>-</tt>), as well as s (<tt>,</tt>) to separate multiple different examples. In the case of commas, the module script processes each comma-separated piece of code separately, and shows the converted result of each fragment with duplicate results removed. For example, <tt> </tt> embeds this:

Besides these symbols, the template allows hyphens and whitespace characters in any combination intermixed with the code sequences, but besides these, the template only accepts code representations of certain sequences of consonants and vowels for each comma-separated fragment of code, and will display an error if that code is malformed. In particular:
 * Code may begin or end with bare vowel phonemes (not normally allowed in the Marshallese language), but vowels may never directly neighbor each other, and they must be separated by at least one consonant. If a bare vowel is present, the module script will assume that the code represents a prefix and/or suffix, and will generate multiple phonetic IPA pronunciations depending on whether the bare vowel is attached to a consonant that is labialized, velarized or rounded.  For example, <tt>  </tt> embeds this:<ul></ul>
 * Code may only begin with single consonants or double consonants of the same phoneme (geminates). Where a word begins with a double consonant, an epenthetic vowel will be inserted in phonetic representation in differents ways for the Rālik and Ratak dialects.
 * Code may begin with pseudo-glides <tt>h_ w_ y_</tt> or end with identical pseudo-glides <tt>_h _w _y</tt> attached directly to otherwise bare vowels. These pseudo-glides do not show up in phonemic transcriptions, either in the MED's format or in IPA, but for the phonetic transcription, the module script will use this as a hint to produce only certain vowel contours of the otherwise bare vowels, rather than trying to generate all three possible combinations.  For example, <tt>  </tt> embeds this:<ul></ul>

Entry collation
When sorting entries for categorization, a simple ASCII-based transcription can properly collate Marshallese entries in Marshallese word categories. Use all lowercase for sorting, and include spaces and dashes as normally included in the entry. And for letters with diacritics: So, a word like would be collated <tt>m~ajol~</tt>.

When using template code like <tt> </tt>, Wiktionary properly and automatically collates entries in associated Marshallese word categories, but collation may still be necessary for topical categories like Category:mh:Islands or Category:mh:Animals which are subcategories of Category:mh:List of sets. For example, in the page for the Marshallese country name, the word entry is not categorized with the markup <tt> </tt> by itself, but collation syntax is added to produce the markup <tt>  </tt> instead.

Marshallese–English Dictionary reference template
The Marshallese–English Dictionary is the only complete Marshallese dictionary in existence, and has one significant online location. The template links to that location, and the template can be updated in case that location changes. The template can accept up to five arguments, each a separate reference. For each reference, only the URL substring immediately following  need be provided.