Wiktionary:About Indonesian

This policy explains considerations for Indonesian entries that are not covered by WT:ELE and other general policies.

A very simple example
This is a simple entry for the word, and shows the most fundamental elements of an article:

Etymology
From, from.

Noun

 * 1) dictionary

Guidelines

 * Wiktionary may be used by learners who are not proficient in Indonesian.
 * Only entries that fulfill the criteria for inclusion can be created.
 * Lemma: The lemma form in Indonesian is the, and  written in  based on the the latest spelling standard (2015 Ejaan Bahasa Indonesia).
 * Spelling: In most cases, the latest spelling standard, which is 2015 Ejaan Bahasa Indonesia and KBBI V, used to determine which variant spelling to place a word’s definition at.
 * Pronunciation: The pronunciation may be entered in both phonemic (using / /) and phonetic form (using [ ]).

Lemma entries
A lemma is the canonical form of an inflected word; i.e., the form usually found as the headword in a dictionary, such as the nominative singular of a noun, the bare infinitive of a verb, etc. In Indonesian dictionaries, such as KBBI,  treated as lemma, and most derived words (except preffixed di-, -kah, -ku, -lah, -mu, -nya) treated as sublema (lit. sub-lemma). For example, is a lemma, and  is a sublema, but  is not a lemma. However, for practical consideration, lema and sublema are treated as lemma in this Wiktionary.

As Indonesian is written only in the Latin script, the lemma entry in Indonesian is the in Latin script using the latest spelling (2015 Ejaan Bahasa Indonesia).

The layout for lemma entries in Indonesian is based on Entry layout. For non-lemma entries, see non-lemma entries for specific considerations.

Headings before the definitions
As for editing, shall be added to current edit if necessary.

Alternative forms
Alternative forms are variations of the same word kept in multiple pages. Examples include:
 * Regional variations.
 * Obsolete forms and historical variations, such as words no longer used due to spelling reforms. See spelling reforms for more.
 * Hyphenation or spacing of compounds, such as and.
 * Variation in style such as uncertain capitalization.
 * Common misspellings.

The standard form (bentuk baku) listed in is usually considered the standard form. Warning: Do not list the word which standard by KBBI but never used by speaker as standard form, use commom form.

Entries that are alternative forms will have templates such as these at the definition line to link the entry back to the standard form.

The "Alternative forms" header can also be listed after the part of speech, if the spelling is specific to that part of speech.

Etymology
The first header below the language heading is usually the level 3 “Etymology” header. The etymology is given right below the header without indentation. Etymology essentially shows where the word comes from. This may show the forms in other languages that underlie the word. For many modern words it may show who coined the word. If a word is derived from another in the same language by a regular rule, such as formation of an English adverb by adding “ly”, it is not necessary to repeat the complete details of the word’s origin on the page for the derived word.

Sometimes two words with different etymologies belong in the same entry because they are spelled the same (they are homographs). In such a case there will be more than one “Etymology” header, which we number. Note that in the case of multiple etymologies, all subordinate headers need to have their levels increased by 1 in order to comply with the fundamental concept of showing dependence through nesting.

Origin of Word: Inherited
A significant category of words in a language are the so-called ‘native’ or ‘inherited’ words; in some languages, but not all, they form the majority of words. This means that they have developed from an earlier form of the language which may or may not have gone by the same name. The ancestor languages of Indonesian are in this order:
 * Malay
 * Classical Malay
 * Old Malay
 * Proto-Malayic
 * Proto-Malayo-Chamic
 * Proto-Malayo-Sumbawan
 * Proto-Sunda-Sulawesi
 * Proto-Malayo-Polynesian
 * Proto-Austronesian

For words with an unbroken chain of inheritance, is used. If the source of the inherited word is borrowed from a nonancestral language, e.g. a Malay word borrowed from Sanskrit/Portuguese/Arabic or a Proto-Malayo-Polynesian word borrowed from Proto-Oceanic, then is used instead of. See Template:inherited for a more detailed explanation.

For informative purpose, add words from Classical Malay, only show if those words attested.

Origin of Word: Borrowings/Loanword
Some words have been borrowed from other languages, either because of a historical occupation or co-existence, or simply through exposure to other languages. Borrowings can be ancient or recent. When words are first borrowed into a language they may still ‘seem’ foreign. After a while they become more naturalized. Eventually they seem completely native. To flag a borrowing, use or the shortcut. For extra consideration for borrowings/loanword, see Borrowings/Loanword below.

Specific usage of other foreign derivation templates

 * Calques. For calques or loan translations, it is necessary to provide the source language out of which the lexeme, compound or a phrase has been calqued. The template should be used. For example, the term of  is a calque of . Thus, it is written as.
 * Learned Borrowing. The template should be used. For example, the term of  is a learned borrowing of . Thus, it is written as.
 * Phono-Semantic Matching. The template should be used. For example, the term of  is a phono-semantic matching of . Thus, it is written as.
 * Semantic Loan. The template should be used. For example, the term of   is a semantic loan of . Thus, it is written as.

Word formation

 * Affixation: Affixes are morphemes that are attached to a word stem to form a new word or word form. They can be derivational, like English and, or inflectional, like English plural  and past tense . They are bound morphemes.
 * Prefix, use in.
 * Suffix, use in.
 * Infix, use in.
 * Circumfix, use in.
 * Back-formations. Words that look like a regular formation can have the formation reversed (especially, removing apparent affixes) to yield a new word. This is called back-formation, and the template helps here. For example,  is a back-formation of, thus it uses  in the  page.
 * Blends. A blend or portmanteau word is a word which was originally formed by combining two other words. For example, can be used for.
 * Calques. For calques or loan translations, it is necessary to provide the source language out of which the lexeme, compound or a phrase has been calqued. The template should be used. For example, the term of  is a calque of . Thus, it is written as.
 * Clipping use
 * Reduplication, use.

Pronunciation
The "Pronunciation" section includes the transcriptions, audio pronunciations, rhymes, hyphenations and homophones.


 * followed by the transcription in the is used. Phonemic transcriptions are placed between diagonal strokes ( / / ) while phonetic transcriptions are placed between square brackets ([ ]).
 * If regional variations or accents exist, then (e.g. ) is used before.
 * Hyphenation: Hyphenation describes how a word is broken across line breaks. For example, is hyphenated as.
 * Homophones: List any homophones of the word in alphabetical order using . For example, and . If a word is a homophone in a particular dialect, then Jakarta can be used to specify the dialect for that particular homophone.
 * The template can be used to request a pronunciation in a Wiktionary entry.

Part of speech and headword line
The part of speech (POS) is a descriptor like “Noun” or “Adjective”. The allowed POS are:
 * Parts of speech: Adjective, Adverb, Ambiposition, Article, Circumposition, Classifier, Conjunction, Contraction, Counter, Determiner, Interjection, Noun, Numeral, Participle, Particle, Postposition, Preposition, Pronoun, Proper noun, Verb
 * Morphemes: Circumfix, Combining form, Infix, Interfix, Prefix, Root, Suffix
 * Symbols and characters: Diacritical mark, Letter, Ligature, Number, Punctuation mark, Syllable, Symbol
 * Phrases: Phrase, Proverb, Prepositional phrase
 * Varieties.

The headword line is the line directly below the part of speech header, in which the word is repeated, along with a alternative spelling with diacritics (see below) if applicable. For example, headword line of is.

Headwords containing "e"
Although diacritics are often omitted in daily usage, diacritic is used in dictionary (mainly KBBI) to differentiate distinctive phoneme of "e", specifically /ə/ and /e/. The phoneme /e/ can be realized as [e] and [ɛ]. While the used diacritic are varied on preference of the author, the diacritic based on KBBI.

If the letter "e" is present in the headword, then an additional parameter head is required with the normal "e" replaced by one of the following:
 * ê (e with circumflex) to represent (mid central vowel)
 * é (e with acute accent) to represent (close-mid front unrounded vowel)
 * è (e with grave accent) to represent (open-mid front unrounded vowel)

e.g. for.

Remember not to do add diacritics to hyphenation section.

Definitions
The definitions are in the POS section, below the headword line. The definitions are organized as a numbered list. The numbers are generated by adding the number sign (#) at the start of each definition in the wikitext. The key terms of a definition should be linked to the respective entries.

Context label
A context label is placed before a definition to indicate any one of the following: Many context label templates also categorize an entry into a relevant category, but they must not be used merely for categorization. Examples of recognized labels can be found in Appendix:Glossary. Context labels commonly used in Indonesian entries include the following:
 * 1) The definition applies only to a restricted context.
 * 2) The definition occurs only in a limited geographic region or temporal period.
 * 3) The definition is only used by specialists in a particular field and not by the general population.



Shortening "form of" templates
Several "form of" template are permitted in definition section. There are several shortening "form of" templates, such as:
 * acronym: an abbreviation formed by the initial letters of other words, sometimes exclusively such abbreviations when pronounced as a word (as "laser") rather than as individual letters (initialisms such as "TNT"). For example, in.
 * Clipping use
 * Apheresis, involved a loss or omission of a sound or syllable from their beginning. For example, in.
 * Apocope, involved a loss or omission of a sound or syllable(s) from their end. For example, in.
 * Syncope, involved a loss or omission of a sound or syllable from their interior. For example, in.
 * Contractions: shortened words or phrases. For example, in.
 * ellipsis: the omission of a word or phrase that can be inferred from the context. For example, in  or  in.
 * initialism: a term formed from the initial letters of several words or parts of words, but which is itself pronounced letter by letter. For example, in.
 * short form, involved removing the syllable(s), and considered a word in its own right rather than an abbreviation. For example, in.

Affixed "form of" templates
Indonesian is a agglutinative language, thus words may contain different morphemes to determine their meanings.
 * For the superlative form, use . For example, for.
 * For the verbal construction, see Template:adj form of.

Spelling "form of" templates

 * Alternative spelling: or
 * Dated spelling:

Quotations
Quotations can be used to provide evidence that a word or sense exists (as required by WT:ATTEST), and to provide examples of how the word is used. For words such as alternative spellings or nonstandard compound words that are attested in Indonesian but not found in major dictionaries such as, the addition of quotations is highly recommended.


 * 1) Definition of word in English

Example sentences
Generally, every definition should be accompanied by a quotation illustrating the definition. If no quotation can be found, it is strongly encouraged to create an example sentence.

e.g., the Indonesian word for ministry:
 * 1) Definition of word in English

Semantic relations
Synonym, antonym, hypernym, hyponym, meronym, holonym, and troponym are examples of semantic relations. Detailed explanation and usage are covered in Semantic relations.

If there are multiple parts of speech and it is not known from which part of speech a certain derivative was formed, then semantic relations will be on the same level as the part of speech headings.

Derived terms
List terms in the same language that are morphological derivatives. In Indonesian, affixed word (kata berimbuhan) and compound word (kata majemuk) are part of derived terms.

Usage of is recommended for affixed derivatives.

Related terms
List words in the same language that have strong etymological connections but are not derived terms. Each term should be wikified with. Note that words with similar meanings but are not etymologically related are listed under the "See also" header.

Descendants
The template is used to list terms in other languages that have borrowed or inherited words from Indonesian. The etymology of these terms should then link back to the Indonesian lemma.

See List of languages for a complete list of language codes.

Anagrams
The "Anagrams" section is for anagrams such as, and. Anagrams are listed after the "Further reading" section.

e.g. entry for :

Categorization
Topic categories such as Category:id:Animals or custom categories such as Category:Indonesian basic words can be added by appending the following at the bottom of entries:

The complete list of topic categories available can be found at Category:List of topics (note that the first letter is capitalized). List of custom categories available can be found at Category:Indonesian language.

Affixed words and compound words
An affixed word (kata berimbuhan) is a word which has been affixed with prefix, infix, or suffix, while a compound word (kata majemuk) is a lexeme (less precisely, a word or sign) that consists of more than one stem, but used in single unit. In most aspects, the entry layout is similar to Lemma entries. However, there are several points.

Etymology

 * The compound word is written using template . For example, uses.
 * The affixed words use template in Category:Morphology templates. For example, uses,  uses , and  uses . It is important to use standard affixes (not use the assimilated form) as Tata Bahasa Baku Bahasa Indonesia, i.e.  uses , instead of.

Part of speech and headword line
Part of speech is similar in Lemma entries, however the headword line uses the (POS) form in order to be categorized as a non-lemma. For example, the entry for is written as below:

Verb

 * 1)  to tell a story

Possessive forms of nouns
Possessive forms for nouns such as rumahku (my house), rumahmu (your house), rumahnya (his/her/their house) are automatically displayed using which is equivalent to the following:

Note that separate entries are not created for these possessive forms.

Historical variations
During the course of time, the Indonesian language has undergone several spelling reforms. This resulted in different historical spellings of the same word. To reduce duplication of content, historical form entries would serve as soft redirects to standard form entries, and have no definitions or other information except for quotations. All other information belong to standard form entries.

e.g. entry for :

Proper noun


The categories for historical variation are: archaic, dated, obsolete. See Classifications of old words for specific considerations.

Nonstandard forms
Several words have nonstandard forms which can be found in everyday setting. To reduce duplication of content, nonstandard form entries would serve as soft redirects to standard form entries, and have no definitions or other information except for quotations. All other information belong to standard form entries.

Noun


The standard form entries shall list these nonstandard forms under the Alternative Forms header:

Alternative forms


If there is an entry with both standard form and nonstandard form that have different definitions, then that entry shall be separated into several etymologies. The etymology with standard form shall be entered as lemma, while the one with nonstandard form shall be entered as nonstandard forms. Overall entry layout is similar to the lemma entry.

Regional Languages
Note that also introduce lemmas from other languages to popularization purpose, these terms not yet attested, such as:
 * Balinese (labeled as Bl)
 * Batak (labeled as Bt) - consists of several varieties such as Toba Batak and Karo Batak.
 * Betawi or Jakarta Malay (labeled as Jk)
 * Javanese (labeled as Jw)
 * Lampung (labeled as Lp) - consists of three varieties: Nyo, Api and Komering.
 * Madurese (labeled as Mdr)
 * Malay (labeled as Mal)
 * Minangkabau (labeled as Mk)
 * Manado Malay (labeled as Mnd)
 * Musi or Palembang Malay (labeled as Plb)
 * Sundanese (labeled as Sd)
 * Tombulu or Minahasan (labeled as Mn)

Words with these labels in KBBI deserve entries in their respective languages. Editors are encouraged to create entries in the original languages first before creating the Indonesian entry as some of the words may not be attestable in spoken or written forms of Indonesian.

Indonesian vs Malay
Indonesian and Standard Malay (the national language of Malaysia, Singapore, Brunei) have a common origin that can be traced back to the Classical Malay of the Johor-Riau Sultanate. Historically, Malay was the lingua franca used by maritime traders in the Indonesian archipelago. During the Dutch colonial era, Malay in Latin script was introduced as a school subject around late 19th century.

The earliest mention of "bahasa Indonesia" (Indonesian language) was in the 1928 Sumpah Pemuda (Youth Pledge). After independence in 1945, the language underwent further expansion in vocabulary with addition of new technical, scientific and professional terms by the Lembaga Bahasa dan Budaya (Institute for Language and Culture).

As of 2019, Indonesian and Malay are treated as separate languages on Wiktionary. Examples of differences between the two languages can be found at Comparison of Standard Malay and Indonesian.

Lemmas that existed during the period of Dutch colonization are recorded in modern dictionaries such as Kamus Besar Bahasa Indonesia (labeled as Bld) and Kamus Dewan (labeled as IB).

Lemmas before 1945
Generally, words attested before 1928 are to be placed under the Malay language. (See Sumpah Pemuda)

For words attested between 1928 until independence in 1945 the following considerations are made:
 * If the lemma has become obsolete and is not found in modern Indonesian, then a Malay language entry shall be created with the context label before the definition to categorize the entry into Category:Dutch East Indies Malay. This format is also applicable for words attested before 1900.


 * If the lemma continues to be used in modern Indonesian with minor changes in spelling, then an Indonesian language entry shall be created with as its definition.

Note that spelling from 1901 to 1947 is based on Van Ophuijsen’s Kitab Logat Melayu: Woordenlijst voor de spelling der Malaisch taal met Latijnsch karakter (Malay Vocabulary: Wordlist for the spelling of the Malay language with Latin characters).

Historical variations: spelling reforms
In principle, the latest spelling standard is used to determine which variant spelling to place a word’s definition at. Indonesian has experienced several spelling reforms. The spelling reforms are summarized as below:

Borrowings/Loanword
If the word is a modern borrowing from a contemporaneous language, is used. For indirect borrowings or earlier borrowings, is used instead.

For modern borrowings from extinct languages such as Sanskrit or Latin, or its shortcut  is used.

For long chains borrowing considered using. For example:

Etymology
From.

See List of languages for a complete list of language codes.

Chinese languages
In Indonesian, most loanwords from Chinese language are not based on Mandarin Chinese (／／／, ), but are based on one of the following: Thus, it is important to input the specific language source in Traditional Chinese characters along with the pronunciation and related definition. Otherwise, the Mandarin romanization is displayed by default. For example,, which is based on Min Nan , is written as below:
 * 1) Min Nan, which can be further divided into:
 * 2) Hokkien
 * 3) Amoy
 * 4) Changchow
 * 5) Teochew (／,  )
 * 6) Hakka
 * 7) Cantonese.

Etymology
From

Sanskrit language
As part of the Indosphere, Indonesian has significant influence from Sanskrit (Bahasa Sanskreta). This influence can be borrowed and inherited through Malay, or borrowed through Javanese or other languages, such as Balinese. These multiple sources can result in multiple borrowings, such as and. Thus, for borrowings for Sanskrit, the etymology and original word in Sanskrit shall be entered appropriately depending on how it is borrowed.

For modern borrowings, i.e. learned borrowings, the following is used: From. For terms inherited through 19th century Malay or Classical Malay, the following is used: From, from. For terms borrowed through local languages such as Javanese and Balinese, the following is used. From, from.

Do not use as Sanskrit and Indonesian are not contemporaneous languages.

European languages
In Indonesian, the majority of European loanwords are from Dutch or English. Some words may also be calques, e.g..

For modern borrowings, is used while  is used for calques. The ancestor of the borrowed word, e.g. Proto-Germanic, Proto-Indo-European can be additionally stated using  to categorize the lemma into categories such as Category:Indonesian terms derived from Proto-Germanic.

e.g. entry for

Etymology
From, from , from , from , from.

Pronunciation
The pronunciation may be entered in both phonemic (using / /) and phonetic form (using [ ]). If both are provided, the two transcriptions are separated using.

Classifications of old words
The following tags are not intended to dictate whether or how to use the tagged entries, but to inform the reader of the modern rarity and possible connotations within modern contexts.

Classical Malay
Classical Malay words (Etymology-only code ms-cla), used 14th to 18th century, are spelt differently from current spelling, or completely different in meaning, as to be virtually a foreign language to modern Indonesian speakers. Entries for such terms are treated as foreign words with the L2 language heading ==Malay==, categorized within Category:Classical Malay, or treated as Indonesian if 'used' in Modern Indonesian (see explanation below). Classical Malay terms which absence in modern time labelled as 'kl' in KBBI.

Obsolete
No longer in use; found only in very old texts. Can also apply to a no longer understood definition of a word. Examples: "to put" definition of.
 * Virtually no one would currently use the word or meaning, and very, very few would understand the word or meaning if it were used in speech or text.

For vocabularies inherited from Classical Malay and never used in modern day treated as obsolete. For words still used or re-used in modern day usually treated poetic or literary.

Archaic
No longer in general use, but still found in some contemporary texts (e.g. the Bible). Example,.
 * Generally understood by educated people, but rarely used in current texts or speech.

The term arkais (archaic) used in KBBI have slightly different usage compared to Wiktionary partice. It's can be translated as dated, archaic or obsolete.

Dated
Still in use, but generally only by older people, and considered unfashionable, superseded or dated, particularly by younger people. Examples: (in sense "toothpaste")

Please keep in mind that what may be considered "unfashionable" or "dated" in one region may not be in another, so where possible please include regional information. If in doubt, make a Usage Note stating that the term may be considered unfashionable or dated in some areas.

Verb
Verb is a word that indicates an action, event, or state. Verbal prefix in Indonesian consist of, , and. Verbal suffix in Indonesian consist of and.