Wiktionary:About Proto-Indo-European

These are guidelines for Proto-Indo-European (= PIE = ine-pro) entries. Note that since Proto-Indo-European is a reconstructed language, all entries should be subpages in the Reconstruction namespace.

Phonology and transcription
The following scheme is the canonical transcription for PIE words and forms on Wiktionary, which is used within entries as well as the names of entries themselves (but see below on category sorting). Other spellings are permitted, but should redirect to the canonical spelling and not have any entries themselves.

Obstruents are never syllabic, and therefore cannot form syllables on their own.


 * Voiceless stops: *p *t *ḱ *k *kʷ (not k̂ or k̑, ku̯)
 * Voiced stops: *b *d *ǵ *g *gʷ (not ĝ or g̑, gu̯)
 * Aspirated stops: *bʰ *dʰ *ǵʰ *gʰ *gʷʰ (not bh, dh, ǵh/g'h/ĝh/g̑h, gh, gʷh/gu̯ʰ etc.)
 * Fricatives: *s *h₁ *h₂ *h₃ (unknown laryngeal: *H; not ə₁, ə₂, ə₃, and never with a circle below)

Sonorants are sometimes syllabic, and form pairs depending on whether they are considered a “syllable nucleus” (simply said, “vowels”) or not. The general rule is that sonorants are non-syllabic when they are next to a syllabic (a vowel or another syllabic sonorant), and syllabic when they stand in between non-syllabics.


 * Non-syllabic sonorants: *y *w *l *r *m *n (not j, i̯, u̯)
 * Syllabic sonorants: *i *u *l̥ *r̥ *m̥ *n̥

Vowels are always syllabic. Roots that show ablaut (the vast majority) are normally cited in the e-grade, or full grade. A few roots seem to have other underlying vowels, usually a, but also occasionally i or u that never seem to alternate with their non-syllabic counterparts. Underlying long vowels are likewise rare, and long high vowels even more so.


 * Short vowels: *e *o *a *i *u
 * Long vowels: *ē *ō *ā *ī *ū

Diphthongs are sequences of a vowel followed by a nonsyllabic sonorant, and are written as such:
 * *ey *oy *ew *ow etc. (not ei oi eu ou)

Every fully-formed verb, noun or adjective form (with an inflectional ending) has at least one accented syllable. The accented syllable of such words is indicated with an acute accent (´), and always specified in entry names as well as within entries themselves. Current practice is to include it as well on single-syllable words, because there was probably a distinction between presence and lack of an accent. In particular, many clitics and adverbs seem to have been inherently accentless in at least some circumstances.


 * Accented short vowels: *é *ó *á *í *ú
 * Accented long vowels: *ḗ *ṓ *ā́ *ī́ *ū́
 * Accented syllabic sonorants: *í *ú *ĺ̥ *ŕ̥ *ḿ̥ *ń̥

Linking and entry names
In principle, the spelling guidelines outlined above apply to entry names as well. If you see an entry with a name that does not conform to the scheme above, feel free to move it and correct any links to it so that they point to the proper name.


 * Use “Early PIE” (laryngealist) forms by default (, not or ;, not ). Doubtful laryngeal qualities are expressed as simple.
 * Surface (phonetic) forms of reconstructions should generally be used, and not the underlying (phonological) forms with or without separated constituent morphemes, unless it is the universal practice to do so (e.g. ignoring the “laryngeal coloring” effects for vowels). This means application of the synchronic PIE phonological rules. E.g. and not ,  and not  (, from ).
 * Roots are to be cited in the e-grade; entries should not be created for the other grades. Redirects are okay, but fixing the links is better!
 * Cite roots and stems (forms that are not fully inflected words) with a hyphen:.
 * Roots do not carry accents on vowels.
 * Fully-formed words (as opposed to roots) have an inherent accent, which should be present in the page name. You may add a word without an accent if you do not know where the accent should be placed, but when the accent placement becomes known the entry should be moved/renamed to reflect this.
 * Verbs are lemmatised to the third-person singular indicative form (present tense, for verbs having tense distinctions). Nouns are lemmatised to the nominative singular form, and adjectives are lemmatised to the masculine nominative singular form. If the exact form is unknown but the accent and ablaut grades are, use the accent and ablaut grades that these forms would have, but with a hyphen instead of the ending.
 * Words should not be broken up into pieces. Write a word as one whole, without hyphens separating the parts: (not ). Separating hyphens can be used in the displayed form of links to clarify the formation, as long as it is not used in the page name itself.
 * The language code for Proto-Indo-European is ine-pro. Use this code with the and  templates for references in etymologies. When linking to Proto-Indo-European reconstructions, you must add an asterisk (*) before the word (to indicate that it is reconstructed) or an error message will be displayed.
 * Example:
 * Produces:


 * Linked lemmas without a corresponding entry page should be placed in the alternative parameter, avoiding a redlink, e.g., en, ine-pro.


 * Terms with ablauting paradigms should display both patterns in their links, with nouns in the genitive, and verbs in the plural:

Creating Proto-Indo-European entries
Entries should normally begin with to alert readers to the reconstructed nature of the entry and the meaning of the asterisk. Following that, in principle, PIE entries are laid out the same as other entries in the main namespace. So they have a language header, properly nested part-of-speech headers, and possibly also declension and conjugation tables, derived and related terms and other subheadings. You should try to keep to WT:ELE in general, but you can deviate from it a little bit if it's necessary. If you're not sure how to format an entry, you can look at existing entries.

Nouns
The following template can be used for nouns:

Noun

 * 1)

Descendants
The template takes one positional parameter: gender. Wiktionary uses Post-Anatolian stage of Proto-Indo-European as is usual, which had masculine, feminine and neuter genders which can be specified as,  ,   respectively.

Inflection can be either requested using the template as shown above, or can be specified either manually, using the  template, or simply using templates for some simple cases (thematic stems): see Category:Proto-Indo-European noun inflection-table templates. You should also add the entry to Category:Proto-Indo-European thematic nouns, Category:Proto-Indo-European athematic nouns or one of the latter's subcategories, depending on how much is known about the inflection type.

User is advised to specify declensional pattern, as well as oblique case forms, in case full inflection is not provided.

Sorting
Indo-European entries should normally be sorted with a sort key, because of the many special characters that are present in the language. The default sorting order is as follows:


 * Subscripts are replaced with base characters: ₁ ₂ ₃ > 1 2 3
 * Superscripts are replaced with base characters preceded by a lone macron: ʰ ʷ > ¯h ¯w (so that e.g. gʰ always sorts after g rather than between ge and gi)
 * Acute accents, macrons and rings below are removed: á é í ó ú > a e i o u, ā ē ī ō ū = a e i o u, r̥ l̥ m̥ n̥ > r l m n
 * The acute for palatovelars becomes a following apostrophe: ǵ ḱ > g' k'

All of the standard templates such as as well as all the Proto-Indo-European specific templates apply this sorting by default, so you do not need to do anything for those templates. To ensure proper sorting, use or  to add a category that is not added by another template.

Derived terms and descendants
Every entry must have either a “Derived terms” or a “Descendants” section.

Derived terms

 * Derived terms must be ordered first by verbs, followed by all other formations, and then arranged alphabetically.
 * Verbs should specify their verb type in plain text within the pos parameter, e.g., ine-pro. This does not apply to any other formations.

Descendants

 * Organize descendants into language families, each represented by a proto reconstruction. If a proto form isn't reconstructed per guidelines, place a hyphen in the term parameter, e.g., hyx-pro.
 * Use desc for links, rather than l.
 * Where a language specifies that there should be no entry link, place the term in the alt parameter, e.g., itc-pro.
 * Always provide either a term, transliteration, or transcription for attested languages.
 * For terms with a corresponding entry page containing further descendants, place see desc after the link. Do not use desctree.