User:-sche/missing codes

ISO 639 includes 7865 codes; this is a list of those codes which are not present in Wiktionary. Some are intentionally excluded by us or merged by us into other codes, but others are missing by simple oversight, or in some cases because naming conflicts years ago prevented importation.

For a list of codes which the ISO has retired but which Wiktionary still includes, see User:-sche/retired codes. (Items also listed there and in the BP should be struck here.)

For a list of codes which are present in Wiktionary but which should not be, see User:-sche/notlanguages. Also note Spurious languages: we currently include and encode some lects which WP thinks may not exist, including Yiddish Sign Language, Parsi 'prp' and Parsi-Dari 'prd'.


 * See also Beer parlour/2015/January.

= uncoded lects fr.Wikt has =


 * clear candidates for addition
 * Pa Na, Bana (Hmongic lect in China)


 * need more info
 * Anggreso (tribe in Indonesia; but what evidence of it is there of a separate lect?)
 * Chali (with āva as the word for "water"); different from the ngf?
 * Brunei Dusun and Paper Dusun; compare dtp
 * Gao of Dongkou
 * Long Anap Kenyah
 * Long Dunin Kenyah
 * Long San Kenyah
 * Khiajari
 * Khoznini
 * Lisum
 * Me’phaa of Huehuetepec, of Huitzapula, of Nanzintla, of Teocuitlapa, and of Zapotitlan Tablas
 * Miao of Xiaozhang
 * Miri (???)
 * Murut Nabaay
 * Seminole (?)
 * Nadou
 * Nanga Ira'
 * Nunu of Bama
 * Penan Benalui
 * Sagz-âbâdi (a variety of Tati?)
 * Saynawa
 * Seputan
 * Shang
 * Shinman
 * Tingalan
 * Tongzha
 * Chemehuevi and Southern Paiute (dialects...)
 * (part of Mwotlap?)
 * Huron (Wyandotte)


 * Yulparija, a
 * Zandui (a Hlai lect)


 * Bengni (dialect or alt name of ?)
 * Limbang Bisaya (included by us in bsb)
 * Botnian (included by us in fi)


 * Bularnu (a dialect of )
 * all varieties of the or Bunu (which we include in bwx), oddly coded: bwx is used for "Dongnu of Dahua" ("Dongnu", "Tung Nu"), and bwx-bun for "Bunuo" ("Pu No"), and bwx-nao, bwx-num and bwx-nun for "Naoklao" ("Baonao", "Nao Klao"), "Numao" ("Nu Mhou"), and "Nunu of Lingyun" ("Nunu", "Nu Nu").


 * Dongmeng and Dongnu of Du'an, related to the preceding
 * Barghu as "bxu-bar" (a dialect of Mongolian?)
 * Mutsun and Rumsen (Southern Ohlone, css)
 * Chalon and chochenyo (Northern Ohlone, cst)
 * Eudeve (another name of opt, Opata, Heve, Hegue)
 * White Gelao of Diyingshao, of Judu, of Moji, of Niupo, of Pudi, of Yueliangwan, and of Wantao; Red Gelao of Fanpo, of Longjia, and of Na Khê; Green Gelao of Liangshui, of Qinglong, of Sanchong, and of Zhenfeng (presumably included in an existing code)
 * Gudjal (dialect of the )
 * Hoanya (dialect of the
 * Kalis (dialect of the )
 * Long Napir Kelabit (dialect of the )
 * Fu-Yu Kirghiz (dialect of Kirghiz)
 * Kotoxo (dialect of the )


 * Kumaná is kuo
 * Juaneño
 * Southern Patwin (dialect of the )
 * Sonqor (Azerbaijani)


 * Taokas (dialect of Babuza)


 * already under discussion
 * Indanga (see RFM)
 * Lazé / Laze (see RFM)
 * Ma Pnaan (see RFM)
 * Mitchiti (see RFM)
 * Plodarisch, and Saurano, and Tischlbongarisch (see WT:T:ADE)

and


 * Baisha, Baoting, Changjiang, Ha Em, Lauhut, Moyfaw, Yuanmen (Hlai lects in China)
 * In Baisha, Baoting, Changjiang, Ha Em, Moyfaw, and Tongzha, "water" is nam³ per Peter Norquest, A phonological reconstruction of Proto-Hlai. In Bouhin and Lauhut, it is nom³, in Jiamao it is naːm¹, in Yuanmen and Zandui it is nam⁶, and in Nadou it is nan³.

= missing codes =

2016

 * [kgd] Kata(a)ng was removed and split into [ncq] Northern Katang and [sct] Southern Katang.


 * szs Solomon Islands Sign Language -
 * npx Noipx - (?)
 * lth Thur - says "Labwor (Thur), once considered a dialect of Acholi, may not be intelligible with it."
 * wbs West Bengal Sign Language - part of ins?
 * ibh Bih - variety of rad? see
 * nql Ngendelengo - autonym Olungendelengo - a Herero group (dialect of hz?)
 * xdo Kwandu - split from mho? see

G

 * gll — Garlali / Kullili - should have a code, though cf notes about ekc

J

 * jan — Jandai
 * jbi — Badjiri - possibly not even Karnic; cf my notes about ekc above and on User:-sche/retired codes
 * jbw — Yawijibaya
 * jgk — Gwak
 * jjr — Bankal
 * jog — Jogi
 * jui — Ngadjuri

N

 * nmx — I; L; Nama (Papua New Guinea); - Nama (New Guinea) - distinguished from naq "Nama", a dialect of nza "Tigon Mbembe", a Jukunoid language of Cameroon and Nigeria
 * npg — I; L; Ponyo-Gongwang Naga;
 * nqy — I; L; Akyaung Ari Naga;
 * nsf — I; L; Northwestern Nisu;
 * nwg — I; E; Ngayawung;
 * nxk — I; L; Koki Naga;

O

 * olk — I; E; Olkol;
 * orc — I; L; Orma;

S

 * sfe; — I; L; Eastern Subanen; -
 * sgj; — I; L; Surgujia; - Suraji, Surguja, Surgujia-Chhattisgarhi, Surjugia

T

 * tgj ; — I; L; Tagin; -
 * tgz ; — I; E; Tagalaka; -
 * tjl ; — I; L; Tai Laing; -
 * tst ; — I; L; Tondi Songway Kiini; -


 * other concerns
 * ; — I; L; Tobagonian Creole English; - merge?
 * ; — I; L; Turks And Caicos Creole English; - merge with other similar Creole Englishes from that area?

Y

 * yhs — Yan-nhaŋu Sign Language - signed by 10 people, not that distinct from ygs

= Zapotec, Zhuang =

In each case, we have both the macrolanguage and a lot of dialect codes, which is incoherent.

Also, agi Agaria is spurious according to Glottolog.

= Languages to possibly rename or merge =

Sections with Chins not yet having family info, etc: T, V, W.

Languages to possible rename or merge:
 * 1) 		Antigua and Barbuda Creole English
 * 2) 		Judeo-Tunisian Arabic
 * 3) 		Judeo-Moroccan Arabic
 * 4) 		Guerrero Amuzgo
 * 5) 		Eastern Egyptian Bedawi Arabic
 * 6) 		Aruá (Amazonas State)
 * 7) 		Aruá (Rodonia State)
 * 8) 		Ivbie North-Okpela-Arhe
 * 9) 		Anus
 * 10) 		Austral


 * 1) 		Remontado Dumagat
 * (is this a distinct language? vide b.g.c. "Sinauna")


 * 1)   Emberá-Baudó
 * 2)   Indonesian Bajau
 * 3)   West Coast Bajau
 * 4)   Balkan Gagauz Turkish
 * 5)   Bohtan Neo-Aramaic
 * 6)   Vaghat-Ya-Bijim-Legeri
 * 7)   Biao-Jiao Mien


 * 1)   Rinconada Bikol
 * 2)   Bura-Pabir
 * 3)     Bafaw-Balong
 * 4)     Bu-Nao Bunu
 * 5)   Belize Kriol English
 * 6)   Kɛlɛngaxo Bozo


 * 1)   Cutchi-Swahili
 * 2)   Malaccan Creole Malay
 * 3)   Cinda-Regi-Tiyal
 * 4)   Chiga
 * 5)   Chocangacakha
 * 6)   Chinook jargon
 * 7)   Chipewyan
 * 8)   Cakchiquel-Quiché Mixed Language
 * 9)   Central Mnong
 * 10)   Chonyi-Dzihana-Kauma
 * 11)   Ucayali-Yurúa Ashéninka
 * 12)   Ajyíninka Apurucayali
 * 13)   Cappadocian Greek
 * 14)   Chinese Pidgin English
 * 15)   Pichis Ashéninka
 * 16)   South Ucayali Ashéninka
 * 17)   Chuanqiandian Cluster Miao
 * 18)   El Nayar Cora
 * 19)   Seselwa Creole French
 * 20)   Knaanic


 * check names
 * ; — I; L; Diriku;


 * other concerns
 * — Pidgin Delaware


 * — Dogri (individual language)
 * — Dogri (macrolanguage)


 * — Northwestern Dinka
 * — Dixon Reef
 * — Dar Daju Daju
 * — Dong
 * — Toura (Papua New Guinea)


 * — Fang (Cameroon)
 * — Fang (Equatorial Guinea)
 * — Flinders Island

= 2015 =

gjr, fnb, dtn

= special cases (split codes, etc) =

Gbaya
Gbaya is the name of quite a few languages. 'krs', a Gbaya of Sudan, is able to be called "Kresh". But there is also 'gba', the Gbaya family(!), many of the members of which are called Gbaya. See Gbaya languages. 'gba' may need to be split.

= codes to (possibly) delete =
 * , Mbo (Cameroon), is part of a dialect cluster with 'bss' – Akoose, 'bqz' – Kaka (Central Mbo), and 'bsi' – Sosi


 * White and Green Hmong could be merged into e.g. "Laotian Hmong". This could either use one of those lects' codes, or have an exceptional code, e.g. hmn-wgn (White+Green/Njua).

= existing codes to (possibly) rename =


 * (bdl) – Sulawesi [Bajaw]
 * (bdr) – Sabah West Coast [Bajaw]
 * (sjm) – Mapun [Bajaw]

= langrev functionality =

If you know a language's code (for example, "en") and you want to find out its canonical name, you can use this:
 * (returns "English")

If you know a language's canonical name (for example, "English") and you want to find out its code, use this:
 * (returns "en")

All of these functions are subst:able (type ...).