Wiktionary:Forms and spellings/sample

To aid in the discussion at Wiktionary talk:Alternative spellings, here is a sample of articles using any of these headings (standard and nonstandard): Alternative spellings, Alternate forms, Roman spelling, Alternative forms, Devanagari spelling, Cyrillic spelling, Forms and variants, Urdu spelling, Latin/Roman spelling, Alternative Spelling, Variants, Alternate spellings, Alternate spelling, Older forms, Other Forms, Devanāgarī spelling, Alternative spelling, Alternative Spellings.

The sample was taken from WiktionaryDev, which has been randomly importing one article from Wiktionary per hour for a couple of months. There are less than 2,000 articles so far and each one with one of the above headings is included below.

I've (partially) analysed some of them but I'm running out of time.


 * 1) کرنا &rarr; करना (different script)
 * 2) چمکنا &rarr; चमकना (different script)
 * 3) چابی &rarr; चाबी (different script), چابھی
 * 4) šokk &rarr; shokk
 * 5) τσούχτρα &rarr; τσούκτρα
 * 6) a &rarr; á (Spanish obsolete)
 * 7) aardvark &rarr; aard-vark (compounding variant)
 * 8) assegai &rarr; assagai, assagaie, assagay, assegay, azagaia, hassagay, hassaguay, zagaie, zagaye (most of them obsolete)
 * 9) Basic &rarr; BASIC (capitalization)
 * 10) BASIC &rarr; Basic (capitalization)
 * 11) blimey &rarr; blimey O'Reilly, blimey O'Riley, cor blimey, gawd blimey, gorblimey (variants maybe but not alternative spellings)
 * 12) Bo &rarr; Beau (Dutch given name), Bosse (Swedish hypocoristic form??)
 * 13) bo &rarr; boh (Italian interjection)
 * 14) bushbaby &rarr; bush baby (compounding variant)
 * 15) café &rarr; café (accentuation)
 * 16) cafe &rarr; cafe (accentuation)
 * 17) cloche &rarr; clutch, cloch (Spanish)
 * 18) Cuna &rarr; Kuna
 * 19) лифт &rarr; lift (different script)
 * 20) презиме &rarr; prezime (different script)
 * 21) див &rarr; div (different script)
 * 22) дивљи &rarr; divlji (different script)
 * 23) џиновски &rarr; džinovski (different script)
 * 24) dogend &rarr; dog-end, dog end (compounding variants)
 * 25) сено &rarr; seno (different script)
 * 26) façade &rarr; facade (accentuation)
 * 27) facade &rarr; façade (accentuation)
 * 28) gannai &rarr; kunnai (minority language)
 * 29) Großbritannien &rarr; Grossbritannien (different national orthography)
 * 30) have another think coming &rarr; have another thing coming
 * 31) hiccup &rarr; hiccough
 * 32) homma &rarr; homa (minority language)
 * 33) horsefly &rarr; horse fly (compounding variant)
 * 34) I haven't the foggiest &rarr; I haven't the foggiest idea, I haven't the foggiest notion (variants of idiomatic phrases yes but not alternative spellings)
 * 35) تھوک &rarr; थूक (different script)
 * 36) Ian &rarr; Iain
 * 37) id &rarr; ide (fish), id. (abbreviation)
 * 38) Kefalovryssion &rarr; Kefalovrysi, Kefalovryssi, Kefalovrisi, Kefalovrissi (all marked as alternate), Kefalovrysion, Kefalovrision, Kefalovrission (all marked as older forms)
 * 39) koala &rarr; coala (Spanish)
 * 40) kreativnost &rarr; креативност (different script)
 * 41) kuna &rarr; guna (minority language)
 * 42) Kuna &rarr; Cuna
 * 43) la &rarr; lade (Swedish, seems to be alternate past tense form rather than alternative spelling)
 * 44) learnt &rarr; learned
 * 45) miâ-jī &rarr; miâ-lī
 * 46) milliwatt &rarr; milli-watt (compounding variant)
 * 47) Moctezuma &rarr; Montenchuma, Muteçuma, Moteçuma, Montezúma, Moctezoma, Motecuzoma, Motecuhzoma, Moctezuma, Moteuczoma, Montezuma (different forms of foreign proper name used over varying periods of time)
 * 48) more &rarr; море (different script)
 * 49) naïve &rarr; naif (different pronunciation too?), naive (accentuation)
 * 50) naive &rarr; naif, naïf, naïve (note more variants listed than for previous entry!)
 * 51) nooblet &rarr; n00blet, newblet, nublet
 * 52) Nova Jorca &rarr; Nova York
 * 53) キリン &rarr; 麒麟 (different script)
 * 54) カエサル &rarr; シーザー (different pronunciation too)
 * 55) one &rarr; оне (different script)
 * 56) भूगोल &rarr; بھوگول (different script)
 * 57) धूना &rarr; دھونا (different script)
 * 58) otorhinolaryngology &rarr; otolaryngology
 * 59) ころ &rarr; ごろ (variant pronunciation too)
 * 60) palaeontography &rarr; paleontography
 * 61) panentheist &rarr; pan-en-theist, Panentheist, PanenTheist, Pan-en-theist, Pan-en-Theist (both hyphenation and capitalization vary)
 * 62) philibeg &rarr; filibeg
 * 63) prefect &rarr; præfect (æ &rarr; ae &rarr& e development)
 * 64) prosty &rarr; prostie
 * 65) Pushto &rarr; Pashto, Pashtu, Poshto, Pushtu (romanization of script vs romanization of various regional pronunciations?)
 * 66) résumé &rarr; resume, resumé (accentuation)
 * 67) Renée &rarr; Renee
 * 68) scorpius &rarr; scorpio, scorpios
 * 69) shmuck &rarr; schmuck (English vs German spelling of ʃ sound)
 * 70) signaler &rarr; signaller (l vs ll in US vs UK)
 * 71) ἰδέα &rarr; εἰδέα, ἰδέη
 * 72) Ἰούδας &rarr; Ἰουδά
 * 73) Ἰερεμίας &rarr; Ἱερεμίας
 * 74) stumbling block &rarr; stumbing-block (compounding variants)
 * 75) sulfuric acid &rarr; sulphuric acid (from US vs UK variants of sulfur/sulphur)
 * 76) tessellate &rarr; tesselate (l vs ll in US vs UK)
 * 77) tire-pressure &rarr; tyre-pressure (from US+Canada vs UK variants of tire/tyre)
 * 78) touch-tone &rarr; touchtone (compounding variants)
 * 79) 違い &rarr; 違う (Related terms seems more appropriate)
 * 80) whack-a-mole &rarr; Whac-A-Mole, whac-a-mole
 * 81) Yana &rarr; Яна (I didn't think Bulgarian was ever written in Latin script!)
 * 82) you &rarr; ya, yah, yer, -cha, -ja, u, yoo, eu, iow, yew, yewe, yo, yoow, youe, yow, yowe, yu, yw, ȝewe, ȝhow, ȝhu, ȝo, ȝou, ȝoue, ȝow, ȝowe