Wiktionary:Forms and spellings/sample/User:Ruakh

To aid in the discussion at Wiktionary talk:Alternative spellings, here is a sample of articles using any of these headings (standard and nonstandard): Alternative spellings, Alternate forms, Roman spelling, Alternative forms, Devanagari spelling, Cyrillic spelling, Forms and variants, Urdu spelling, Latin/Roman spelling, Alternative Spelling, Variants, Alternate spellings, Alternate spelling, Older forms, Other Forms, Devanāgarī spelling, Alternative spelling, Alternative Spellings.

The sample was taken from WiktionaryDev, which has been randomly importing one article from Wiktionary per hour for a couple of months. There are less than 2,000 articles so far and each one with one of the above headings is included below.

I've (partially) analysed some of them but I'm running out of time.


 * Hindustani — Hindi uses Devanagari, Urdu uses an Arabic-based script:
 * کرنا &harr; करना
 * چمکنا &harr; चमकना
 * چابی &harr; چابھی &harr; चाबी (two Urdu spellings)
 * تھوک &harr; थूक
 * šokk &harr; shokk
 * τσούχτρα &harr; τσούκτρα
 * a &harr; á (Spanish obsolete)
 * Compounds — space, hyphen, fusion:
 * aardvark &harr; aard-vark
 * bushbaby &harr; bush baby
 * dogend &harr; dog-end, dog end
 * horsefly &harr; horse fly (compounding variant)
 * milliwatt &harr; milli-watt (compounding variant)
 * stumbling block &harr; stumbing-block (compounding variants)
 * touch-tone &harr; touchtone (compounding variants)
 * assegai &harr; assagai, assagaie, assagay, assegay, azagaia, hassagay, hassaguay, zagaie, zagaye (most of them obsolete)
 * Capitalization — All-caps vs. initial caps:
 * Basic &harr; BASIC
 * blimey &rarr; blimey O'Reilly, blimey O'Riley, cor blimey, gawd blimey, gorblimey (variants maybe but not alternative spellings)
 * Bo &rarr; Beau (Dutch given name), Bosse (Swedish hypocoristic form??)
 * bo &rarr; boh (Italian interjection)
 * Level of assimilation into English:
 * café &harr; cafe (accentuation)
 * facade &harr; façade (accentuation)
 * naïve &harr; naive &harr; naif &harr; naïf (accentuation; retaining foreign gender distinction or not)
 * résumé &harr; resume &harr; resumé (accentuation)
 * Renée &harr; Renee (accentuation)
 * shmuck &rarr; schmuck (English vs German spelling of ʃ sound)
 * cloche &rarr; clutch, cloch (Spanish)
 * Cuna &rarr; Kuna
 * лифт &rarr; lift (different script)
 * презиме &rarr; prezime (different script)
 * див &rarr; div (different script)
 * дивљи &rarr; divlji (different script)
 * џиновски &rarr; džinovski (different script)
 * сено &rarr; seno (different script)
 * gannai &rarr; kunnai (minority language)
 * Großbritannien &rarr; Grossbritannien (different national orthography)
 * have another think coming &rarr; have another thing coming
 * hiccup &rarr; hiccough
 * homma &rarr; homa (minority language)
 * I haven't the foggiest &rarr; I haven't the foggiest idea, I haven't the foggiest notion (variants of idiomatic phrases yes but not alternative spellings)
 * Ian &rarr; Iain
 * id &rarr; ide (fish), id. (abbreviation)
 * Kefalovryssion &rarr; Kefalovrysi, Kefalovryssi, Kefalovrisi, Kefalovrissi (all marked as alternate), Kefalovrysion, Kefalovrision, Kefalovrission (all marked as older forms)
 * koala &rarr; coala (Spanish)
 * kreativnost &rarr; креативност (different script)
 * kuna &rarr; guna (minority language)
 * Kuna &rarr; Cuna
 * la &rarr; lade (Swedish, seems to be alternate past tense form rather than alternative spelling)
 * learnt &rarr; learned
 * miâ-jī &rarr; miâ-lī
 * Moctezuma &rarr; Montenchuma, Muteçuma, Moteçuma, Montezúma, Moctezoma, Motecuzoma, Motecuhzoma, Moctezuma, Moteuczoma, Montezuma (different forms of foreign proper name used over varying periods of time)
 * more &rarr; море (different script)
 * nooblet &rarr; n00blet, newblet, nublet
 * Nova Jorca &rarr; Nova York
 * キリン &rarr; 麒麟 (different script)
 * カエサル &rarr; シーザー (different pronunciation too)
 * one &rarr; оне (different script)
 * भूगोल &rarr; بھوگول (different script)
 * धूना &rarr; دھونا (different script)
 * otorhinolaryngology &rarr; otolaryngology
 * ころ &rarr; ごろ (variant pronunciation too)
 * palaeontography &rarr; paleontography
 * panentheist &rarr; pan-en-theist, Panentheist, PanenTheist, Pan-en-theist, Pan-en-Theist (both hyphenation and capitalization vary)
 * philibeg &rarr; filibeg
 * prefect &rarr; præfect (æ &rarr; ae &rarr; e development)
 * prosty &rarr; prostie
 * Pushto &rarr; Pashto, Pashtu, Poshto, Pushtu (romanization of script vs romanization of various regional pronunciations?)
 * scorpius &rarr; scorpio, scorpios
 * signaler &rarr; signaller (l vs ll in US vs UK)
 * ἰδέα &rarr; εἰδέα, ἰδέη
 * Ἰούδας &rarr; Ἰουδά
 * Ἰερεμίας &rarr; Ἱερεμίας
 * sulfuric acid &rarr; sulphuric acid (from US vs UK variants of sulfur/sulphur)
 * tessellate &rarr; tesselate (l vs ll in US vs UK)
 * tire-pressure &rarr; tyre-pressure (from US+Canada vs UK variants of tire/tyre)
 * 違い &rarr; 違う (Related terms seems more appropriate)
 * whack-a-mole &rarr; Whac-A-Mole, whac-a-mole
 * Yana &rarr; Яна (I didn't think Bulgarian was ever written in Latin script!)
 * you &rarr; ya, yah, yer, -cha, -ja, u, yoo, eu, iow, yew, yewe, yo, yoow, youe, yow, yowe, yu, yw, ȝewe, ȝhow, ȝhu, ȝo, ȝou, ȝoue, ȝow, ȝowe