Wiktionary:About Chinese/tasks

Below are some of the tasks pertinent to Chinese-language entries on Wiktionary.

Regular patrolling

 * 1) Recent changes to Chinese entries (can be customised – hide registered users, hide patrolled edits, etc.):
 * Changes to Category:Chinese lemmas
 * The 2,000 most recent unpatrolled changes to Chinese lemmas
 * 1)  or  linking to nonexistent pages:
 * Category:Chinese terms with uncreated forms

Entry maintenance

 * 1) Add the template  where applicable. (Example)
 * 2) Entries using  to link to an entry without a Chinese section:
 * Special:WhatLinksHere/Template:tracking/zh-see/no Chinese section found
 * Simplified entries only
 * 1) Entries using  to link to another variant character not containing :
 * Special:WhatLinksHere/Template:tracking/zh-see/unidirectional reference to variant
 * 1) Entries using  to link to entries without reciprocal mentioning in their :
 * Special:WhatLinksHere/Template:tracking/zh-see/unidirectional reference variant→orthodox
 * 1) Entries using  with gloss-less components:
 * Special:WhatLinksHere/Template:tracking/zh-forms/no gloss found for Chinese character (character)
 * Special:WhatLinksHere/Template:tracking/zh-forms/no gloss found but entry exists (blue-linked multisyllabic)
 * Special:WhatLinksHere/Template:tracking/zh-forms/no gloss found with a nonexistent entry (red-linked multisyllabic)
 * 1) Expansion and cleanup needed for character pages:
 * Details at User:Wyang/char-summary and User:Justinrleung/char-summary
 * 1) Entries with missing senses:
 * User:Tooironic/Chinese entries with missing senses
 * 1) Entries with Pinyin different from CC-CEDICT and/or Dictionary by the Ministry of Education of Taiwan:
 * User:Wyang/Pinyin-check
 * 1) Entries with missing Cantonese readings, sorted by importance:
 * User:Wyang/cmn-no-yue
 * 1) Compounds with Mandarin or Cantonese character readings not found in the individual character pages:
 * Cantonese: User:Wyang/check-yue-pron, search
 * 1) Compounds (ABCD) not mentioned in derived terms on the component pages (AB and CD)
 * Special:WhatLinksHere/Template:tracking/zh-forms/compounds not mentioned in derived terms on the component pages (for component with more than one character)
 * 1)  usage missing POS
 * zh-pron usage missing POS
 * Entries in the above category that are not hanzi
 * 1) Entries with multiple/problematic Hokkien readings lacking location labels
 * Category:Hokkien terms needing pronunciation attention
 * 1) Translations into Chinese to be checked (split by topolect) - Mandarin, Cantonese, etc.
 * Requests for review of Mandarin translations
 * Requests for review of Cantonese translations
 * 1) List the routinely consulted dictionaries under About Chinese/references.
 * 2) Clearing Han character categories for individual lects.
 * Mandarin Han characters
 * Cantonese Han characters
 * 1) Characters with mc/oc parameter but no data to show (e.g. 砼, which is incorrectly categorized into Category:Middle Chinese lemmas and Category:Old Chinese lemmas)
 * Special:WhatLinksHere/Template:tracking/zh-pron/Middle Chinese data not found
 * Special:WhatLinksHere/Template:tracking/zh-pron/Old Chinese data not found
 * 1) Invalid syllables in Module:zh/data/yue-word
 * 2) Characters with translingual definitions
 * 3) Entries lacking a  template
 * Module:zh-dial-syn/check-presence
 * 1) Characters with specific part of speech headers
 * Special:WhatLinksHere/Template:tracking/zh-forms/specific part of speech header used
 * 1) Entries using  with missing forms:
 * Special:WhatLinksHere/Template:tracking/zh-forms/entry possibly missing a simplified form
 * Special:WhatLinksHere/Template:tracking/zh-forms/entry possibly missing a variant form
 * 1) Compare Zhengzhang's reconstructions in the second edition of his book (2013) with those in the first edition (2003) and update the data modules (very minor differences from a quick look)
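
The check for compounds whose readings are not found on the individual character pages (listed above) can be sketched roughly as follows. This is only an illustration in Python, not the code behind User:Wyang/check-yue-pron. The function name `unattested_readings` and the toy `char_readings` table are assumptions made for the example, and a real run would take its data from the entries themselves.

```python
def unattested_readings(compound, compound_reading, char_readings):
    """Return (character, syllable) pairs whose syllable is not listed
    among the readings recorded for that character.

    compound          -- the headword, one Han character per syllable
    compound_reading  -- space-separated romanisation (e.g. Jyutping)
    char_readings     -- hypothetical mapping: character -> set of readings
    """
    syllables = compound_reading.split()
    if len(syllables) != len(compound):
        raise ValueError("expected exactly one syllable per character")
    return [
        (char, syllable)
        for char, syllable in zip(compound, syllables)
        if syllable not in char_readings.get(char, set())
    ]


# Toy data for illustration only; not copied from any data module.
toy_readings = {"銀": {"ngan4"}, "行": {"hang4", "hong4"}}

print(unattested_readings("銀行", "ngan4 hong4", toy_readings))  # -> []
print(unattested_readings("銀行", "ngan4 hong2", toy_readings))  # -> [('行', 'hong2')]
```

A check of this kind also flags legitimate changed tones and colloquial readings, so its output is a list for human review rather than a list of errors.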

Done but need to be regularly checked

 * 1) ✅ (24 Jun 2018) Entries with zh-pron but no definition section (e.g. 蒦).
 * Special:WhatLinksHere/Template:tracking/zh-forms/no definitions section found
 * 1) ✅ (24 Jun 2018) "Derived terms" L3 heading fix, on entries without a second PoS
 * link
 * 1) ✅ (24 Jun 2018) Entries with  instead of
 * entries

Entry creation

 * 1) Entries from Modern Standard Chinese Dictionary  missing on Wiktionary:
 * User:Wyang/basic-Chinese-words
 * 1) CEDICT entries missing on Wiktionary:
 * User:Wyang/cedict
 * 1) Chinese entries to add, by Tooironic:
 * User:Tooironic/Chinese entries to add
 * 1) Missing chengyu and suyu:
 * User:Tooironic/Chengyu list and User:Tooironic/Suyu list
 * 1) Missing entries with multiple topolect readings:
 * User:Wyang/desired
 * 1) Requested entries on Wiktionary (note many of these are rare and low-priority):
 * Requested entries (Chinese)
 * 1) Red-linked Chinese terms in existing translations:
 * User:Matthias_Buchmeier
 * 1) Appendix:Mandarin Frequency lists (the appendices require some attention - duplicates, definitions and pinyin among others):
 * Appendix:Mandarin Frequency lists
 * 1) Wu Chinese requiring transliterations (this is a request list; it is important to add the readings to the actual Chinese entries if the Wu readings are valid and applicable):
 * Requested_entries_(Chinese)

Technical

 * 1) Fix the disappearance of spaces in Pinyin after erhua or toneless-ification in.
 * 2)  argument and layout redesign - e.g. ,  ,  ,  , Quanzhou: ge̍rh  {白} , goa̍t  {文}.
 * 3) Add phonetic bopomofo, e.g. "ㄑㄧㄥˊ ㄅㄨˋ ㄗˋ ㄐㄧㄣ [Phonetic: ㄑㄧㄥˊ　ㄅㄨˊ　ㄗˋ　ㄐㄧㄣ]" for 情不自禁.
 * 4) Redesign modules for dialectal pronunciation table for single-character entries, e.g. more flexible entry of pronunciations, no display of location if there is no pronunciation for that location, incorporation of Module:zh/data/dial.
 * 5) Entry header level check, best if automatic whenever there is a ==Chinese== section. Headers should have defined relative levels.
 * 6) List Cantonese characters with only one pronunciation, and generate a list of compound pronunciations for entries making use of these characters which are currently pronunciation-less. Batch upload after review. Make  automatically add the Cantonese pronunciation if it is entirely predictable.
 * 7) See points #9, 10 in  above.
 * 8) Rewrite About Chinese/references in Lua, allowing one to locate the (highlighted) specific reference after the click.
 * 9) Error check functions in Module:zh-pron and its submodules, detecting errors such as: (1) incorrect diacritic placement in Pinyin; (2) invalid Jyutping syllables; (3) space before and after the parameter names (e.g.  ); etc.
 * 10) General display improvement, more dynamic features (e.g. hover), more JavaScript usage, etc.
 * 11) Think about a general JSON-like structure for storing term data and incorporate IDs for senses, instead of the current ad hoc extraction of glosses by.
 * 12) Centralise all usage examples and quotations in a backend database and assign each word in the example to a sense in the entry. Make the database queryable. Automatically display examples and quotes on the entry.
 * 13) Synonyms, Antonyms, See-also terms and Dial-syn display format redesign, with terms placed directly underneath senses with an improved layout, perhaps with drop-downs similar to quotations ▼.
 * 14) IPA for the sub-dialects of Teochew: Bangkok, Cambodia, Chaoyang (潮陽), Chaozhou (潮州), Chenghai (澄海), Huilai (惠來), Jieyang (揭陽), Kalimantan, Nan'ao County (南澳), Puning (普寧), Raoping (饒平), Shantou (汕頭).
 * 15) Support for more sub-dialects of the Southern group of :,.
 * 16) Support for more  lects (with use of Wugniu):, , ?
 * 17) Support for more lects, including:
 * 18) ;Dialects with known transliteration systems
 * 19) * Southern Wu (Wugniu or otherwise -- Wenzhounese - About_Chinese/Wenzhounese; maybe "w-o" or "w-w" for the parameter?)
 * 20) * Northeastern Mandarin (Harbin; maybe "m-h" or "m-ne" as a parameter)
 * 21) * Hainanese Min ( or Hainanese Pinyin; "mh" for Wenchang and "mh-h" for Haikou, for the parameter?)
 * 22) * Pu-Xian Min (cpx or mp; see )
 * 23) * Taiwanese Hakka (maybe "h-t"; Hailu, Dabu, Raoping, Zhao'an) - Taiwanese Hakka Romanization System
 * 24) * Shaozhou Tuhua ("sz" or "st"; see this document from Unicode)
 * 25) * Lower Yangtze Mandarin (Yangzhou; see 扬州话拉丁化字母表; "m-y"?; Nanjing; see 南京話拉丁化方案; "m-l" or "m-n"?)
 * 26) * Other Christian-Romanized lects: Ningbo Wu, Hangzhou Wu, Taizhou Wu, Jinhua Wu, etc.; see 教會羅馬字
 * 27) ;Dialects without known transliteration systems
 * 28) * (maybe list it, separately, as "mn-h", with "sw" for Shanwei, "hf" for Haifeng, "lf" for Lufeng)
 * 29) * Huizhou (or simply Hui, with either "hz" or "hu" for the parameter respectively)
 * 30) * Mai Chinese ("ma"?)
 * 31) * Danzhou Chinese ("dz"?)
 * 32) * (maybe "mdt/m-dt"?)
 * 33) * Shao-Jiang Min (maybe "sm"?)
 * 34) * Central Min (most certainly "mz" as a parameter; an in-house transliteration system for Central Min could be similar to the one for Northern Min)
 * 35) * Pinghua ("p"?)
 * 36) * (wxa)
 * 37) Linking  to relevant etymology/pronunciation sections instead of default Chinese section (when required). See Talk:攰 for more.
 * 38) Automatic pinyin transliteration of compounds with different pronunciations in mainland China and Taiwan based on Module:zh/data/cmn-tag. See Talk:擁 for more.
 * 39) Inclusion of compound words from 《兩岸萌典》 when  is invoked. The current output from 《萌典》 contains a significant number of classical Chinese compounds and lacks certain terms commonly used in vernacular speech. Compare 崗（兩岸萌典） and 崗（萌典）
 * 40) See this revision for an example.
 * 41) ✅ Adding  to zh-pron (First attempt )
 * 42) Support for.
 * 43) Existing simplified forms that are not using newly-encoded characters (example: 牛犅萤→牛𰠫萤).
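
Error check (1) in item 9 above, incorrect diacritic placement in Pinyin, could look something like the sketch below. It is written in Python for illustration only (Module:zh-pron itself is Lua), and all function names are hypothetical. It assumes the usual placement convention: the tone mark goes on a or e if present, on the o of ou, and otherwise on the last vowel. It works on one syllable at a time and does not try to handle ê.

```python
import unicodedata

TONE_MARKS = {"\u0304", "\u0301", "\u030c", "\u0300"}  # macron, acute, caron, grave
DIAERESIS = "\u0308"
VOWELS = set("aeiouü")


def split_syllable(syllable):
    """Return (letters, marked) where letters is the list of base letters
    (with ü kept as one letter) and marked lists the indices of letters
    that carry a tone mark."""
    letters, marked = [], []
    for ch in unicodedata.normalize("NFD", syllable.lower()):
        if ch == DIAERESIS and letters and letters[-1] == "u":
            letters[-1] = "ü"
        elif ch in TONE_MARKS:
            if letters:
                marked.append(len(letters) - 1)
        else:
            letters.append(ch)
    return letters, marked


def expected_tone_index(letters):
    """Index of the letter that should carry the tone mark, or None if
    the syllable has no vowel (e.g. syllabic m or n)."""
    vowel_positions = [i for i, ch in enumerate(letters) if ch in VOWELS]
    if not vowel_positions:
        return None
    for target in ("a", "e"):
        if target in letters:
            return letters.index(target)
    joined = "".join(letters)
    if "ou" in joined:
        return joined.index("ou")
    return vowel_positions[-1]


def tone_mark_is_well_placed(syllable):
    """True if the syllable is toneless or its single tone mark sits on
    the expected letter."""
    letters, marked = split_syllable(syllable)
    if not marked:
        return True
    if len(marked) > 1:
        return False
    expected = expected_tone_index(letters)
    if expected is None:
        return True  # no vowel to compare against; leave for manual review
    return marked[0] == expected


for s in ["hǎo", "haǒ", "guì", "gùi", "lǜ"]:
    print(s, tone_mark_is_well_placed(s))
```

Decomposing to NFD first means precomposed letters such as ǚ need no lookup table: the diaeresis and the tone mark arrive as separate combining characters.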