Module talk:zh/data

Re: Simplified character of 著 is 着
Is it always? Or doesn't it matter? BTW, I got the reading "jìzhuò" for 記著/记着 from the Chinese Wiktionary and also found it on the web. My dictionary doesn't have this word. --Anatoli (обсудить/вклад) 07:28, 4 June 2014 (UTC)
 * You are more trustworthy on Simplified Chinese matters. AFAIK, the Chinese Wiktionary article was generated by a bot, and I have a link to a singer pronouncing it as ji4zhe. --kc_kennylau (talk) 07:34, 4 June 2014 (UTC)
 * I trust your knowledge on this one more than the Chinese Wiktionary and I saw your edit summary with the link. That's why I made my recent edits using [zhe]. --Anatoli (обсудить/вклад) 07:36, 4 June 2014 (UTC)
 * Yes, traditional 著 corresponds to both simplified 著 (famous) and 着 (grammatical particle). We probably should make a separate small conversion table in Module:zh-usex (like the pronunciation conversion table), and make 著 -> 着 unless restricted, since the particle is much more common. Wyang (talk) 23:56, 4 June 2014 (UTC)
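The conversion-table idea above could be sketched roughly as follows. This is written in Python purely for illustration (the module itself is Lua), and the names `DEFAULT_TS`, `WORD_EXCEPTIONS`, and `to_simplified` are hypothetical, as is the short sample list of exception words. It handles only 著 and leaves every other character untouched.

```python
# Hypothetical per-character default: the particle 着 is the common case.
DEFAULT_TS = {"著": "着"}

# Hypothetical sample of "restricted" words in which 著 stays 著.
WORD_EXCEPTIONS = ["著名", "著作"]


def to_simplified(text: str) -> str:
    """Convert 著 to 着 unless it sits inside a listed exception word."""
    protected = set()  # indices of characters that must not be converted
    for word in WORD_EXCEPTIONS:
        start = text.find(word)
        while start != -1:
            protected.update(range(start, start + len(word)))
            start = text.find(word, start + 1)
    return "".join(
        ch if i in protected else DEFAULT_TS.get(ch, ch)
        for i, ch in enumerate(text)
    )
```

With this sketch, `to_simplified("看著")` converts the particle, while the 著 inside 著名 is left alone.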

嚮
Guys, could you add 嚮 - pinyin and simplified? See 嚮往, it didn't work as expected. --Anatoli (обсудить/вклад) 00:23, 14 June 2014 (UTC)

Taiwanese 俄, Cantonese alternative pronunciations
Apparently 俄 is (or can be) pronounced with the fourth tone in Taiwan: "è", not "é". Could you confirm, please? If that is correct, please add the alternative readings.

Also, should alternative Cantonese pronunciations be included for 俄 (ngo4, o4), just as 我 has ngo5 and o5? What about initial "n" pronounced as "l", e.g. 你 nei5, lei5? --Anatoli (обсудить/вклад) 00:20, 23 June 2014 (UTC)
 * No, ngo5 and o5 are not Cantonese readings of the aforementioned character. The pronunciation "lei5" is considered non-standard in Hong Kong. --kc_kennylau (talk) 08:50, 23 June 2014 (UTC)
 * Well, these are . Some of these lazy pronunciations or hypercorrections have become so commonplace that not using them in Hong Kong would sound wrong. eg. Wyang (talk) 23:57, 23 June 2014 (UTC)

Error on 崖 Mainland/Taiwan
Frank, could you help, please? 懸崖 and 悬崖 have errors. They should produce xuányá (Mainland) and xuán'ái (Taiwan), where the Zhuyin is given as ㄒㄩㄢˊ　｜ㄞˊ. Is "｜" a vertical bar or Zhuyin ㄧ? --Anatoli (обсудить/вклад) 09:07, 26 June 2014 (UTC)


 * If it's "yái", not "ái", then Zhuyin generation needs to be fixed as well, please. --Anatoli (обсудить/вклад) 09:12, 26 June 2014 (UTC)


 * The cause is that the Taiwanese spelling gets converted to xuanai instead of xuan'ai. --kc_kennylau (talk) 09:58, 26 June 2014 (UTC)
 * Thank you. It was failing when it was "yái" originally, which is actually correct. dict.revised.moe.edu.tw uses vertical bar "｜" instead of "ㄧ", that's why I was confused. So the Taiwanese reading is "yái" (ㄧㄞˊ). And "xuányái" shouldn't cause any problem, should it? I really don't know how to fix it. I guess I can put "yái" back but it was failing before. --Anatoli (обсудить/вклад) 11:01, 26 June 2014 (UTC)
 * (I'll fix it after putting this comment here.) --kc_kennylau (talk) 11:05, 26 June 2014 (UTC)
 * Apparently "yai" must be made a valid pinyin syllable (used in Taiwanese Mandarin) but I don't know how. 崖 must be the only character that uses it. --Anatoli (обсудить/вклад) 11:12, 26 June 2014 (UTC)
 * Better now? --kc_kennylau (talk) 11:20, 26 June 2014 (UTC)
 * Yes, thank you, Kenny! --Anatoli (обсудить/вклад) 11:22, 26 June 2014 (UTC)
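The xuanai / xuan'ai problem discussed above comes down to the standard pinyin rule that a non-initial syllable beginning with a, e, or o is preceded by an apostrophe. A minimal Python sketch of that rule follows; it is not the module's actual Lua code, and `join_pinyin` is a hypothetical helper name.

```python
import unicodedata


def join_pinyin(syllables):
    """Join pinyin syllables, adding an apostrophe before a non-initial
    syllable that starts with a, e, or o (tone marks are ignored)."""
    if not syllables:
        return ""
    out = syllables[0]
    for syl in syllables[1:]:
        # Decompose so that e.g. "á" yields the base letter "a" first.
        base = unicodedata.normalize("NFD", syl)[0].lower()
        if base in "aeo":
            out += "'"
        out += syl
    return out
```

Under this rule a reading such as "yái" needs no apostrophe, because it begins with y, whereas "ái" does.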

Multiple Alternate Pronunciations in Taiwan and Mainland
What do I do if the pronunciations aren't as simple as one reading for Taiwan and one for the Mainland? Take, for example, the character 揣, which is read as chuai3, chuai4, or chuai1 in the Mainland, but only as chuai3 in Taiwan.

In the word 揣測 (揣测), both sides read this as chuai3 ce4.

In the word 掙揣 (挣揣) however, Mainland reads this as zheng4 chuai4, but Taiwan reads this as zheng4 chuai3.

In the word 揣與 (揣与), Mainland reads this as chuai1 yu3, but Taiwan reads this as chuai3 yu3.

So, how do I represent this in the module? Thanks. --Mar vin kaiser (talk) 14:39, 8 February 2015 (UTC)
 * This probably can't be done in the module, only in the individual entry's pronunciation section, by providing variants and notes. --Anatoli T. (обсудить/вклад) 03:49, 9 February 2015 (UTC)
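To make the difficulty concrete, here is a rough Python sketch of what word-level (rather than character-level) data would look like for the three examples above. The readings are exactly those quoted in this thread; the table `WORD_READINGS` and the helper `reading_notes` are hypothetical and are not part of the module.

```python
# Readings as quoted in the discussion (tone-number pinyin).
WORD_READINGS = {
    "揣測": {"Mainland": "chuai3 ce4", "Taiwan": "chuai3 ce4"},
    "掙揣": {"Mainland": "zheng4 chuai4", "Taiwan": "zheng4 chuai3"},
    "揣與": {"Mainland": "chuai1 yu3", "Taiwan": "chuai3 yu3"},
}


def reading_notes(word):
    """Return a single reading when both regions agree,
    otherwise one region-labelled variant per region."""
    readings = WORD_READINGS[word]
    if len(set(readings.values())) == 1:
        return [next(iter(readings.values()))]
    return [f"{reading} ({region})" for region, reading in readings.items()]
```

Because the choice of reading depends on the word and not on the character alone, a per-character table cannot express it, which is why per-entry variants and notes are suggested.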

Code size
The module's code is extremely long and makes it difficult to edit. Wouldn't it be better to host this information in separate modules? —suzukaze (t・c) 22:49, 31 July 2015 (UTC)
 * Agreed. It is easy to do so. --Octahedron80 (talk) 04:26, 17 April 2016 (UTC)

duplicated keys
I checked the 'st' table and found 40 pairs of duplicated keys. (I haven't checked the 'ts' table yet.) What can we do with these? A key can only return one value, so the data for both entries of a pair cannot come out.

--Octahedron80 (talk) 02:10, 17 April 2016 (UTC)
 * I think we should keep the more common ones, for example,  instead of  . — justin(r)leung { (t...) 03:33, 17 April 2016 (UTC)
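A check for duplicated keys like the one described above could be sketched as follows, in Python for illustration only. The helper `find_duplicate_keys` is hypothetical, and the sample pairs are a generic illustration of a one-to-many simplified-to-traditional mapping; they are not claimed to be among the 40 pairs found in the module.

```python
from collections import defaultdict


def find_duplicate_keys(pairs):
    """Map each key that occurs more than once to all of its values, in order."""
    seen = defaultdict(list)
    for key, value in pairs:
        seen[key].append(value)
    return {key: values for key, values in seen.items() if len(values) > 1}


# Illustrative sample only, not taken from the module's 'st' table.
sample = [("发", "發"), ("东", "東"), ("发", "髮")]
duplicates = find_duplicate_keys(sample)

# A plain mapping built from the same pairs silently keeps one value per key
# (in Python, the last one), which is why the duplicates need a decision.
collapsed = dict(sample)
```

Running the check before deciding which value to keep makes the silently dropped entries visible.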