Module talk:zh/data/dial-pron/牛

Adding dialects
How to add dialects? Lindasc (talk) 06:55, 13 July 2018 (UTC)


 * (... —Suzukaze-c◇◇ 06:58, 13 July 2018 (UTC))


 * Hi and welcome. At the moment it unfortunately doesn't support other dialects. Can I ask what kind of dialectal data you would like to add? If it's a large amount of data we can restructure the dialectal pronunciation system, and make that possible. Wyang (talk) 07:07, 13 July 2018 (UTC)


 * Thank you for information. There are many subdialects under a regional language, for example, Wu Chinese speaking region covers tens of cities and each city has its own dialect that may be added in the future individually. I think the current dialect list is not human readable because it's only a large array with no key-value pairs. I suggest to restructure it to dictionary-like system. Lindasc (talk) 10:15, 13 July 2018 (UTC)


 * That's true. When the data was originally added, this format was used since the data seemed to be the only large, freely available pronunciation data. While unpaired, it also has the benefit of preventing sporadic additions which often use inconsistent notations, e.g. "thien55", "thiɛn55", "thian55", "thian5", etc. for 天 in Beijing Mandarin. If there is a need to batch-add new data, then this can be changed to a key-value system which would be clearer and easier to maintain. Wyang (talk) 10:28, 13 July 2018 (UTC)


 * We can apply individual checks on certain dialect keys in a key-value system to prevent inconsistent notations, just like what Template does. For example,   etc. Lindasc (talk) 10:47, 13 July 2018 (UTC)


 * Yes, a check for each dialect would be needed if we allow customised additions. The format of these individual pages can parallel that of Module:zh/data/dial-syn (scroll down), and we already have the dialectal attributes stored at Module:zh/data/dial, which is handy. Can I ask ― how much data would you like to add, and what is the data? Do you know how to write Lua and Wiki codes? Wyang (talk) 11:03, 13 July 2018 (UTC)