User talk:Helrasincke

Welcome Message
-- Apisite (talk) 03:19, 6 February 2023 (UTC)

Hi
Feel free to contact me if you have any ideas regarding improvement of Lithuanian content on Wiktionary. I'm fairly familiar with the language. 70.172.194.25 10:11, 9 February 2023 (UTC)

Frequency lists/Spanish/Wikipedia 45K (2016)
Hey. Thanks for the frequency lists? Can you make Frequency lists/Spanish/Wikipedia 45K (2016)? Van Man Fan (talk) 01:05, 15 March 2023 (UTC)


 * @Van Man Fan Any particular country? Otherwise I'll take a recent random selection. They will be much bigger than 45K. Helrasincke (talk) 07:07, 15 March 2023 (UTC)
 * @Van Man Fan I notice that you're already found the index page for Spanish, but I'm wondering if there's something wrong with the User:Matthias_Buchmeier. It should be much bigger than 45K too. I'd rather focus my efforts on less-resourced languages if possible. Helrasincke (talk) 07:11, 15 March 2023 (UTC)


 * Thanks for the reply. I'd say a random selection would be fine. The problem with User:Matthias_Buchmeier is that it is full of misspelling. Van Man Fan (talk) 08:56, 15 March 2023 (UTC)
 * @Van Man Fan Fair enough. That will unfortunately be a problem to a greater or lesser extent with all of these lists - there is a lot of white noise that is hard to filter out. I'll see what I can get to in the next week or so. In the meantime, you might also have luck in the other user generated lists linked at Frequency lists/Spanish - the wikisource one looks quite promising. Alternatively, you could try Category:Spanish_entry_maintenance (for instance, there seems to be a lot in Category:Spanish_redlinks). Helrasincke (talk) 12:12, 15 March 2023 (UTC)
 * @Van Man Fan I've uploaded the first 100K so far at Mixed 730K, see how you get on with that. I strongly suggest enabling the OrangeLinks gadget if you haven't already, since otherwise you'll be left with mostly proper nouns and brand names. Helrasincke (talk) 11:30, 16 March 2023 (UTC)


 * Awesome! That's just what I was after :) Van Man Fan (talk) 13:24, 16 March 2023 (UTC)
 * One comment - it is missing all the capital Ns! We're getting things like acional instead of Nacional etc. How weird! Van Man Fan (talk) 22:11, 16 March 2023 (UTC)
 * Thanks for bringing this to my attention. I did notice during processing that a few foreign terms were missing an initial letter in one or two places. The mistake seems to have been on my end, so I've fixed the script and reuploaded. P.s. nacional in lowercase will not occur in this list since my process removes all duplicate entries which are differentiated only by a capital letter, leaving the variant with the highest number of occurrences. Helrasincke (talk) 02:10, 17 March 2023 (UTC)

Standard Spelling
Hey. Can you change the spelling of the Acehnese language from Achinese to the standard Acehnese, it's because some people will probably be confused on what language you are talking about unless they search it themselves to know that it is an alternative spelling of Acehnese. OsbertDanielson (talk) 11:15, 16 March 2023 (UTC)


 * Thank you for pointing that out. To my knowledge they are both valid spellings in English, the Oxford for example spells it Achinese, but Wiktionary uses your suggested spelling for entries so I've changed it to reflect that. Helrasincke (talk) 11:27, 16 March 2023 (UTC)

Discord
If you have further questions and if you have discord, you can ask me more directly on the discord server. Vininn126 (talk) 11:04, 1 October 2023 (UTC)

Upper Sorbian
Please do not add accelerated forms of languages ​​you do not know, the pronunciation module is not ready yet and you are adding stress incorrectly in foreign words Stríðsdrengur (talk) 11:17, 15 March 2024 (UTC)


 * Thank you for bringing this to my attention. I was actually focussing on making sure the inflections were correct, you can see they've all been manually checked against the only/main publicly available academic source, since these auto-generated IPA always leave a little wiggle room for nitpicking. I can either remove the pronunciation lines (though why it's even implemented if it's not yet ready to deploy, I'll leave you have a hard think about) or else mark the pages as requiring attention, or you are welcome do so yourself if you prefer, since I've been cleaning up after you too (not that I mind, we all have our occasional oversights). Best, Helrasincke (talk) 11:36, 15 March 2024 (UTC)
 * The rule is with Upper Sorbian: native words generally have initial stress but loan words can have stress elsewhere, which is retained in inflected forms. The module is generally ready, I'd say. There might be some things missing but I think most is covered... But I agree you shouldn't edit languages you're not too familiar with. Vininn126 (talk) 11:40, 15 March 2024 (UTC)
 * Thanks for your input @Vininn126, that gives the whole issue a more actionable direction. So if I'm hearing you correctly, is not of concern, since it's presumably of native stock;  has non-initial stress as marked at the headword [adˈʀɛsa], so  should be /adˈʀɛsax/ instead of /ˈadʀɛsax/; whereas  /ˈabɔnɔvanjɛ/ is marked with initial stress at the headword (where one may be forgiven for expecting /aˈbɔnɔvanjɛ/ a la  /aˈbɔnɔwat͡ʃ/), so either my edits here are correct or someone has marked the headword incorrectly. Perhaps @Stríðsdrengur would like to chime-in, since they created the pronunciation section in the first place.
 * This all leads to the question of why, if a headword's stress is marked as non-initial, aren't the accelerated forms generating with the concomitant non-initial stress? Is this expected behaviour or a kink yet to be smoothed out? I'm hoping the latter, since the acceleration modules for East Slavic languages handle this without problem.
 * Your final point I'll take on board. I'll better inform myself next time. Helrasincke (talk) 12:31, 15 March 2024 (UTC)
 * I have not made the module able to detect stress in headwords or declension tables. Most languages don't. Vininn126 (talk) 12:32, 15 March 2024 (UTC)
 * @Vininn126 but wouldn't this be desirable? Otherwise why include it at all in the accelerated module? It would be fallacious to conclude that most languages don't do this because they neither want nor need it. A more likely explanation would be that most don't receive the attention of skilled coders waiting to optimise their WT coverage (though I admit there is an impressive and growing number of languages which do receive this level of love). Helrasincke (talk) 12:43, 15 March 2024 (UTC)
 * I do not have the tech skills to do that. Vininn126 (talk) 12:47, 15 March 2024 (UTC)

Update Yiddish JNW template
Should probably change 2023 (ongoing) to 2024 (ongoing). Not really sure how I could do that myself through the template page, so just letting you know to do that. Insaneguy1083 (talk) 21:29, 4 April 2024 (UTC)


 * @Insaneguy1083 Thanks for the heads up! It needed to be updated in the module, not the template. I'm not sure if my solution is particularly foolproof either, since someone to update it one day when (/if) ever it is finally completed. Btw, this may be of interest to you. Even without orange links enabled, there's a ton of words to grab. They should also all have correct unicode (which may mean that the interlanguage links won't work if other projects are using unicode ligatures). Helrasincke (talk) 22:21, 4 April 2024 (UTC)
 * I don't think JNW will ever be completed. Its sole purpose seems to be a big database of all the Yiddish words and word variants that the creator can find, and there's of course a lot of dictionaries and such to go through.
 * Thanks for the link of Yiddish words from other wikis. That said, I'm not nearly as active now with adding Yiddish entries as I had been last summer, when I was at an internship and was not given any work to do. These days I mostly just add the daily word on the Verterbukh website and call it a day. Right now I'm mostly focussing my efforts on Belarusian, Lithuanian and Macanese. Insaneguy1083 (talk) 22:34, 4 April 2024 (UTC)