User talk:Jberkel/2023

This page shows conversations on my talkpage from 2023.

Purpose of Category:American Spanish
Hello Jberkel, I was looking through Category:Regional Spanish recently and found Category:American Spanish which was created by you in 2021. I am not quite sure what the purpose of the category is in comparison to Category:United States Spanish and Category:Latin American Spanish. Can you clarify why you created Category:American Spanish? Thanks &mdash;The Editor's Apprentice (talk) 22:26, 26 January 2023 (UTC)


 * Looks like it was a mistake and should be United States Spanish. Adding "US" to an "es" label automatically categorizes as "American Spanish" when it should be . This is not obvious. I'll fix the entries, but the category might fill up again. Not sure if there's a better solution. Jberkel 23:12, 26 January 2023 (UTC)
 * I see. Thanks for emptying the category, it sounds like our label system needs to be overhauled. The source of the problem is apparently Module:labels/data/regional where the "US" label is associated with "American" categories. The specific "US Spanish" label is defined it Module:labels/data/lang/es. The two resolutions that I see are either creating some sort of exception to how the "US" label is processed for Spanish, if such a thing is possible, or changing the way the "US" label is interpreted for all languages, probably to associate it with "United States" categories. In my view, the second solution is preferable. It seems this gets into Beer parlour territory so I might start a discussion there in a bit if you aren't interested in doing so yourself. &mdash;The Editor's Apprentice (talk) 00:20, 27 January 2023 (UTC)
 * Please go for it. Jberkel 00:32, 27 January 2023 (UTC)

Quote templates
The problem with these is that they don't always refer to the same edition as the original quote given, and information is being lost. On Sazerac, for example, I gave a citation from p. 415 of the Vintage 2007 edition, which has now been templatized to link to p. 415 of some other edition where the word in question does not appear. So these need to be checked more carefully. Ƿidsiþ 06:56, 20 February 2023 (UTC)


 * Ok, unfortunately that edition isn't available on IA. I'll re-check them manually. Jberkel 07:44, 20 February 2023 (UTC)

Zuckerfest
Could you take a look at the etymology in this newly added entry and fix it up please? It needs to be "templatised" tbh and I feel it's best if I leave it to someone who actually knows the language. Acolyte of Ice (talk) 13:29, 22 February 2023 (UTC)
 * Done, IP was quicker :) – Jberkel 15:05, 22 February 2023 (UTC)

mk.jsonl.bz2
Hello! How can I download this file? Whenever I try, it says "404 Not Found". Thanks. Gorec (talk) 19:56, 2 March 2023 (UTC)


 * The files didn't get generated correctly on the last run, I'll include them in the March update. – Jberkel 20:43, 2 March 2023 (UTC)
 * ✅. – Jberkel 20:20, 6 March 2023 (UTC)
 * Thanks! 👍 -- Gorec (talk) 20:21, 6 March 2023 (UTC)

knob
Why have you changed the format of the dates for the citations used in our knob entry? Surely it is more informative to provide a day, month and year for cites when that is known to us rather than simply the year. I would say that it's preferable to provide our readers with more information rather than less and writing the date in the format YYYY-MM-DD seems to be the Wiktionary standard in situations where the day and month are known and included. Moreover, not only does this format provide the date information in order of importance by placing the year first but avoids arguments over whether the day or month should be written first. --Overlordnat1 (talk) 10:19, 6 March 2023 (UTC)


 * I find them noisy, especially for books, where in most cases readers don't care if a book was published on March 15 or 16 (however, maybe just the month could be included). Often these dates are extracted from Google Books, which is unreliable. Providing the ISBN is more useful than month/day, and presumably it could be used to retrieve the exact date if needed. Jberkel 10:58, 6 March 2023 (UTC)


 * I see what you mean but I'd still say more information is better than less, though I see that hou have added the months back in now, which is a welcome move. I've just brought up the issue at the Beer Parlour to see what the community's consensus on the issue is. --Overlordnat1 (talk) 14:28, 6 March 2023 (UTC)

Module:usex/templates/testcases/documentation
Any idea why this has been in CAT:E for the past several days? I don't see any edits since 2017. Chuck Entz (talk) 02:11, 20 March 2023 (UTC)

Wanted lists
Hi, I have a question about the lists generated at User:Jberkel/lists/wanted. When new lists get generated, does the program check that the red terms still have incoming links? For example, at User:Jberkel/lists/wanted/20230301/ga, the top five terms no longer have incoming links after I removed them from ga-preposition contractions back in November for being nonexistent, but they're still getting listed. —Mahāgaja · talk 10:53, 22 March 2023 (UTC)


 * There are some caching issues with the HTML dumps. See T305407. I'll see if there are some possible workarounds (filtering deleted entries for example). Jberkel 00:31, 24 March 2023 (UTC)

Database dump / Statistics
Hey, good to see you managed to generate a new list of statistics since Ungoliant seems to have stopped doing them. I'm curious (personally from a technical/programming standpoint, and more broadly to prevent longer periods of no updates in the future such as occurred when Ungoliant stopped generating them in early 2022) about the process used to generate the list, maybe you could humor me and help me understand. My main questions are: a) where do you get the database dump from, and b) what code (or third-party tool(s)?) is used to analyze it, extract the relevant data and generate the Wikicode-table? (As a side note, do you know why some deltas are missing for languages that did see a lot of activity since April 2022, such as those of Gothic and most of English?)

Edit: I just noticed Wiktionary talk:Statistics, should've checked there first. Still, the question about deltas stands. Great work anyhow! — Mnemosientje (t · c) 09:59, 27 March 2023 (UTC)
 * I need to write some documentation, so that someone else can easily regenerate them if needed. At the moment it's a bit complicated, I'd like to simplify it first. Regarding the deltas, they have to be taken with a grain of salt, for instance I'm not entirely sure if the way of classifying gloss/non-gloss matches exactly the procedure used by Ungoliant. Future deltas should be more useful. – Jberkel 10:10, 27 March 2023 (UTC)

Latin Gerunds
While "Gerund" may be on the list of disallowed headers, it can't be swapped out for "Verb". Latin gerunds aren't verbs: there is no number, no tense, no mood, and no conjugation. It has no characteristics of a Verb. I can understand why Gerund isn't allowed for English, Spanish, or any modern language I'm familiar with, but for Latin Gerund is the best option. But they are definitely NOT Verbs. All such changes from Gerund to Verb in Latin entries will need to be changed to something else. --EncycloPetey (talk) 21:55, 29 March 2023 (UTC)


 * I checked a few Latin gerund entries, and they had all "Verb" as a header–in fact, laborandum is the only entry with this header. You should bring this up in the BP if you disagree with this. Jberkel 22:06, 29 March 2023 (UTC)


 * They all have Verb as a header because that's what you changed all the Gerund headers to two years ago (e.g.,, ). I have no idea why (or where) it was decided they are verbs, but gerunds are never verbs.  In English entries, gerunds are treated as nouns: e.g., "Walking is good exercise."  In Latin, it's not that straightforward, and the model used at English walking fails to account for which senses of the verb transfer to the gerund, and make it impossible to organize quotations. --EncycloPetey (talk) 03:54, 1 April 2023 (UTC)
 * → new thread on the BP. Jberkel 12:41, 1 April 2023 (UTC)

return the definition to what I wrote. you are obviously biased
return it 202.161.72.243 10:41, 4 April 2023 (UTC)

Why did you delete the authors Jewish origin?
It is in the source that is presented. 2A02:908:1612:4E0:AC0B:8E4:4947:79A8 14:29, 30 April 2023 (UTC)
 * Which source? I've moved the description to Wikidata, so it would have to be changed there. – Jberkel 14:57, 30 April 2023 (UTC)
 * You changed the description where you removed the adjective "Jewish" for some reason. 2A02:908:1612:4E0:3D73:D3B:7B99:1043 17:27, 4 May 2023 (UTC)
 * You can add it back on Wikidata, but I thought the pile-up of adjectives ("German Jewish British") sounded very weird. Plus it's not even mentioned on her Wikipedia page. Jberkel 17:57, 4 May 2023 (UTC)

X does not a Y make: whence then?
What do you mean by saying it's not really a derived term? The snowclone refers to the proverb in its Etymology section. Is it a mistake then? So what is the correct etymology of the snowclone phrase? Adam78 (talk) 12:02, 29 May 2023 (UTC)


 * It is linking to a non-mainspace entry, and those should be handled in "See also", at least that's how I understood WT:EL. Maybe it doesn't have to be that strict, though. Jberkel 12:22, 29 May 2023 (UTC)

Deleting entries without feedback.
Stop requesting deletion of entries without reason. This is abusive behavior. Newdefinitions108 (talk) 21:19, 9 June 2023 (UTC)

ჹ
Hi. I would very much like to find the alphabets that use this letter. But I don't think hiding our ignorance is a good approach. It's misleading.

I could only ID Tabasaran, but there it's not used for its nominal value. It would be good to find at least one alphabet that used it for /ɢ/. kwami (talk) 20:34, 3 July 2023 (UTC)


 * Can you not put this information in a section inside the entry? Right now, the L2 header is non-standard (you'll notice the warning "bad character in header" in the diff) and cannot be parsed automatically. Jberkel 20:57, 3 July 2023 (UTC)
 * Yes, that would probably work better. Gave it a try. See what you think. kwami (talk) 20:59, 3 July 2023 (UTC)
 * Yes, that's better! However, it should be "Usage notes", not "User notes". Jberkel 21:04, 3 July 2023 (UTC)
 * Oops! Got it.
 * (I wonder how many times I've done that now.) kwami (talk) 01:19, 4 July 2023 (UTC)

Old English dump
Hello! I've been working a lot on Old English redlinks, and I was wondering if it would be possible to get a redlink dump for Old English? Thank you! Vergencescattered (talk) 18:08, 12 October 2023 (UTC)


 * I see you've added "ang" to the list, so it'll be generated next time the job runs (which might take a while, see the next comment on this page). Jberkel 22:43, 7 December 2023 (UTC)

Wanted wanted
Hey! Can we get a new User:Jberkel/lists/wanted soon? Denazz (talk) 08:21, 7 December 2023 (UTC)


 * The WMF people have started to look into the dump problems reported at T345176 (and others), once that's fixed I'll be able to generate new lists. Jberkel 22:48, 7 December 2023 (UTC)