Wiktionary talk:Frequency lists/TV/2006/1-1000

Personal names
What is with all of the personal names on this list? Surely this can't be for real. Someone needs to clean them out.--144.92.184.45 01:31, 10 July 2006 (UTC)
 * Hey, if they actually say John more than they say marriage I'm not gonna argue. What I want to know is how do people say 'c'. Is that them just saying the letter C or is it the 'cute' for 'see'.


 * Yep, they're keepers. I will say, though, that while some of them are obvious (House, Phoebe, Frasier) I have no idea why "Greenlee" is so high in the rankings. Surely, it can't be that common across series. Is there a list somewhere of the specific scripts that were used in compiling this thing? 116.231.238.115 05:33, 15 December 2013 (UTC)

Individual letters?
Why are individual letters on the list? There are a lot of them, people don't just say stuff like "j". And what about the word "Niles"? It's not even in the Wiktionary! Niles is a town near me, the only possible reason this could be on there is if the TV station that was monitored was a local Niles station or somesuch. Just ain't right.
 * Educational programming often will recite the alphabet (and those programs have text too.) Legal proceedings list items by number or letter as well.  Niles is a surname and a given name, both for character names and actor names.  The list is what it is...interesting, but kinda strange if you look too closely, or get caught up on minor details.  --Connel MacKenzie 04:13, 2 November 2007 (UTC)
 * That's not what it is at all.


 * First, people do say "J" all the time: they just often transcribe it as "Jay". Second, these are from the scripts, where the letters are used to describe which character is speaking &c. They're keepers, but they should generally be capitalized.116.231.238.115 05:33, 15 December 2013 (UTC)

I and I'm

 * "I see the 'words' I and I'm listed on a different place. Maybe thats the reason why there is such a big hole between 'you' and 'I'. Shouldn't 'I'm' be replaced by m' or 'am' ? (and the number of 'Is be increased?) (sorry for the )"

Ok and Okay
I think it would make sense to interlink obviously related words. Although "I" and "I'm" can be counted as distinct entities, given that this list refers to scripts, which are spoken, the sound of "ok" is identical to that of "okay", and I think they need a note beside each.—DIV (128.250.80.15 03:16, 8 July 2008 (UTC))
 * Scripts are written, transcripts are transcribed from speech. If it really were based on scripts, I'd disagree, but looking at the methodology it does seem to be based on fan transcriptions or closed captioning instead of anything done with much care (possibly explaining why "mm" is more common than "mmm" here).


 * Even so, I'd rather keep them separate and just do a separate list for the lemmas. 116.231.238.115 15:22, 15 December 2013 (UTC)

Relative frequencies
Can someone please state how many words there were in the corpus, so I can calculate percentages? What can anyone do with absolute frequencies and a ranking? Nothing! The ranking stays the same, but without an absolute number of words in the corpus this was counted on the individual counts are meaningless.

Probable flub
I can't think of anything that nbsp means other than non-breaking space and assume we shouldn't include it just for the hell of it. Thoughts?116.231.238.115 05:33, 15 December 2013 (UTC)
 * Given the desire not to reformat all of the other pages based on this one, I've simply struck it through as the best option pending a rewrite.116.231.238.115 15:54, 15 December 2013 (UTC)

User:Chuck Entz
seems to have page ownership issues. He's consistently reverting improvements to the article without paying any attention to the capitals already on the list, at least one of which is erroneous (at least some uses of rose will be to the flower, in addition to the name). Kindly revert his future edits to the page to this improved format until he either builds consensus that the page should never use capitals (and fix the existing ones) or (more helpfully) backs off and permits people to link the names to the articles on the names.116.231.238.115 11:55, 15 December 2013 (UTC)