Wiktionary:Frequency lists/English/TV and Movie Scripts (2006)

Here are frequency lists comparable to the Gutenberg ones, but based on 29,213,800 words from TV and movie scripts and transcripts.

Here's a fuller explanation of how the list was generated and its limitations: Frequency lists/TV/2006/explanation.

Here are the top hundred words (from TV scripts) in alphabetical order:

style=margin-left: 1.6em;|1=

a

about

all

and

are

as

at

back

be

because

been

but

can

can't

come

could

did

didn't

do

don't

for

from

get

go

going

good

got

had

have

he

her

here

he's

hey

him

his

how

I

if

I'll

I'm

in

is

it

it's

just

know

like

look

me

mean

my

no

not

now

of

oh

OK

okay

on

one

or

out

really

right

say

see

she

so

some

something

tell

that

that's

the

then

there

they

think

this

time

to

up

want

was

we

well

were

what

when

who

why

will

with

would

yeah

yes

you

your

you're

Here they are in frequency order:


 * 1-1000
 * 1001-2000
 * 2001-3000
 * 3001-4000
 * 4001-5000
 * 5001-6000
 * 6001-7000
 * 7001-8000
 * 8001-9000
 * 9001-10000
 * 10001-12000
 * 12001-14000
 * 14001-16000
 * 16001-18000
 * 18001-20000
 * 20001-22000
 * 22001-24000
 * 24001-26000
 * 26001-28000
 * 28001-30000
 * 30001-32000
 * 32001-34000
 * 34001-36000
 * 36001-38000
 * 38001-40000
 * 40001-41284

Statistics

 * Top 1,000 words cover 85.5% of all words (24,981,922 / 29,213,800).
 * Top 10,000 words cover 97.2% of all words (28,398,152 / 29,213,800).
 * This is a third of all the unique words. The rest were used 5 or fewer times each.