Wiktionary:French frequency lists (Belgium, finance)

Frequency lists from Wortschatz Leipzig with the authorization from the laboratory. The list was based on Belgium written sources, with a clear financial bias.


 * 1-2000
 * 2001-4000
 * 4001-6000
 * 6001-8000
 * 8001-10000

Note: these indicative lists still require some cleanup, because: This list does not unify inflected words (with plural or feminine mark on nouns or adjectives, or conjugated verbs), and does not recognize auxiliaries of verbs at compound tenses as part of the conjugated verb, but treat auxiliaries separately for each inflected form. Alphabetising this list can be very helpful for spotting redundancies.
 * 1) they don't unify common words that are normally not capitalized in the dictionary, but can be capitalized at the begining of sentences or in titles;
 * 2) they do not break correctly words preceded by a separate word contracted with an apostrophe for very common articles (l') or preposition (d') or negation adverb (n') or pronoun (c', j', l', m', s', t'), or verbal liaison particles (-t-, -z-, which are not really words as they don't have any meaning but are written for phonetic reason), or pronoun subjects just after the verb (after a mandatory linking hyphen, that still does not make a compound word but denotes the inversion of the subject rather than the normal occurrence of an object): all these words should be counted separately;
 * 3) the source is certainly from Belgian French written papers only, with typical occurrences for that country and no equivalence for France, or other French speaking countries where these words are much rarely used (such as currency abbreviations, Belgian toponyms for regions and cities, and many missing terms for very common specialties in France);
 * 4) the list contains isolated letters that are not words, per se (except a few effective words: a, à, y);
 * 5) as well, there are acronyms and symbols occurring only in written documents but not as part of the spoken language;
 * 6) frequent proper names are included but are not very specific to any of the 4 studied languages.