Wiktionary:Thesaurus

This is the main project page of Wiktionary Thesaurus, a Wiktionary subproject and a wiki namespace aiming at creating a thesaurus, a dictionary of synonyms, antonyms, and further semantically related terms such as hyponyms, hypernyms, meronyms, and holonyms. The project was formerly called Wikisaurus.

 New to the thesaurus?
 * See a random page in the thesaurus
 * Browse all thesaurus entries

Please contribute your own thesaurus entries, or add to the existing thesaurus entries.

Purpose
The purpose of Wiktionary Thesaurus is to serve the role of an electronic thesaurus: a dictionary of synonyms, near-synonyms, antonyms, and near-antonyms, and also of other semantically related terms such as hyponyms, hypernyms, meronyms, and holonyms.

Chunked purpose:
 * Purpose:
   * To help people find words that they
     * can't recall or
     * don't know
   * To help people explore the network of words
 * Semantic relations:
   * Synonymy – same or similar meaning
   * Antonymy – opposite meaning
   * Hyponymy – narrower meaning, subclass
   * Hypernymy – broader meaning, superclass
   * Being an instance – being an example of the class, set membership
   * Meronymy – part, such as wheel of a car
   * Holonymy – whole, such as car of a wheel
 * Users:
   * Writers
   * Managers
   * Contributors to wikis
   * Bloggers
   * Writers of love letters
   * Journal writers

The purpose of such a thesaurus in general is mainly to help anyone who writes for a living or for fun (writers, managers, contributors to wikis, bloggers, and writers of love letters) find words they cannot recall, or do not even know, starting from semantically related words that they do recall. In general, anyone to whom the choice of words matters can benefit from a thesaurus, especially one linked to a dictionary providing the definitions.

The added value of Wiktionary Thesaurus is its integration with Wiktionary: it links to and is linked from Wiktionary.

Browse
To browse the thesaurus, you can start at the root, Thesaurus:entity, and proceed through the hyponymic network to high genera such as Thesaurus:person, Thesaurus:organism, Thesaurus:animal, Thesaurus:plant, and Thesaurus:artifact, and further down. You can also browse by topical thesaurus category, such as Category:Thesaurus:Geography (Thesaurus:forest), Category:Thesaurus:Personality (Thesaurus:humble), or Category:Thesaurus:Appearance (Thesaurus:beautiful).

Model
The thesaurus is organized primarily on the model of WordNet. That is, the key organizing principles are the relations of hyponymy (subclass) and hypernymy (superclass), and to a lesser extent meronymy (part of a whole) and holonymy (whole of the part). See also Semantic relations. The design of Roget's 1911 thesaurus is somewhat similar in that it does not restrict the entries to lists of synonyms and antonyms; however, Roget's thesaurus does not use WordNet relations. The design of the Oxford English Dictionary thesaurus is also somewhat similar in that it is hierarchically organized; however, its subordination relation is not strict hyponymy but is in part thematic. By contrast, the thesaurus of Merriam-Webster has synonyms, antonyms, and "related" words. Editors who want to create thesaurus entries that are primarily lists of synonyms can do so without worrying about the other relations, but should keep in mind that there should be only a single thesaurus entry for a synonym set, thereby avoiding duplication.

One sense per entry
Each entry should ideally have a single sense. Nonetheless, the format supports multiple senses for the cases where this seems to be the best option. It is usually possible to pick different headwords for different senses. Each entry should ideally stand for a semantic object; the headword should be in part an accident. The point is not to list all senses of the headword. Thus, there can be a single sense in Thesaurus:sound, and another sense of "sound" is covered at Thesaurus:inlet. If it becomes impossible to keep finding dedicated headwords, we may resort to disambiguating naming like "rich (wealthy)" or "rich, wealthy". Sometimes, the headword becomes less ambiguous by using a phrase: there is Thesaurus:English language. WordNet seems to do fine mostly by the comma convention.

To use the entry headword as a basis for covering all the senses of the headword would lead to a duplication of synonym rings covered in other headwords, at odds with the duplication-avoidance rationale for the thesaurus. Thus, there is no point in duplicating Thesaurus:spicy in Thesaurus:hot. By contrast, having a sense for an adjective and a sense for a noun in Thesaurus:German is a different use case and makes a little bit more sense, being caused by a lack of suitable English disambiguating headword unless one opts for "German person" and "German language" as headwords.

Multilingualism
English Wiktionary Thesaurus shall also contain entries for languages other than English.

Category:Thesaurus entries by language features these entries. Categorization is done by ws sense.

Historically, there was no agreed-upon naming scheme for non-English thesaurus entries. The following conventions existed:
 * Convention A: no language code, native headword. For example: Thesaurus:صار (Arabic), Thesaurus:yaxşı (Azerbaijani), Thesaurus:chat (French), Thesaurus:god (English and Danish). Entries in different languages with identically spelled titles are placed on the same Thesaurus page, exactly as we do for ordinary dictionary entries.
 * Convention B: language code, native headword. For example: Thesaurus:fr:embêter (French), Thesaurus:sga:ar (Old Irish), Thesaurus:non:sverð (Old Norse).
 * Convention C: language code, English headword. For example: Thesaurus:da:beautiful (Danish), Thesaurus:ar:become happy (Arabic), Thesaurus:sound/fi (Finnish).

By October 2022, convention A was followed by 98.8% of non-English thesaurus entries. In November 2023, all remaining thesaurus entries using convention B and C were standardised to convention A.
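The difference between titles with and without a language-code prefix can be sketched in a few lines of Python. This is only an illustration: `parse_title` and `LANG_PREFIX` are hypothetical names, not part of any Wiktionary tooling, and the sketch assumes that language codes are two or three lowercase ASCII letters. The suffix form seen in Thesaurus:sound/fi is deliberately not handled.

```python
import re

# Hypothetical helper for illustration; not part of any Wiktionary tooling.
# Assumption: a language code is 2-3 lowercase ASCII letters followed by a colon.
LANG_PREFIX = re.compile(r"^(?P<code>[a-z]{2,3}):(?P<headword>.+)$")


def parse_title(title):
    """Split a thesaurus page title into (language_code, headword).

    language_code is None when the title carries no language-code prefix.
    """
    namespace, _, rest = title.partition(":")
    if namespace != "Thesaurus" or not rest:
        raise ValueError(f"not a thesaurus title: {title!r}")
    match = LANG_PREFIX.match(rest)
    if match:
        return match.group("code"), match.group("headword")
    return None, rest


print(parse_title("Thesaurus:chat"))        # (None, 'chat')
print(parse_title("Thesaurus:fr:embêter"))  # ('fr', 'embêter')
```

Note that a title alone cannot tell a native headword from an English one after the language code, so the sketch only separates prefixed titles from unprefixed ones.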

Convention A carries certain disadvantages. One is that the automatic display of the [⇒ thesaurus] link next to synonyms and other terms can be triggered in situations where it is not relevant. For example, Thesaurus:yes lists "ar" as a synonym, but the [⇒ thesaurus] link next to "ar" points to a page containing synonyms for an unrelated term in a different language. Another disadvantage is that the scheme is unique to English Wiktionary. Having Thesaurus entries for multiple languages on a single page causes problems for interwiki linking to other-language Wiktionary editions, such as the French Wiktionnaire.

Topical categorization is an unsolved problem: there is Category:Thesaurus:Geography, but no language-specific one. We could create "Category:Thesaurus:en:Geography", "Category:Thesaurus:es:Geography", etc., on the model of mainspace topical categories.

Discussions:
 * Thesaurus_talk:juoppo, 2009

Formatting
Formatting is specified and discussed at:
 * /Format

Example entries:
 * Thesaurus:error – mostly synonyms
 * Thesaurus:aircraft – mostly hyponyms
 * Thesaurus:food – a complex entry
 * Thesaurus:word – hyponyms are grouped semantically using a separator. However, multiple people indicated that they prefer text labels for clarity, so labels are probably the better choice going forward.
 * Thesaurus:animal sound – hyponyms grouped semantically with text labels

Inclusion
As for entry headwords, they must be attested. Not all mainspace entries should have their own thesaurus entry: the point of the thesaurus is in part to prevent duplication of lists. The headwords can sometimes be a sum of parts if deemed preferable, as in Thesaurus:beautiful person.

As for list items, all items in lists of synonyms, antonyms, hyponyms, etc. on Thesaurus pages are required to be attested, using the same attestation criteria as the mainspace. There is no requirement that they must be more than sum of parts. Roget's Thesaurus did include many sum-of-parts phrases.

Semantic relations
See also Semantic relations.

If you want to create synonym-only entries, you do not need to worry about the other relationships all that much. This is especially true of adjectives. For nouns, it often pays off to figure out a good node in the hyponymic (subclass/superclass) network.

Synonyms and antonyms
Synonyms are terms with the same or very similar meaning. Register (informal, vulgar, etc.) does not impact synonymy. Examples: Thesaurus:wise, Thesaurus:drunk. Some putative synonyms are better classified as hyponyms.

Antonyms are terms with opposite meaning. Antonyms are sometimes concentrated in an opposite thesaurus entry. Example: Thesaurus:drunk.

Hypernyms and hyponyms
Hypernyms are terms with broader meaning, capturing a superclass relationship: X is a hypernym of Y if each Y is also an instance of X. Example: Thesaurus:bird.

Hyponyms are terms with narrower meaning, capturing a subclass relationship: X is a hyponym of Y if each X is an instance of Y. Examples: Thesaurus:drunk, Thesaurus:bird. In many entries, hyponyms can be listed only up to a point, to some nesting level. For instance, it makes no sense to list all hyponyms in Thesaurus:person; by contrast, listing all hyponyms in Thesaurus:relative or Thesaurus:musician seems fine.
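As a rough illustration of the hypernym and hyponym definitions, and of listing hyponyms only down to some nesting level, here is a minimal Python sketch. The `HYPONYMS` table is toy data invented for the example (it is not taken from actual thesaurus entries), the function names are hypothetical, and the sketch assumes each term has at most one direct hypernym.

```python
# Toy subclass data, invented for illustration: hypernym -> direct hyponyms.
HYPONYMS = {
    "entity": ["organism", "artifact"],
    "organism": ["animal", "plant"],
    "animal": ["bird"],
    "bird": ["sparrow", "owl"],
}


def hypernyms(term):
    """Return every hypernym of `term`, nearest first.

    Assumes each term has at most one direct hypernym in HYPONYMS.
    """
    chain = []
    current = term
    while True:
        parent = next((p for p, kids in HYPONYMS.items() if current in kids), None)
        if parent is None:
            return chain
        chain.append(parent)
        current = parent


def hyponyms(term, max_depth):
    """List hyponyms of `term`, descending at most `max_depth` nesting levels."""
    if max_depth == 0:
        return []
    result = []
    for child in HYPONYMS.get(term, []):
        result.append(child)
        result.extend(hyponyms(child, max_depth - 1))
    return result


print(hypernyms("bird"))         # ['animal', 'organism', 'entity']
print(hyponyms("organism", 1))   # ['animal', 'plant']
print(hyponyms("organism", 2))   # ['animal', 'bird', 'plant']
```

The `max_depth` parameter mirrors the idea that a broad entry lists hyponyms only to a limited depth, while a narrow entry can afford to list all of them.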

Holonyms and meronyms
Holonyms are terms for wholes containing parts: X is a holonym of Y if Y is part of X. Example: Thesaurus:relative.

Meronyms are terms for parts of wholes: X is a meronym of Y if X is part of Y. Example: Thesaurus:aircraft.

Classes and instances
X is a class of Y if Y is an instance of X. Classes differ from hypernyms. Example: Thesaurus:Ecuador.

Instances are the opposite of classes and differ from hyponyms. Example: Thesaurus:country.
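The relations defined in this section come in inverse pairs: hyponym and hypernym, meronym and holonym, instance and class. A minimal Python sketch of that idea follows. The names `INVERSE`, `FACTS`, and `related` are hypothetical, the wheel/car and Ecuador/country facts echo examples used on this page, and the sparrow/bird fact is added purely for illustration.

```python
# Each relation and its inverse; storing a fact one way implies the other.
INVERSE = {
    "hyponym": "hypernym",
    "meronym": "holonym",
    "instance": "class",
}
INVERSE.update({v: k for k, v in INVERSE.items()})

# (x, relation, y) reads "x is a <relation> of y".
FACTS = [
    ("wheel", "meronym", "car"),
    ("Ecuador", "instance", "country"),
    ("sparrow", "hyponym", "bird"),  # illustrative only
]


def related(term, relation):
    """Return all terms that stand in `relation` to `term`."""
    found = []
    for x, rel, y in FACTS:
        if rel == relation and y == term:
            found.append(x)  # stored directly: x is a <relation> of term
        elif INVERSE[rel] == relation and x == term:
            found.append(y)  # derived: y is the inverse relation of term
    return found


print(related("car", "meronym"))    # ['wheel']
print(related("wheel", "holonym"))  # ['car']
print(related("Ecuador", "class"))  # ['country']
```

Keeping instance/class separate from hyponym/hypernym in the table reflects the distinction drawn above: Ecuador is an instance of country, not a subclass of it.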

Coordinate terms and troponyms
Coordinate terms, also known as cohyponyms, are mostly unused in the thesaurus, since listing them would duplicate hyponymic structures from other entries.

Troponyms are unused: use hyponyms and hypernyms for verbs as well.

Various
The section "Various" is intended to capture other interesting relations, to broaden the navigation network beyond specifically defined relations. It supports creativity, but may lead to disagreements between editors since there is no set of specific rules governing the section.

Example entries:
 * Thesaurus:number: has all sorts of terms relating to numbers that are not hyponyms or instances.
 * Thesaurus:size: has adjectives for size and these do not fit hyponymy or instance-of relationships.
 * Thesaurus:aircraft: has people on board, who are strictly speaking not meronyms.

Minimum item count
A putative thesaurus entry with 2–5 items can probably be comfortably handled by the mainspace synonym lists, and may not be worth an entry. However, there is no agreed-upon rigid rule for this. The thesaurus pays off most when the item counts are larger.

There is usually no need to create "leaf node" entries for 1 or 2 synonyms and 1 hypernym. Such items are sufficiently covered in the hypernym entries and in the mainspace. Thus, there is Thesaurus:lake but no Thesaurus:pond.

Templates
Lists of templates:
 * Category:Thesaurus templates.
 * Index_to_templates

Mainspace
Linking from mainspace to Thesaurus entries:
 * Links to thesaurus entries can be added to the "Synonyms" section (or "Hyponyms", "Antonyms", etc. where appropriate) using the template (which displays something like: ), or using conventional wikitext syntax.
 * The template, used to render per-sense synonyms directly beneath definitions, accepts   links, which should be placed after any specific synonyms (e.g.  )
 * Especially for Thesaurus entries featuring mostly synonyms, it is good to add a link to the Thesaurus entry from all the mainspace entries for the synonyms, so that the user knows that there is a Thesaurus entry when visiting the mainspace.

Wikidata
Wikidata, with its subclass and instance-of relationships, does some of the job of the thesaurus, and is hugely more complete. However, it is not suited for extensive synonym lists, and it does not make it convenient to browse hyponymic networks, only supporting easy navigation from an item to its superclass. Some of its subclass structures seem needlessly complex and overengineered.

Roget-MICRA thesaurus
Roget's 1911 thesaurus with MICRA supplementation is available here:
 * Appendix:Roget MICRA thesaurus

The appendix has a search box and conveniently features links to mainspace.

Moby Thesaurus II
Moby Thesaurus II is available here:
 * Appendix:Moby Thesaurus II

The appendix has a search box and conveniently features links to mainspace.

Identity
The current title of the project is "Thesaurus" or "Wiktionary Thesaurus". Before mid-2017, it was "Wikisaurus". Alternatives considered include "Wikithesaurus". The spelling "WikiSaurus", with a capital "S", was apparently also used at some point.

Online thesauri
Public domain


 * 1911 version of Roget's Thesaurus hosted by Project Gutenberg
 * Appendix:Roget's thesaurus classification
 * Moby Thesaurus II by Grady Ward - public domain
 * Dictionary at datasegment.com - includes Moby thesaurus in its search results

Free as in "freedom"


 * http://wordnet.princeton.edu/ - licensed under ; see also Princeton WordNet
 * https://en-word.net/ - licensed under the Creative Commons Attribution (CC-BY) 4.0 License; see also GitHub

Proprietary


 * http://thesaurus.reference.com
 * http://www.merriam-webster.com/thesaurus
 * http://www.bartleby.com/62/
 * http://www.visualthesaurus.com/
 * http://encarta.msn.com/thesaurus__/thesaurus.html
 * http://www.fao.org/agrovoc/
 * http://www.smartdefine.org
 * http://www.powerthesaurus.org

Other


 * None listed.

Statistics
Statistics about the thesaurus entries, as of Oct 2022:
 * Entries: 4,833
 * English entries: 2,487
 * Chinese entries: 1,900
 * Other-language entries: 446
 * Entries containing colon in title: 29
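
The figures above can be cross-checked with a little arithmetic. The Python sketch below recomputes the total and the per-language shares. Its last step rests on an assumption that is not stated on this page: that the 29 colon-titled entries are the non-English entries that did not follow convention A.

```python
# Counts copied from the statistics list (Oct 2022).
entries = {"English": 2487, "Chinese": 1900, "Other": 446}

total = sum(entries.values())
shares = {lang: round(100 * n / total, 1) for lang, n in entries.items()}

# Assumption: the colon-titled entries are the non-English entries
# that did not follow convention A.
non_english = total - entries["English"]
colon_titles = 29
share_without_colon = round(100 * (1 - colon_titles / non_english), 1)

print(total)                # 4833
print(shares)               # {'English': 51.5, 'Chinese': 39.3, 'Other': 9.2}
print(share_without_colon)  # 98.8
```

Under that assumption, the result agrees with the 98.8% share reported for convention A.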

Page views
Anatomy entries get a fair amount of page views, as is expected. But they are not alone; other entries with non-trivial page views include Thesaurus:pros and cons and Thesaurus:child.

Recent changes

 * Recent changes in Thesaurus
 * New pages in Thesaurus
 * Random page in Thesaurus

Shortcuts

 * WT:WSI - a Thesaurus index.
 * WT:WS - to this page.
 * See also Shortcut

Subpages
Highlighted subpages:
 * /Format
 * /Requested entries

Project subpages:
 * /Format - how to format a Thesaurus entry
 * /Purpose - on the purpose of the Thesaurus
 * Thesaurus considerations - Original discussion about the project.
 * /Improvements 1 - Its talkpage has a discussion from July 2008.
 * /Improvements 2 - Discussion about the direction and overall project.
 * /Requested entries - A lot of words with candidate lists of synonyms that can be used as a starting point for creation of entries. The size of the page: 700 words.

To do
Things to do:
 * /Requested entries - add requested entries
 * Requests for cleanup - clean up entries with formatting and other problems
 * Appendix:Roget's thesaurus classification - add entries using Roget's thesaurus as a checklist and model

All entries
Lists of all Thesaurus entries:
 * All Thesaurus pages (WT:WSI)
 * Category:Thesaurus

Discussion
Discussions about Thesaurus are scattered across various pages. In the future, they should preferably take place in the Beer parlour, a general policy discussion room.

Pages with discussions:
 * Thesaurus considerations -- starting in 2002 and 2003, getting more traffic in 2004, with most discussion ended by the end of 2006
 * Wikisaurus/Improvements 1 -- created in February 2005, and stopped immediately; a surge of activity appeared in July 2008
 * Wikisaurus/Improvements 2 -- created in April 2006, active in May 2006 and then stopped; a surge of activity appeared in May 2008

For more discussions, see the Beer parlour section below.

Beer parlour
Discussions about Thesaurus at Beer parlour:

The following list is highly incomplete.
 * 2005
 * WikiSaurus category - March 2005 - 750 words
 * 2006
 * Wiktionary:Project_-_WikiSaurus_improvement_1 - April 2006
 * Template_WikiSaurus-link - April 2006
 * Wikisaurus_cleanup - April 2006 - 2800 words
 * Thesaurus:new - May 2006 - 45 words
 * WikiSaurus_proposal - May 2006 - 172 words
 * Pushing for the definitive WikiSaurus name and namespace - May 2006 - 2200 words
 * Stop me if this sounds_familiar - Oct 2006 - on semiprotecting Wikisaurus entries - see also the vote
 * 2007
 * Necessary_tidying_up_of_Wikisaurus_templates. - May 2007
 * Wikisaurus_changes - May 2007 - on inclusion criteria including the option of 30,000 Google hits - 650 words
 * 2008
 * January-April 2008 : none found.
 * WikiSaurus - May 2008 - a proposal of deletion of Wikisaurus
 * Thesaurus_flunky - May 2008
 * Specific_Universal_Changes_in_Wikisaurus - June 2008
 * Yet_Another_Interminable_Discussion_about_Wikisaurus - July 2008
 * Category:Wikisaurus - July 2008
 * Wikisaurus at cross purposes - July 2008 - including whether all items in WS entries should link to WS or to WT - many participants
 * Wikisaurus alteration - Sep 2008 - about the appearance of.
 * Moby Project - Sep 2008 - on importing Moby II thesaurus into Wikisaurus
 * 2009
 * Wikisaurus - non-English entries - Mar 2009
 * on using the Wikisaurus - Apr 2009
 * Wikisaurus - inclusion criteria - Nov 2009
 * 2010
 * Proposed Wikisaurus style changes - Jan 2010
 * International Wikisaurus - Jan 2010
 * Vote on deleting Template:Wikisaurus-link - Jan 2010
 * Poll: Deleting "/more" pages from Wikisaurus - Sep 2010
 * Planned vote: Deleting Wikisaurus slash-more pages, Oct 2010; see also the vote
 * A synonym of itself in Wikisaurus, Oct 2010
 * "See also" in Wikisaurus, Nov 2010
 * 2013
 * Wikisaurus and attestation - Sep 2013 - and the vote
 * 2014
 * Redirects in Wikisaurus - Mar 2014
 * 2017
 * Disambiguate Wikisaurus (thesaurus) entries by language - Aug 2017

See also search for "Wikisaurus" in the archives of Beer parlour.

Index
An index to this page:
 * All entries - see, All Thesaurus pages, and Category:Thesaurus
 * Example entries - see
 * Layout - see
 * Logo - see
 * Monitoring - see
 * Recent changes - see
 * Requested entries - see /Requested entries and
 * Spelling - see
 * Title - see