Semantic similarity is a metric defined over a set of documents or terms, where the idea of distance between items is based on the likeness of their meaning or semantic content[citation needed] as opposed to lexicographical similarity. These are mathematical tools used to estimate the strength of the semantic relationship between units of language, concepts or instances, through a numerical description obtained according to the comparison of information supporting their meaning or describing their nature.[1][2] The term semantic similarity is often confused with semantic relatedness. Semantic relatedness includes any relation between two terms, while semantic similarity only includes "is a" relations.[3]
For example, "car" is similar to "bus", but is also related to "road" and "driving".
Computationally, semantic similarity can be estimated by defining a topological similarity, by using ontologies to define the distance between terms/concepts. For example, a naive metric for the comparison of concepts ordered in a partially ordered set and represented as nodes of a directed acyclic graph (e.g., a taxonomy), would be the shortest-path linking the two concept nodes. Based on text analyses, semantic relatedness between units of language (e.g., words, sentences) can also be estimated using statistical means such as a vector space model to correlate words and textual contexts from a suitable text corpus. The evaluation of the proposed semantic similarity / relatedness measures are evaluated through two main ways. The former is based on the use of datasets designed by experts and composed of word pairs with semantic similarity / relatedness degree estimation. The second way is based on the integration of the measures inside specific applications such as information retrieval, recommender systems, natural language processing, etc.
^Harispe S.; Ranwez S.; Janaqi S.; Montmain J. (2015). "Semantic Similarity from Natural Language and Ontology Analysis". Synthesis Lectures on Human Language Technologies. 8 (1): 1–254. arXiv:1704.05295. doi:10.2200/S00639ED1V01Y201504HLT027. S2CID 17428739.
^Feng Y.; Bagheri E.; Ensan F.; Jovanovic J. (2017). "The state of the art in semantic relatedness: a framework for comparison". Knowledge Engineering Review. 32: 1–30. doi:10.1017/S0269888917000029. S2CID 52172371.
^A. Ballatore; M. Bertolotto; D.C. Wilson (2014). "An evaluative baseline for geo-semantic relatedness and similarity". GeoInformatica. 18 (4): 747–767. arXiv:1402.3371. Bibcode:2014arXiv1402.3371B. doi:10.1007/s10707-013-0197-8. S2CID 17474023.
and 26 Related for: Semantic similarity information
Semanticsimilarity is a metric defined over a set of documents or terms, where the idea of distance between items is based on the likeness of their meaning...
forms of semantic networks has been created for specific use. For example, in 2008, Fawsy Bendeck's PhD thesis formalized the SemanticSimilarity Network...
Latent semantic analysis (LSA) is a technique in natural language processing, in particular distributional semantics, of analyzing relationships between...
A semanticsimilarity network (SSN) is a special form of semantic network. designed to represent concepts and their semanticsimilarity. Its main contribution...
Semantic memory refers to general world knowledge that humans have accumulated throughout their lives. This general knowledge (word meanings, concepts...
Semantic change (also semantic shift, semantic progression, semantic development, or semantic drift) is a form of language change regarding the evolution...
vectors which are nearby as measured by cosine similarity. This indicates the level of semanticsimilarity between the words, so for example the vectors...
analysis, cosine similarity is a measure of similarity between two non-zero vectors defined in an inner product space. Cosine similarity is the cosine of...
and studies theories and methods for quantifying and categorizing semanticsimilarities between linguistic items based on their distributional properties...
based on a similarity function Similarity learning – Supervised learning of a similarity function Self-similarity matrix Semanticsimilarity – Natural...
The Semantic Web, sometimes known as Web 3.0 (not to be confused with Web3), is an extension of the World Wide Web through standards set by the World Wide...
A semantic wiki is a wiki that has an underlying model of the knowledge described in its pages. Regular, or syntactic, wikis have structured text and untyped...
the subject of the Semantic Differential. Likert scale Componential analysis Semantic gap SemanticsimilaritySemanticsimilarity network Structural differential...
completed experiments on latent semantic analysis (LSA) that supported the opposite. Instead of an increase in semanticsimilarity when there was a decrease...
SemEval (Semantic Evaluation) is an ongoing series of evaluations of computational semantic analysis systems; it evolved from the Senseval word sense evaluation...
Semantic discord is the situation where two parties disagree on the definition of a word(s) that is essential to communicating or formulating the concept(s)...
of researchers to compute what they refer to as "semantic relatedness" by means of cosine similarity between the aforementioned vectors, collectively...
within a domain Philosophical language Protégé (software) Semantic network Semanticsimilarity network Structuralism Systematics Taxon, a population of...
Disambiguation, Semanticsimilarity, and also to automatically rank WordNet synsets according to how strongly they possess a given semantic property, such...
Semantic satiation is a psychological phenomenon in which repetition causes a word or phrase to temporarily lose meaning for the listener, who then perceives...
Semantic matching is a technique used in computer science to identify information which is semantically related. Given any two graph-like structures,...
In computer science, an abstract semantic graph (ASG) or term graph is a form of abstract syntax in which an expression of a formal or programming language...
meaningful space where the distance between words is related to semanticsimilarity. Training is performed on aggregated global word-word co-occurrence...
of lexicology. Since lexicology studies the meaning of words and their semantic relations, it often explores the history and development of a word. Etymologists...