Semantic Interpretation for Speech Recognition
World Wide Web Consortium recommendation
Semantic Interpretation for Speech Recognition (SISR) defines the syntax and semantics of annotations to grammar rules in the Speech Recognition Grammar Specification (SRGS). It has been a World Wide Web Consortium recommendation since 5 April 2007.[1]
Building on SRGS grammars, SISR lets voice browsers use ECMAScript to interpret complex grammars semantically and return the result to the application. For example, the utterance "I would like a Coca-cola and three large pizzas with pepperoni and mushrooms." can be interpreted into an object named order that an application can process directly: its drink property records the liquid ("coke") and the drink size ("medium", the default when no size is spoken), and its pizza property records the number (3), the pizza size ("large"), and the list of toppings ("pepperoni", "mushrooms").
This object is produced when the utterance is matched against the following grammar, which adds SISR markup to a standard SRGS grammar in XML format:
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE grammar PUBLIC "-//W3C//DTD GRAMMAR 1.0//EN" "http://www.w3.org/TR/speech-grammar/grammar.dtd">
<grammar xmlns="http://www.w3.org/2001/06/grammar" xml:lang="en"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://www.w3.org/2001/06/grammar http://www.w3.org/TR/speech-grammar/grammar.xsd"
         version="1.0" mode="voice" tag-format="semantics/1.0" root="order">

  <rule id="order">
    I would like a
    <ruleref uri="#drink"/>
    <tag>out.drink=new Object(); out.drink.liquid=rules.drink.type;
         out.drink.drinksize=rules.drink.drinksize;</tag>
    and
    <ruleref uri="#pizza"/>
    <tag>out.pizza=rules.pizza;</tag>
  </rule>

  <rule id="kindofdrink">
    <one-of>
      <item>coke</item>
      <item>pepsi</item>
      <item>coca cola <tag>out="coke";</tag></item>
    </one-of>
  </rule>

  <rule id="foodsize">
    <tag>out="medium";</tag> <!-- "medium" is default if nothing said -->
    <item repeat="0-1">
      <one-of>
        <item>small <tag>out="small";</tag></item>
        <item>medium</item>
        <item>large <tag>out="large";</tag></item>
        <item>regular <tag>out="medium";</tag></item>
      </one-of>
    </item>
  </rule>

  <!-- Construct Array of toppings, return Array -->
  <rule id="tops">
    <tag>out=new Array;</tag>
    <ruleref uri="#top"/>
    <tag>out.push(rules.top);</tag>
    <item repeat="1-">
      and
      <ruleref uri="#top"/>
      <tag>out.push(rules.top);</tag>
    </item>
  </rule>

  <rule id="top">
    <one-of>
      <item>anchovies</item>
      <item>pepperoni</item>
      <item>mushroom <tag>out="mushrooms";</tag></item>
      <item>mushrooms</item>
    </one-of>
  </rule>

  <!-- Two properties (drinksize, type) on left hand side Rule Variable -->
  <rule id="drink">
    <ruleref uri="#foodsize"/>
    <ruleref uri="#kindofdrink"/>
    <tag>out.drinksize=rules.foodsize; out.type=rules.kindofdrink;</tag>
  </rule>

  <!-- Three properties on rules.pizza -->
  <rule id="pizza">
    <ruleref uri="#number"/>
    <ruleref uri="#foodsize"/>
    <tag>out.pizzasize=rules.foodsize; out.number=rules.number;</tag>
    pizzas with
    <ruleref uri="#tops"/>
    <tag>out.topping=rules.tops;</tag>
  </rule>

  <rule id="number">
    <one-of>
      <item>
        <tag>out=1;</tag>
        <one-of>
          <item>a</item>
          <item>one</item>
        </one-of>
      </item>
      <item>two <tag>out=2;</tag></item>
      <item>three <tag>out=3;</tag></item>
    </one-of>
  </rule>

</grammar>
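To make the data flow through the tags easier to follow, the grammar's semantic scripts can be mimicked in plain ECMAScript. The sketch below is only an illustration, not how a voice browser is implemented: each hypothetical function (orderRule, drinkRule, and so on) stands in for one rule and returns the value its tag scripts would leave in out, and the pre-parsed input object passed to orderRule is an assumption of this example, since a real recognizer derives the matched words from audio. It also assumes that an item without a tag yields its spoken text, which is how the untagged items appear intended to behave.

```javascript
// Sketch only: plain functions standing in for the rules of the grammar above.
// The function names and the shape of the `parse` input are hypothetical.

function foodsizeRule(word) {
  let out = "medium";                      // default if no size is spoken
  if (word === "small") out = "small";
  if (word === "large") out = "large";
  if (word === "regular") out = "medium";  // "regular" is an alias for medium
  return out;
}

function kindofdrinkRule(words) {
  // Only "coca cola" carries a tag; other items are assumed to keep their text.
  return words === "coca cola" ? "coke" : words;
}

function topRule(word) {
  // The singular "mushroom" is normalized to "mushrooms".
  return word === "mushroom" ? "mushrooms" : word;
}

function topsRule(words) {
  const out = new Array();
  for (const word of words) out.push(topRule(word));
  return out;
}

function numberRule(word) {
  const values = { a: 1, one: 1, two: 2, three: 3 };
  return values[word];
}

function drinkRule(parse) {
  const out = {};
  out.drinksize = foodsizeRule(parse.size);
  out.type = kindofdrinkRule(parse.kind);
  return out;
}

function pizzaRule(parse) {
  const out = {};
  out.pizzasize = foodsizeRule(parse.size);
  out.number = numberRule(parse.number);
  out.topping = topsRule(parse.toppings);
  return out;
}

function orderRule(parse) {
  // `rules` holds the results of the referenced rules, as in the tag scripts.
  const rules = { drink: drinkRule(parse.drink), pizza: pizzaRule(parse.pizza) };
  const out = {};
  out.drink = new Object();
  out.drink.liquid = rules.drink.type;
  out.drink.drinksize = rules.drink.drinksize;
  out.pizza = rules.pizza;
  return out;
}

// "I would like a coca cola and three large pizzas with pepperoni and mushrooms"
const result = orderRule({
  drink: { size: undefined, kind: "coca cola" },
  pizza: { number: "three", size: "large", toppings: ["pepperoni", "mushrooms"] },
});
console.log(JSON.stringify(result, null, 2));
```

Run with Node.js, this prints an object whose drink has liquid "coke" and drinksize "medium", and whose pizza has pizzasize "large", number 3, and topping ["pepperoni", "mushrooms"]. Note that number is the numeric value 3, because the grammar's tag assigns out=3 rather than a string.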
[1] Semantic Interpretation for Speech Recognition (SISR) Version 1.0