
Semantic Interpretation for Speech Recognition


Semantic Interpretation for Speech Recognition (SISR) defines the syntax and semantics of annotations to grammar rules in the Speech Recognition Grammar Specification (SRGS). It has been a World Wide Web Consortium (W3C) Recommendation since 5 April 2007.[1]

Building on SRGS grammars, SISR lets a voice browser use ECMAScript attached to grammar rules to turn a recognized utterance into a structured result and return it to the application. For example, the utterance "I would like a Coca-cola and three large pizzas with pepperoni and mushrooms." can be interpreted into the following object, named order, which an application can process directly:

 {
   drink: {
     liquid:"coke",
     drinksize:"medium"
   },
   pizza: {
     number: 3,
     pizzasize: "large",
     topping: [ "pepperoni", "mushrooms" ]
   }
 }

This result is obtained when the utterance is matched against the following grammar, which adds SISR markup (the tag elements and the tag-format attribute) to a standard SRGS grammar in XML format:

<?xml version="1.0" encoding="UTF-8"?> 
<!DOCTYPE grammar PUBLIC "-//W3C//DTD GRAMMAR 1.0//EN"
                  "http://www.w3.org/TR/speech-grammar/grammar.dtd">
<grammar xmlns="http://www.w3.org/2001/06/grammar" xml:lang="en"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://www.w3.org/2001/06/grammar
                             http://www.w3.org/TR/speech-grammar/grammar.xsd"
         version="1.0" mode="voice" tag-format="semantics/1.0" root="order">
   <rule id="order">
      I would like a
      <ruleref uri="#drink"/>
      <tag>out.drink = new Object(); out.drink.liquid=rules.drink.type;
           out.drink.drinksize=rules.drink.drinksize;</tag>
      and
      <ruleref uri="#pizza"/>
      <tag>out.pizza=rules.pizza;</tag>
   </rule>
   <rule id="kindofdrink">
      <one-of>
         <item>coke</item>
         <item>pepsi</item>
         <item>coca cola<tag>out="coke";</tag></item>
      </one-of>
   </rule>
   <rule id="foodsize">
      <tag>out="medium";</tag> <!-- "medium" is default if nothing said -->
      <item repeat="0-1">
         <one-of>
            <item>small<tag>out="small";</tag></item>
            <item>medium</item>
            <item>large<tag>out="large";</tag></item>
            <item>regular<tag>out="medium";</tag></item>
         </one-of>
      </item>
   </rule>
   <!-- Construct Array of toppings, return Array -->
   <rule id="tops">
      <tag>out=new Array;</tag>
      <ruleref uri="#top"/>
      <tag>out.push(rules.top);</tag>
      <item repeat="1-">
         and
         <ruleref uri="#top"/>
         <tag>out.push(rules.top);</tag>
      </item>
   </rule>
   <rule id="top">
      <one-of>
         <item>anchovies</item>
         <item>pepperoni</item>
         <item>mushroom<tag>out="mushrooms";</tag></item>
         <item>mushrooms</item>
      </one-of>
   </rule>
   <!-- Two properties (drinksize, type) on left hand side Rule Variable -->
   <rule id="drink">
      <ruleref uri="#foodsize"/>
      <ruleref uri="#kindofdrink"/>
      <tag>out.drinksize=rules.foodsize; out.type=rules.kindofdrink;</tag>
   </rule>
   <!-- Three properties on rules.pizza -->
   <rule id="pizza">
      <ruleref uri="#number"/>
      <ruleref uri="#foodsize"/>
      <tag>out.pizzasize=rules.foodsize; out.number=rules.number;</tag>
      pizzas with
      <ruleref uri="#tops"/>
      <tag>out.topping=rules.tops;</tag>
   </rule>
   <rule id="number">
      <one-of>
         <item>
            <tag>out=1;</tag>
            <one-of>
               <item>a</item>
               <item>one</item>
            </one-of>
         </item>
         <item>two<tag>out=2;</tag></item>
         <item>three<tag>out=3;</tag></item>
      </one-of>
   </rule>
</grammar>
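
The way the tag scripts combine can be traced with a small ECMAScript sketch. This is an illustration, not a SISR processor: each function mirrors one rule of the grammar above and performs the same assignments as that rule's tag elements, with local `out` and `rules` variables standing in for the SISR rule variables. The helper names `expect` and `interpret` and the token-list matching are hypothetical stand-ins for a real recognizer. The sketch also assumes that an item without a tag yields its matched text (for example "pepsi").

```javascript
// Sketch only: each function mirrors one <rule> and repeats its <tag> logic.
// Matching against a token list is a hypothetical stand-in for a recognizer.

function expect(tokens, word) {
  if (tokens.shift() !== word) throw new Error('expected "' + word + '"');
}

function kindofdrink(tokens) {
  const word = tokens.shift();
  if (word === "coca" && tokens[0] === "cola") {
    tokens.shift();
    return "coke"; // <tag>out="coke";</tag>
  }
  // Assumption: an item without a tag yields its matched text.
  if (word === "coke" || word === "pepsi") return word;
  throw new Error("unknown drink: " + word);
}

function foodsize(tokens) {
  let out = "medium"; // default if no size is spoken
  const sizes = new Map([["small", "small"], ["medium", "medium"],
                         ["large", "large"], ["regular", "medium"]]);
  if (sizes.has(tokens[0])) out = sizes.get(tokens.shift()); // repeat="0-1"
  return out;
}

function top(tokens) {
  const word = tokens.shift();
  if (word === "mushroom") return "mushrooms"; // <tag>out="mushrooms";</tag>
  if (["anchovies", "pepperoni", "mushrooms"].includes(word)) return word;
  throw new Error("unknown topping: " + word);
}

function tops(tokens) {
  const out = new Array();
  out.push(top(tokens));
  // repeat="1-": the grammar requires at least one further "and <top>".
  do {
    expect(tokens, "and");
    out.push(top(tokens));
  } while (tokens[0] === "and");
  return out;
}

function number(tokens) {
  const values = new Map([["a", 1], ["one", 1], ["two", 2], ["three", 3]]);
  const word = tokens.shift();
  if (!values.has(word)) throw new Error("unknown number: " + word);
  return values.get(word);
}

function drink(tokens) {
  const out = {}, rules = {};
  rules.foodsize = foodsize(tokens);
  rules.kindofdrink = kindofdrink(tokens);
  out.drinksize = rules.foodsize; out.type = rules.kindofdrink;
  return out;
}

function pizza(tokens) {
  const out = {}, rules = {};
  rules.number = number(tokens);
  rules.foodsize = foodsize(tokens);
  out.pizzasize = rules.foodsize; out.number = rules.number;
  expect(tokens, "pizzas");
  expect(tokens, "with");
  rules.tops = tops(tokens);
  out.topping = rules.tops;
  return out;
}

function order(tokens) {
  const out = {}, rules = {};
  for (const word of ["i", "would", "like", "a"]) expect(tokens, word);
  rules.drink = drink(tokens);
  out.drink = new Object(); out.drink.liquid = rules.drink.type;
  out.drink.drinksize = rules.drink.drinksize;
  expect(tokens, "and");
  rules.pizza = pizza(tokens);
  out.pizza = rules.pizza;
  return out;
}

// Hypothetical entry point: lower-case the text and split on non-letters.
function interpret(utterance) {
  const tokens = utterance.toLowerCase().split(/[^a-z]+/).filter(Boolean);
  const result = order(tokens);
  if (tokens.length > 0) throw new Error("unmatched input: " + tokens.join(" "));
  return result;
}

console.log(JSON.stringify(interpret(
  "I would like a Coca-cola and three large pizzas with pepperoni and mushrooms.")));
```

Two things are visible in the trace: the drink size falls back to "medium" because no size word precedes "Coca-cola", and the number rule assigns the numeric value 3 rather than a string.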

  1. Semantic Interpretation for Speech Recognition (SISR) Version 1.0
