SlideShare ist ein Scribd-Unternehmen logo
1 von 18
Downloaden Sie, um offline zu lesen
 Topic voice Browser
 Registration # 2013-ag-6094
 Name javaria kanwal
 Supervisor Miss.uzma satter
Submitted to
Prof. Akmal Rehan
1
VOICE BROWSER
2
WHAT IS A VOICE BROWSER
A voice browser is a device :
that interprets voice input and interprets voice markup
languages to generate voice output.
that interprets a script which specifies exactly what to verbally
present to the user as well as when to present each piece of
information.
3
MOTIVATION
There are 10 times as many telephones as connected PCs.
Cell phones usage is growing dramatically.
Speaking and listening are the natural usage modes for modes. Easy
to use - for people with no knowledge or fear of computers.
Voice interaction can escape the physical limitations on keypads and
displays as mobile devices become ever smaller
4
KEY TECHNOLOGIES
 Speech Recognition
Voice input VoXML file  Text
 Speech Synthesis
Text  VoXML file  Output(Pre-recorded)
5
WORLD WIDE WEB CONSORTIUM(W3C)
World Wide Web Consortium(W3C) develops
interpretable technologies(software and tools) to
lead the web to its full potential as a forum of
information ,commerce and communication.
 W3C Speech interface framework
 VoiceXML
 Speech recognition
 Call control
6
VOICEXML
voiceXML is a dialog markup language designs for
telephony applications where users are restricted to voice
and DTMF (touch tone) input.
7
8
SPEECH RECOGNITION
9
SPEECH GRAMMAR
 Speech grammars allow authors to specify the rules for
covering the sequence of words that users are expected
to say in particular context.
 These contextual clues allow the recognition engine to
focus on likely utterances , improving the chances of the
correct match
10
STOCHASTIC (N GRAM) LANGUAGE MODELS
 Speech grammar is un useful in case of open-
ended prompt e.g. how can I help you
 The solution is to use a stochastic language
models. such models specify the probability that
one word occurs following certain others. the
probabilities are computed from the collection of
utterances collected from many users.
11
SEMANTIC INTERPRETATION
The recognition process matches an utterance to a
speech grammar, building a parse tree as a
byproduct.
There are two approaches to harvesting semantic
rules from the parse tree :
1. Automating grammar rules with semantic
interpretation tags
2. Representing the results in XML
12
CALL CONTROL
 Fine-grained control of speech (signal processing )
resources and telephony resources in a VoiceXML
telephony platform.
 Will enable application developers to use markup to
perform call screening, whisper call waiting call
transfer, and more.
 Can be used to transfer a user from on voice
browser to another on a completely different
machine.
13
APPLICATIONS
 It can be divided into three categories :
 Web Browsing
 Limited information Access
 Spoken Dialog Systems
14
FUTURE
•Voice browsing will become visual(Multi-modal)
•Can be integrated to an OS
•Integrated to every application.
15
CONCLUSION
 Browser technology is changing very fast these
days and we are moving from the visual paradigm
to the voice paradigm.
 Voice browser is the technology to enter this
paradigm.
 Voice browser is a device which interpret voice
input and generate voice output.
16
17
18

Weitere ähnliche Inhalte

Was ist angesagt?

Voicebasedsrs 130319103050-phpapp02
Voicebasedsrs 130319103050-phpapp02Voicebasedsrs 130319103050-phpapp02
Voicebasedsrs 130319103050-phpapp02Lokesh Loki
 
Presentationgroup
PresentationgroupPresentationgroup
Presentationgrouplax8055
 
SMOWSER (A VOICE BASED BROWSER)
SMOWSER (A VOICE BASED BROWSER)SMOWSER (A VOICE BASED BROWSER)
SMOWSER (A VOICE BASED BROWSER)NEERAJ BAGHEL
 
Voice recognition security systems
Voice recognition security systemsVoice recognition security systems
Voice recognition security systemsSandeep Kumar
 
Text to speech with Google Cloud
Text to speech with Google CloudText to speech with Google Cloud
Text to speech with Google CloudRajarshi Ghosh
 
Report on online chatting
Report on online chattingReport on online chatting
Report on online chattingAmandeep Kaur
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech RecognitionThejus Joby
 
IRJET- V-Mail (Voice based E-Mail Application): Review
IRJET-  	  V-Mail (Voice based E-Mail Application): ReviewIRJET-  	  V-Mail (Voice based E-Mail Application): Review
IRJET- V-Mail (Voice based E-Mail Application): ReviewIRJET Journal
 
Voice based email system for physically challenged
Voice based email system for physically challengedVoice based email system for physically challenged
Voice based email system for physically challengedIbrahim Khalil Shakik
 
Introduction to VoiceXml and Voice Web Architecture
Introduction to VoiceXml and Voice Web ArchitectureIntroduction to VoiceXml and Voice Web Architecture
Introduction to VoiceXml and Voice Web ArchitecturePaul Nguyen
 
classification of computer language
classification of computer languageclassification of computer language
classification of computer languageBinamraRegmi
 
Online chatting system
Online chatting systemOnline chatting system
Online chatting systemSamakshgoel3
 
Bluetooth based-chatting-system-using-android-docx
Bluetooth based-chatting-system-using-android-docxBluetooth based-chatting-system-using-android-docx
Bluetooth based-chatting-system-using-android-docxshanofa sanu
 
Text to-speech & voice recognition
Text to-speech & voice recognitionText to-speech & voice recognition
Text to-speech & voice recognitionMark Williams
 

Was ist angesagt? (20)

Voice browser
Voice browserVoice browser
Voice browser
 
Voice browser
Voice browserVoice browser
Voice browser
 
Voicebasedsrs 130319103050-phpapp02
Voicebasedsrs 130319103050-phpapp02Voicebasedsrs 130319103050-phpapp02
Voicebasedsrs 130319103050-phpapp02
 
Presentationgroup
PresentationgroupPresentationgroup
Presentationgroup
 
SMOWSER (A VOICE BASED BROWSER)
SMOWSER (A VOICE BASED BROWSER)SMOWSER (A VOICE BASED BROWSER)
SMOWSER (A VOICE BASED BROWSER)
 
Voice recognition security systems
Voice recognition security systemsVoice recognition security systems
Voice recognition security systems
 
Text to speech with Google Cloud
Text to speech with Google CloudText to speech with Google Cloud
Text to speech with Google Cloud
 
Report on online chatting
Report on online chattingReport on online chatting
Report on online chatting
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
 
IRJET- V-Mail (Voice based E-Mail Application): Review
IRJET-  	  V-Mail (Voice based E-Mail Application): ReviewIRJET-  	  V-Mail (Voice based E-Mail Application): Review
IRJET- V-Mail (Voice based E-Mail Application): Review
 
Voice based email system for physically challenged
Voice based email system for physically challengedVoice based email system for physically challenged
Voice based email system for physically challenged
 
Phonet
PhonetPhonet
Phonet
 
Introduction to VoiceXml and Voice Web Architecture
Introduction to VoiceXml and Voice Web ArchitectureIntroduction to VoiceXml and Voice Web Architecture
Introduction to VoiceXml and Voice Web Architecture
 
classification of computer language
classification of computer languageclassification of computer language
classification of computer language
 
final doc
final docfinal doc
final doc
 
Arbina project
Arbina projectArbina project
Arbina project
 
Online chatting system
Online chatting systemOnline chatting system
Online chatting system
 
Bluetooth based-chatting-system-using-android-docx
Bluetooth based-chatting-system-using-android-docxBluetooth based-chatting-system-using-android-docx
Bluetooth based-chatting-system-using-android-docx
 
Text to-speech & voice recognition
Text to-speech & voice recognitionText to-speech & voice recognition
Text to-speech & voice recognition
 
Computer language
Computer languageComputer language
Computer language
 

Andere mochten auch

Speech recognition project report
Speech recognition project reportSpeech recognition project report
Speech recognition project reportSarang Afle
 
Internet of Things and its Enabling Technologies - RFID
Internet of Things  and its Enabling Technologies - RFIDInternet of Things  and its Enabling Technologies - RFID
Internet of Things and its Enabling Technologies - RFIDSwetha Kogatam
 
Smart Glasses, Augmented Reality
Smart Glasses, Augmented RealitySmart Glasses, Augmented Reality
Smart Glasses, Augmented RealityDroidConTLV
 
Automated attendance system based on facial recognition
Automated attendance system based on facial recognitionAutomated attendance system based on facial recognition
Automated attendance system based on facial recognitionDhanush Kasargod
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversionankit_saluja
 
Face Recognition based Lecture Attendance System
Face Recognition based Lecture Attendance SystemFace Recognition based Lecture Attendance System
Face Recognition based Lecture Attendance SystemKarmesh Maheshwari
 
Automated Face Detection System
Automated Face Detection SystemAutomated Face Detection System
Automated Face Detection SystemAbhiroop Ghatak
 
Smart Glasses Market report 2015: towards 1 billion shipments
Smart Glasses Market report 2015: towards 1 billion shipments Smart Glasses Market report 2015: towards 1 billion shipments
Smart Glasses Market report 2015: towards 1 billion shipments Ori Inbar
 
Smart Glass Technology by Kiran
Smart Glass Technology by KiranSmart Glass Technology by Kiran
Smart Glass Technology by KiranKiran
 
Facial recognition powerpoint
Facial recognition powerpointFacial recognition powerpoint
Facial recognition powerpoint12206695
 
Blue brain project ppt
Blue brain project pptBlue brain project ppt
Blue brain project pptLishita Shah
 
Digital Signature
Digital SignatureDigital Signature
Digital Signaturesaurav5884
 

Andere mochten auch (18)

Speech recognition project report
Speech recognition project reportSpeech recognition project report
Speech recognition project report
 
Mobile number portability
Mobile number portabilityMobile number portability
Mobile number portability
 
Digital Signature
Digital SignatureDigital Signature
Digital Signature
 
Internet of Things and its Enabling Technologies - RFID
Internet of Things  and its Enabling Technologies - RFIDInternet of Things  and its Enabling Technologies - RFID
Internet of Things and its Enabling Technologies - RFID
 
Smart Glasses, Augmented Reality
Smart Glasses, Augmented RealitySmart Glasses, Augmented Reality
Smart Glasses, Augmented Reality
 
Automated attendance system based on facial recognition
Automated attendance system based on facial recognitionAutomated attendance system based on facial recognition
Automated attendance system based on facial recognition
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
Face Recognition based Lecture Attendance System
Face Recognition based Lecture Attendance SystemFace Recognition based Lecture Attendance System
Face Recognition based Lecture Attendance System
 
Automated Face Detection System
Automated Face Detection SystemAutomated Face Detection System
Automated Face Detection System
 
Blue brain
Blue brain Blue brain
Blue brain
 
Smart Glasses Market report 2015: towards 1 billion shipments
Smart Glasses Market report 2015: towards 1 billion shipments Smart Glasses Market report 2015: towards 1 billion shipments
Smart Glasses Market report 2015: towards 1 billion shipments
 
Smart Glass Technology by Kiran
Smart Glass Technology by KiranSmart Glass Technology by Kiran
Smart Glass Technology by Kiran
 
Facial recognition powerpoint
Facial recognition powerpointFacial recognition powerpoint
Facial recognition powerpoint
 
Smart glass introduction
Smart glass introductionSmart glass introduction
Smart glass introduction
 
Best Ever PPT Of Bluebrain
Best Ever PPT Of BluebrainBest Ever PPT Of Bluebrain
Best Ever PPT Of Bluebrain
 
BLUE BRAIN
BLUE BRAINBLUE BRAIN
BLUE BRAIN
 
Blue brain project ppt
Blue brain project pptBlue brain project ppt
Blue brain project ppt
 
Digital Signature
Digital SignatureDigital Signature
Digital Signature
 

Ähnlich wie voice browser

Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversionankit_saluja
 
Voicexml543
Voicexml543Voicexml543
Voicexml543pavisony
 
Artificial Intelligence for Speech Recognition
Artificial Intelligence for Speech RecognitionArtificial Intelligence for Speech Recognition
Artificial Intelligence for Speech RecognitionRHIMRJ Journal
 
AI for voice recognition.pptx
AI for voice recognition.pptxAI for voice recognition.pptx
AI for voice recognition.pptxJhalakDashora
 
CCXML For Advanced Communications Applications
CCXML For Advanced Communications ApplicationsCCXML For Advanced Communications Applications
CCXML For Advanced Communications ApplicationsVoxeo Corp
 
IJSRED-V2I2P5
IJSRED-V2I2P5IJSRED-V2I2P5
IJSRED-V2I2P5IJSRED
 
02 state of the art speech technology using java speech api@egsp 25.08.2011
02 state of the art speech technology using java speech api@egsp 25.08.201102 state of the art speech technology using java speech api@egsp 25.08.2011
02 state of the art speech technology using java speech api@egsp 25.08.2011VinothkumaR Ramu
 
Wake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phoneWake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phoneIJERA Editor
 
ibm språkbanken websphere
ibm språkbanken websphereibm språkbanken websphere
ibm språkbanken webspherealkfdsj
 
8th Ethiopian ICT Conference Bazaar and Exhibition.pptx
8th Ethiopian ICT Conference Bazaar and Exhibition.pptx8th Ethiopian ICT Conference Bazaar and Exhibition.pptx
8th Ethiopian ICT Conference Bazaar and Exhibition.pptxssusera032bc
 

Ähnlich wie voice browser (20)

Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
BTP paper
BTP paperBTP paper
BTP paper
 
Voicexml543
Voicexml543Voicexml543
Voicexml543
 
VoiceXML
VoiceXMLVoiceXML
VoiceXML
 
10.1.1.510.6198
10.1.1.510.619810.1.1.510.6198
10.1.1.510.6198
 
Bt35408413
Bt35408413Bt35408413
Bt35408413
 
Artificial Intelligence for Speech Recognition
Artificial Intelligence for Speech RecognitionArtificial Intelligence for Speech Recognition
Artificial Intelligence for Speech Recognition
 
Voicexml
VoicexmlVoicexml
Voicexml
 
Seminar
SeminarSeminar
Seminar
 
Google Voice-to-text
Google Voice-to-textGoogle Voice-to-text
Google Voice-to-text
 
AI for voice recognition.pptx
AI for voice recognition.pptxAI for voice recognition.pptx
AI for voice recognition.pptx
 
Voice browser1
Voice browser1Voice browser1
Voice browser1
 
CCXML For Advanced Communications Applications
CCXML For Advanced Communications ApplicationsCCXML For Advanced Communications Applications
CCXML For Advanced Communications Applications
 
VOICE RECOGNITION SYSTEM
VOICE RECOGNITION SYSTEMVOICE RECOGNITION SYSTEM
VOICE RECOGNITION SYSTEM
 
IJSRED-V2I2P5
IJSRED-V2I2P5IJSRED-V2I2P5
IJSRED-V2I2P5
 
02 state of the art speech technology using java speech api@egsp 25.08.2011
02 state of the art speech technology using java speech api@egsp 25.08.201102 state of the art speech technology using java speech api@egsp 25.08.2011
02 state of the art speech technology using java speech api@egsp 25.08.2011
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Wake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phoneWake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phone
 
ibm språkbanken websphere
ibm språkbanken websphereibm språkbanken websphere
ibm språkbanken websphere
 
8th Ethiopian ICT Conference Bazaar and Exhibition.pptx
8th Ethiopian ICT Conference Bazaar and Exhibition.pptx8th Ethiopian ICT Conference Bazaar and Exhibition.pptx
8th Ethiopian ICT Conference Bazaar and Exhibition.pptx
 

voice browser

  • 1.  Topic voice Browser  Registration # 2013-ag-6094  Name javaria kanwal  Supervisor Miss.uzma satter Submitted to Prof. Akmal Rehan 1
  • 3. WHAT IS A VOICE BROWSER A voice browser is a device : that interprets voice input and interprets voice markup languages to generate voice output. that interprets a script which specifies exactly what to verbally present to the user as well as when to present each piece of information. 3
  • 4. MOTIVATION There are 10 times as many telephones as connected PCs. Cell phones usage is growing dramatically. Speaking and listening are the natural usage modes for modes. Easy to use - for people with no knowledge or fear of computers. Voice interaction can escape the physical limitations on keypads and displays as mobile devices become ever smaller 4
  • 5. KEY TECHNOLOGIES  Speech Recognition Voice input VoXML file  Text  Speech Synthesis Text  VoXML file  Output(Pre-recorded) 5
  • 6. WORLD WIDE WEB CONSORTIUM(W3C) World Wide Web Consortium(W3C) develops interpretable technologies(software and tools) to lead the web to its full potential as a forum of information ,commerce and communication.  W3C Speech interface framework  VoiceXML  Speech recognition  Call control 6
  • 7. VOICEXML voiceXML is a dialog markup language designs for telephony applications where users are restricted to voice and DTMF (touch tone) input. 7
  • 8. 8
  • 10. SPEECH GRAMMAR  Speech grammars allow authors to specify the rules for covering the sequence of words that users are expected to say in particular context.  These contextual clues allow the recognition engine to focus on likely utterances , improving the chances of the correct match 10
  • 11. STOCHASTIC (N GRAM) LANGUAGE MODELS  Speech grammar is un useful in case of open- ended prompt e.g. how can I help you  The solution is to use a stochastic language models. such models specify the probability that one word occurs following certain others. the probabilities are computed from the collection of utterances collected from many users. 11
  • 12. SEMANTIC INTERPRETATION The recognition process matches an utterance to a speech grammar, building a parse tree as a byproduct. There are two approaches to harvesting semantic rules from the parse tree : 1. Automating grammar rules with semantic interpretation tags 2. Representing the results in XML 12
  • 13. CALL CONTROL  Fine-grained control of speech (signal processing ) resources and telephony resources in a VoiceXML telephony platform.  Will enable application developers to use markup to perform call screening, whisper call waiting call transfer, and more.  Can be used to transfer a user from on voice browser to another on a completely different machine. 13
  • 14. APPLICATIONS  It can be divided into three categories :  Web Browsing  Limited information Access  Spoken Dialog Systems 14
  • 15. FUTURE •Voice browsing will become visual(Multi-modal) •Can be integrated to an OS •Integrated to every application. 15
  • 16. CONCLUSION  Browser technology is changing very fast these days and we are moving from the visual paradigm to the voice paradigm.  Voice browser is the technology to enter this paradigm.  Voice browser is a device which interpret voice input and generate voice output. 16
  • 17. 17
  • 18. 18