SlideShare a Scribd company logo
1 of 15
Voice Browser
Sipna College of Engineering and Technology Page 1
INTRODUCTION
A voice browser is a device which interprets a (voice) markup language and is capable
of generating voice output and/or interpreting voice input and possibly other input/output
modalities . “ The definition of a voice browser, above is a broad one. The fact that the system
deals with speech is obvious given the firstword of the name , but what makes a software system
that interacts with the user via speed a “browser”. The information that the system uses (for
either domain data or dialog flow) is dynamic and comes somewhere from the Internet from an
end-user’s perspective, the moto is to provide a service similar to what graphical browsers of
HTML and related technologies do today, but on devices that are note equipped with full-
browsers or even the screens to support them.
A voice browser can simply be defined as an appliance or a gear which helps in
interpreting a markup language (the markup language referred here is 'voice') and producing a
voice output. It translates a given voice input into a voice output. It is the web browser which
provide the users with an interactive voice user interface. It is obvious from the first word of the
name that the system deals with pages that specify voice dialogues, just as our visual web pages
deals with HTML pages. But the question remains-how does a software system reciprocates to
the user via speech or voice browser? The software system procures its information from the
internet. From a user's outlook, the goal is to provide to the devices which do not have full-
browsers or even the screens to support them, a service which is similar to what the visual web
browsers and the related technologies offer today.[2]
Speech recognition technology is one from the fast growing engineering technologies.
Nearly 20% people of the world are suffering from various disabilities; many of them are blind
or unable to use their hands effectively. They can share information with people by operating
computer through voice input. Voice Browser is capable to recognize the speech and convert the
input audio into text; it also enables a user to perform operations such as open calculator,
WordPad, notepad, log off computer.
Voice Browser
Sipna College of Engineering and Technology Page 2
LITERATURE REVIEW
HTML is designed to be a mark-up language. Many of the structures in a document, such
as hyperlinks, headings, tables and lists, are represented explicitly in the HTML file for the
document by ‘tags’. It is the task of a web-browsing program to interpret the tags, to format the
content and to present the information to the user visually. There are several possibilities to re-
represent the content through the audio channels. One possible approach is to purposely design
an audio document for the relevant web page. It may involve the author making an explicit
recording of the document or parts of the document. Though this seems like the best strategy to
ensure the author’s intent is accurately rendered, it means that authors must create two
documents for everything they write, which is obviously impractical. A similar approach is the
development of a mark-up language for use with voice browsing applications. This is the long-
term solution offered by the W3C group. All the web documents are expected to be marked up
according to a VoiceXML specification (W3C, 2000), and that browsing products need then only
read and interpret these voice-specific tags to produce an audio version of the document.
However this requires not only a global acceptance of the specification, but that all authors then
use this specification when designing their HTML documents. Otherwise, only certain web pages
will be ‘viewable’ using compliant voice browsing applications.[5]
Standardization to voice browsing technique were given by:
The World Wide Web Consortium (W3C) develops interoperable technologies
(specifications, guidelines, software, and tools) to lead the Web to its full potential as a forum for
information, commerce, communication, and collectiveunder standing. W3C which includes:
1 .Voice Browser Working Group
2. Speech Interface Framework
1] Voice Browser Working Group It was established on 26 March 1999 and re-chartered through
31 January 2009. W3C voice browser working group made the speech interface framework
possible . This framework allows developers to create speech enabled applications that are based
on Web technologies.
Voice Browser
Sipna College of Engineering and Technology Page 3
The framework also provides developer with an environment that will be familiar to those. The
Aim of the W3C Working Group is to enable users to speak and listen to Web applications by
making standard languages for developing Web-based speech applications. This Working Group
concentrates on languages for capturing and producing speech and managing the conversation
between user and computer system, while a related Group, the Multimodal Interaction Working
Group, works on additional input modes including keyboard and mouse, ink and pen, etc. Its
recommendations have been reviewed by w3c group Members, by software developers, and
other interested parties, and are also endorsed by the Director as Web Standards.
2]Speech Interface Framework:
These framework includes: Voice XML: a language for creating audio dialogs that
feature synthesized speech, digitized audio, recognition of spoken and DTMF key input,
recording of spoken input, who are familiar with Web development techniques. So, applications
are written using parts of speech interface framework. Thus speech applications are written in
VoiceXML and are rendered through a Voice Browser. In much the same way as Web
applications are written in html and run on a Web browser.
As per estimation, over 85% of Interactive Voice Response (IVR) applications for
telephones (including mobile) use W3C's Voice XML standard. Voice Browser Working Group
are coordinating their efforts to make the Web available on more devices and in more situations.
telephony, and mixed initiative conversations.[1]
Some of its versions are:
• VoiceXML 1.0: designed for creating audio dialogs.
• VoiceXML 2.0: uses form interpretation algorithm(FIA).
• VoiceXML 2.1: 8 additional elements in FIA.
• Voice XML 3.0: relationship between semantics and syntax.
Voice Browser
Sipna College of Engineering and Technology Page 4
WORKING
Voice-based web to make information accessible to users who may not be able to
read or write, or who do not have access to the Internet. Users can access the voice-based web
using a toll-free number, through a variety of ways including a voice recognition system or a
tone phone. Unlike a computer interface, a voice interface needs no keyboard, no mouse, no
screen, freeing users from these barriers to access and action. It requires no training. It is
accessible to anyone with a telephone. Voice is mobile—information can be sent and retrieved
from anywhere. Since customers can have access at anytime from anywhere, voice makes it
possible to use time more effectively. Fast and efficient, voice frees users from not only the
desktop, but even the laptop.
The user gives the request through the voice or text using phone ,personal computer or
Touch tone. The request goes to the voice browser. If the request is voice, speech recognition
converts voice into text. Checks the grammars and then using speech synthesis to convert text
into pre-recorded audio. The recorded audio should be store in the administrator. It should
display to the user.
Voice Browser
Sipna College of Engineering and Technology Page 5
Fig 1. Block Diagram of Voice Browser
VoiceXML
scripts
Telephone
calls
Speech
recongnition
n
Request
through voice
Grammars
Voice
Browser
Audiofiles
Touch tone
Multimedia
files
Admin
Maintain
database
User
Request
through text
Resolve
request typeHTML
scripts
Voice Browser
Sipna College of Engineering and Technology Page 6
Fig. 2. Uploading and downloading
User
Request via
touch tone
Feedbac
k
Request
via phone
Request
name
Downlo
ad
Upload
Send
to
Voice
xml
Gramm
ars
Audio
files
Speech
synthesis
Voice Browser
Administrator
Search
Permissi
on grant
Delete
member
s
Updatio
n
Manage
Data base
Maintain
information
Reslove
request
type
Receive
request
Serve
r
Voice Browser
Sipna College of Engineering and Technology Page 7
User Interaction via Browser
Fig 3 : Sequence diagram
USER VISUAL
BROWSER
VOICE
BROWSER
ADMIN
request for home page
search content
send html files
voice request
send voice xml files
display
text or voice output
generate html files
grammar checking
pre-recorded audio
Voice Browser
Sipna College of Engineering and Technology Page 8
Admin -Administrator has the authority for convert the voice into text,text into voice and then
displaying to the user.
ASR-Automatic Speech Recognition is to convert the speech into text.
Fig 4 Block diagram for conversion of voice into text.
In above diagram ,voice is as input which is to be converted into text data. Voice is
analog quantity thus it can handle by digital for that purpose in above diagram we use analog to
digital converter then this voice is get divide into two parts, every part is pass through Acoustic
model and Language model respectively. The output of this two parts are combine and pass
through speech engine. Speech engine further process the signal and final convert speech into
text.[1]
Acoustic Model
An acoustic model is created by taking audio recordings of speech, and their text
transcriptions, and using software to create statistical representations of the sounds that make up
each word. It is used by a speech recognition engine to recognize speech.
Voice Browser
Sipna College of Engineering and Technology Page 9
Language Model
A language model is a file containing the probabilities of sequences of words. Language
models are used for dictation applications, whereas grammars are used in desktop command and
control or telephony interactive voice response (IVR) type applications.
Speech Engine
A speech engine is software that gives your computer the ability to play back text in a
spoken voice (referred to as text-to-speech or TTS).
VoiceXML
VoiceXML is a dialog markup language designed for telephony applications, where
users are restricted to voice and DTMF (touch tone) input. There are other languages: VoXML,
omniviewXML text.
Speech grammars
In most cases, user prompts are very carefully designed to encourage the user to answer
in a form that matches context free grammar rules. Speech Grammars allow authors to specify
rules covering the sequences of words that users are expected to say in particular contexts. These
contexual clues allow the recognition engine to focus on likely utterances, improving the chances
of a correct match.
Voice Browser
Sipna College of Engineering and Technology Page 10
Differences Between Graphical & Voice Browsing
Visual and aural are two most important channels of information processing. While most
of the interaction with computers have been designed around the visual channel, there are
circumstances where voice based man-machine interaction becomes preferable, and in some
cases, necessary, given that voice based interaction comes naturally to humans and can be used
by illiterate people easily. Voice User Interfaces (VUIs), however, are linear and nonpersistent,
thus have serious implications on the working memory load. Compared to a visual interface,
VUIs (considering Interactive Voice Response system) is slow as access is sequential, rather than
random. Moreover, however robust a Speech Recognition (SR) platform may be, it can never
achieve 100% accuracy. This results in an error prone interaction. In addition, speech interaction
may require higher user attention, and take a longer time to complete tasks, as compared to using
Graphical User Interface (GUI).
Graphical browsing is more passive due to the persistence of the visual information.
Voice browsing is more active since the user has to issue commands. Graphical Browsers are
client-based, whereas Voice Browsers are server-based.[6]
Voice Browser
Sipna College of Engineering and Technology Page 11
ADVANTAGES
 Less space requirements.
 Portable voice browsers can also be implemented.
 Practical interface for functionally blind users.
 Users can browse web while keeping there hands and eyes for other jobs.
 Voice interaction can escape the physical limitations on keypads and displays as mobile
devices become ever smaller.
APPLICATION
Voice Browser
Sipna College of Engineering and Technology Page 12
The speech technology is supposed to grow rapidly. The voice portal market is going to reach
billions in just a few years. It is estimated by the kelsey group that voice browsing market will
reach 6.5 billion dollars, while OVUM estimates a world market of 26 billion dollars. Anyone
may guess the actual growth of the industry of voice technology due to variations in these
figures. It is very difficult to navigate on a WAP to scroll through many lists. Hands-free
interaction enables us to develop an easy communication between the user and the system. [3]
Voice browsing can be used to access three kinds of information:
(a)Business: information like automated telephone ordering services, support desks, order
tracking, airline arrival and departure information services, cinema and theatre booking services,
home banking services, etc can be retrieved using voice browsing very easily.
(b)Public: voice browser can be used to access services like local , national and international
news alongwith community information such as weather forecasting, traffic conditions, school
closure and events. it can also be used to gather information on national and international stock
market information and also business and e-commerce transactions.
(c) Personal use : It is used in accessing personal information like voice mails, personal
horoscope ,personal newsletter, calendars, address and telephone lists etc.
Voice Browser
Sipna College of Engineering and Technology Page 13
FUTURE SCOPE
 Accuracy will become better and better by using better speech reorganization.
 Dictation speech recognition will gradually become accepted
 Greater use will be made of “intelligent systems” which will attempt to guess what the
speaker intended to say, rather than what was actually said, as people often misspeak and
make unintentional mistakes.
 Microphone and sound systems will be designed to adapt more quickly to changing
background noise levels, different environments, with better recognition of extraneous
material to be discarded.
Voice Browser
Sipna College of Engineering and Technology Page 14
CONCLUSION
In order to make technology more familiar to the user its access should be made more
easier. As we know that visual internet access experiences various limitations such as people
who are physically handicapped (specially blind users) cannot use keypads or touch screens for
giving instructions. Above all these limitations today’s generation demands to use internet
independent of PC’s and also hands free access to it. For this VOICE BROWSING is an
intelligent idea. This allows user to access web even in situations like driving etc where user
operate web just by listening and speaking rather than typing. Thus at last we conclude that
Voice browsing provides a natural way of accessing webs. Now it is up to the developers to take
up some inventory measures in order to bring this technology to us in a more colorful way.[4]
Voice Browser
Sipna College of Engineering and Technology Page 15
REFERENCE
[1]L. D. Catledge and J. E. Pitkow. Characterizing browsing strategies in the World Wide Web.
Computer Networks and ISDN Systems, 27(6):1065–1073, 1995.
[2]M.Bruynooghe and al.From Interpretation : towards the Global Optimization of prolog
Programs. In Proc. 1987 Symposium on logic programming, San Francisco,CA.
[3]International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume1 Issue
2 Nov 2012.
[4] Raman, T. V. (1996). Emacspeak - Direct Speech Access. ASSETS '96: The Second Annual
ACM Conference on Assistive Technologies, pp. 32-36, New York, ACM SIGCAPH.
[5]Beasley, R. et al.: Voice Application Development with VoiceXML. USA: Sams Publishing,
August 2001. (ISBN 0-672-32138)
[6] THE NEW ERA OF BROWSING -VOICE BROWSING Khushbu
1
, Manika Kapoor
2
,
Ayesha Tafsir
3
paper published at www.ijecs.in International Journal Of Engineering And
Computer Science ISSN:2319-7242 .

More Related Content

What's hot

What's hot (20)

Artificial Intelligence for Speech Recognition
Artificial Intelligence for Speech RecognitionArtificial Intelligence for Speech Recognition
Artificial Intelligence for Speech Recognition
 
Voice browser
Voice browserVoice browser
Voice browser
 
Voicexml ppt
Voicexml pptVoicexml ppt
Voicexml ppt
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data mining
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by Iqbal
 
Voicexml
VoicexmlVoicexml
Voicexml
 
Speech recognition an overview
Speech recognition   an overviewSpeech recognition   an overview
Speech recognition an overview
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Text to-speech & voice recognition
Text to-speech & voice recognitionText to-speech & voice recognition
Text to-speech & voice recognition
 
Virtual Personal Assistant
Virtual Personal AssistantVirtual Personal Assistant
Virtual Personal Assistant
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speech
 
Artificial intelligence for speech recognition
Artificial intelligence for speech recognitionArtificial intelligence for speech recognition
Artificial intelligence for speech recognition
 
Voicemorphing
VoicemorphingVoicemorphing
Voicemorphing
 
The Use of Artificial Intelligence and Machine Learning in Speech Recognition
The Use of Artificial Intelligence and Machine Learning in Speech RecognitionThe Use of Artificial Intelligence and Machine Learning in Speech Recognition
The Use of Artificial Intelligence and Machine Learning in Speech Recognition
 
Visual speech to text conversion applicable to telephone communication
Visual speech to text conversion  applicable  to telephone communicationVisual speech to text conversion  applicable  to telephone communication
Visual speech to text conversion applicable to telephone communication
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentation
 
Deep Learning For Speech Recognition
Deep Learning For Speech RecognitionDeep Learning For Speech Recognition
Deep Learning For Speech Recognition
 

Viewers also liked (6)

Voice based web browser
Voice based web browserVoice based web browser
Voice based web browser
 
Voicebasedsrs 130319103050-phpapp02
Voicebasedsrs 130319103050-phpapp02Voicebasedsrs 130319103050-phpapp02
Voicebasedsrs 130319103050-phpapp02
 
Synopsis Presentation
Synopsis PresentationSynopsis Presentation
Synopsis Presentation
 
voice browser
voice browservoice browser
voice browser
 
REST to JavaScript for Better Client-side Development
REST to JavaScript for Better Client-side DevelopmentREST to JavaScript for Better Client-side Development
REST to JavaScript for Better Client-side Development
 
Voice morphing ppt
Voice morphing pptVoice morphing ppt
Voice morphing ppt
 

Similar to voice browser

Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
Thejus Joby
 
Software (fundamentals)
Software (fundamentals)Software (fundamentals)
Software (fundamentals)
JDoughty1
 
An Application for Performing Real Time Speech Translation in Mobile Environment
An Application for Performing Real Time Speech Translation in Mobile EnvironmentAn Application for Performing Real Time Speech Translation in Mobile Environment
An Application for Performing Real Time Speech Translation in Mobile Environment
Association of Scientists, Developers and Faculties
 
Device Independence
Device IndependenceDevice Independence
Device Independence
bjornh
 

Similar to voice browser (20)

Hak voice-browser
Hak voice-browserHak voice-browser
Hak voice-browser
 
Phonet
PhonetPhonet
Phonet
 
Voice based web browser
Voice based web browserVoice based web browser
Voice based web browser
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
 
final doc
final docfinal doc
final doc
 
Accessing Scholarly Content through FOSS based Assistive Technology
Accessing Scholarly Content through FOSS based Assistive TechnologyAccessing Scholarly Content through FOSS based Assistive Technology
Accessing Scholarly Content through FOSS based Assistive Technology
 
Software (fundamentals)
Software (fundamentals)Software (fundamentals)
Software (fundamentals)
 
IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing System
 
Toward a New Algorithm for Hands Free Browsing
Toward a New Algorithm for Hands Free BrowsingToward a New Algorithm for Hands Free Browsing
Toward a New Algorithm for Hands Free Browsing
 
An Application for Performing Real Time Speech Translation in Mobile Environment
An Application for Performing Real Time Speech Translation in Mobile EnvironmentAn Application for Performing Real Time Speech Translation in Mobile Environment
An Application for Performing Real Time Speech Translation in Mobile Environment
 
30
3030
30
 
DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...
DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...
DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...
 
DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...
DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...
DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...
 
DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...
DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...
DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...
 
DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...
DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...
DEVELOPMENT OF AN INTEGRATED TOOL THAT SUMMARRIZE AND PRODUCE THE SIGN LANGUA...
 
visH (fin).pptx
visH (fin).pptxvisH (fin).pptx
visH (fin).pptx
 
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
 
Device Independence
Device IndependenceDevice Independence
Device Independence
 
Leveraging machine learning in text to-speech tools and applications.
Leveraging machine learning in text to-speech tools and applications.Leveraging machine learning in text to-speech tools and applications.
Leveraging machine learning in text to-speech tools and applications.
 
Google Voice-to-text
Google Voice-to-textGoogle Voice-to-text
Google Voice-to-text
 

Recently uploaded

Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
gajnagarg
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men 🔝Ongole🔝 Escorts S...
➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men  🔝Ongole🔝   Escorts S...➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men  🔝Ongole🔝   Escorts S...
➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men 🔝Ongole🔝 Escorts S...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
amitlee9823
 

Recently uploaded (20)

VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning Approach
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men 🔝Ongole🔝 Escorts S...
➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men  🔝Ongole🔝   Escorts S...➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men  🔝Ongole🔝   Escorts S...
➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men 🔝Ongole🔝 Escorts S...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
 

voice browser

  • 1. Voice Browser Sipna College of Engineering and Technology Page 1 INTRODUCTION A voice browser is a device which interprets a (voice) markup language and is capable of generating voice output and/or interpreting voice input and possibly other input/output modalities . “ The definition of a voice browser, above is a broad one. The fact that the system deals with speech is obvious given the firstword of the name , but what makes a software system that interacts with the user via speed a “browser”. The information that the system uses (for either domain data or dialog flow) is dynamic and comes somewhere from the Internet from an end-user’s perspective, the moto is to provide a service similar to what graphical browsers of HTML and related technologies do today, but on devices that are note equipped with full- browsers or even the screens to support them. A voice browser can simply be defined as an appliance or a gear which helps in interpreting a markup language (the markup language referred here is 'voice') and producing a voice output. It translates a given voice input into a voice output. It is the web browser which provide the users with an interactive voice user interface. It is obvious from the first word of the name that the system deals with pages that specify voice dialogues, just as our visual web pages deals with HTML pages. But the question remains-how does a software system reciprocates to the user via speech or voice browser? The software system procures its information from the internet. From a user's outlook, the goal is to provide to the devices which do not have full- browsers or even the screens to support them, a service which is similar to what the visual web browsers and the related technologies offer today.[2] Speech recognition technology is one from the fast growing engineering technologies. Nearly 20% people of the world are suffering from various disabilities; many of them are blind or unable to use their hands effectively. They can share information with people by operating computer through voice input. Voice Browser is capable to recognize the speech and convert the input audio into text; it also enables a user to perform operations such as open calculator, WordPad, notepad, log off computer.
  • 2. Voice Browser Sipna College of Engineering and Technology Page 2 LITERATURE REVIEW HTML is designed to be a mark-up language. Many of the structures in a document, such as hyperlinks, headings, tables and lists, are represented explicitly in the HTML file for the document by ‘tags’. It is the task of a web-browsing program to interpret the tags, to format the content and to present the information to the user visually. There are several possibilities to re- represent the content through the audio channels. One possible approach is to purposely design an audio document for the relevant web page. It may involve the author making an explicit recording of the document or parts of the document. Though this seems like the best strategy to ensure the author’s intent is accurately rendered, it means that authors must create two documents for everything they write, which is obviously impractical. A similar approach is the development of a mark-up language for use with voice browsing applications. This is the long- term solution offered by the W3C group. All the web documents are expected to be marked up according to a VoiceXML specification (W3C, 2000), and that browsing products need then only read and interpret these voice-specific tags to produce an audio version of the document. However this requires not only a global acceptance of the specification, but that all authors then use this specification when designing their HTML documents. Otherwise, only certain web pages will be ‘viewable’ using compliant voice browsing applications.[5] Standardization to voice browsing technique were given by: The World Wide Web Consortium (W3C) develops interoperable technologies (specifications, guidelines, software, and tools) to lead the Web to its full potential as a forum for information, commerce, communication, and collectiveunder standing. W3C which includes: 1 .Voice Browser Working Group 2. Speech Interface Framework 1] Voice Browser Working Group It was established on 26 March 1999 and re-chartered through 31 January 2009. W3C voice browser working group made the speech interface framework possible . This framework allows developers to create speech enabled applications that are based on Web technologies.
  • 3. Voice Browser Sipna College of Engineering and Technology Page 3 The framework also provides developer with an environment that will be familiar to those. The Aim of the W3C Working Group is to enable users to speak and listen to Web applications by making standard languages for developing Web-based speech applications. This Working Group concentrates on languages for capturing and producing speech and managing the conversation between user and computer system, while a related Group, the Multimodal Interaction Working Group, works on additional input modes including keyboard and mouse, ink and pen, etc. Its recommendations have been reviewed by w3c group Members, by software developers, and other interested parties, and are also endorsed by the Director as Web Standards. 2]Speech Interface Framework: These framework includes: Voice XML: a language for creating audio dialogs that feature synthesized speech, digitized audio, recognition of spoken and DTMF key input, recording of spoken input, who are familiar with Web development techniques. So, applications are written using parts of speech interface framework. Thus speech applications are written in VoiceXML and are rendered through a Voice Browser. In much the same way as Web applications are written in html and run on a Web browser. As per estimation, over 85% of Interactive Voice Response (IVR) applications for telephones (including mobile) use W3C's Voice XML standard. Voice Browser Working Group are coordinating their efforts to make the Web available on more devices and in more situations. telephony, and mixed initiative conversations.[1] Some of its versions are: • VoiceXML 1.0: designed for creating audio dialogs. • VoiceXML 2.0: uses form interpretation algorithm(FIA). • VoiceXML 2.1: 8 additional elements in FIA. • Voice XML 3.0: relationship between semantics and syntax.
  • 4. Voice Browser Sipna College of Engineering and Technology Page 4 WORKING Voice-based web to make information accessible to users who may not be able to read or write, or who do not have access to the Internet. Users can access the voice-based web using a toll-free number, through a variety of ways including a voice recognition system or a tone phone. Unlike a computer interface, a voice interface needs no keyboard, no mouse, no screen, freeing users from these barriers to access and action. It requires no training. It is accessible to anyone with a telephone. Voice is mobile—information can be sent and retrieved from anywhere. Since customers can have access at anytime from anywhere, voice makes it possible to use time more effectively. Fast and efficient, voice frees users from not only the desktop, but even the laptop. The user gives the request through the voice or text using phone ,personal computer or Touch tone. The request goes to the voice browser. If the request is voice, speech recognition converts voice into text. Checks the grammars and then using speech synthesis to convert text into pre-recorded audio. The recorded audio should be store in the administrator. It should display to the user.
  • 5. Voice Browser Sipna College of Engineering and Technology Page 5 Fig 1. Block Diagram of Voice Browser VoiceXML scripts Telephone calls Speech recongnition n Request through voice Grammars Voice Browser Audiofiles Touch tone Multimedia files Admin Maintain database User Request through text Resolve request typeHTML scripts
  • 6. Voice Browser Sipna College of Engineering and Technology Page 6 Fig. 2. Uploading and downloading User Request via touch tone Feedbac k Request via phone Request name Downlo ad Upload Send to Voice xml Gramm ars Audio files Speech synthesis Voice Browser Administrator Search Permissi on grant Delete member s Updatio n Manage Data base Maintain information Reslove request type Receive request Serve r
  • 7. Voice Browser Sipna College of Engineering and Technology Page 7 User Interaction via Browser Fig 3 : Sequence diagram USER VISUAL BROWSER VOICE BROWSER ADMIN request for home page search content send html files voice request send voice xml files display text or voice output generate html files grammar checking pre-recorded audio
  • 8. Voice Browser Sipna College of Engineering and Technology Page 8 Admin -Administrator has the authority for convert the voice into text,text into voice and then displaying to the user. ASR-Automatic Speech Recognition is to convert the speech into text. Fig 4 Block diagram for conversion of voice into text. In above diagram ,voice is as input which is to be converted into text data. Voice is analog quantity thus it can handle by digital for that purpose in above diagram we use analog to digital converter then this voice is get divide into two parts, every part is pass through Acoustic model and Language model respectively. The output of this two parts are combine and pass through speech engine. Speech engine further process the signal and final convert speech into text.[1] Acoustic Model An acoustic model is created by taking audio recordings of speech, and their text transcriptions, and using software to create statistical representations of the sounds that make up each word. It is used by a speech recognition engine to recognize speech.
  • 9. Voice Browser Sipna College of Engineering and Technology Page 9 Language Model A language model is a file containing the probabilities of sequences of words. Language models are used for dictation applications, whereas grammars are used in desktop command and control or telephony interactive voice response (IVR) type applications. Speech Engine A speech engine is software that gives your computer the ability to play back text in a spoken voice (referred to as text-to-speech or TTS). VoiceXML VoiceXML is a dialog markup language designed for telephony applications, where users are restricted to voice and DTMF (touch tone) input. There are other languages: VoXML, omniviewXML text. Speech grammars In most cases, user prompts are very carefully designed to encourage the user to answer in a form that matches context free grammar rules. Speech Grammars allow authors to specify rules covering the sequences of words that users are expected to say in particular contexts. These contexual clues allow the recognition engine to focus on likely utterances, improving the chances of a correct match.
  • 10. Voice Browser Sipna College of Engineering and Technology Page 10 Differences Between Graphical & Voice Browsing Visual and aural are two most important channels of information processing. While most of the interaction with computers have been designed around the visual channel, there are circumstances where voice based man-machine interaction becomes preferable, and in some cases, necessary, given that voice based interaction comes naturally to humans and can be used by illiterate people easily. Voice User Interfaces (VUIs), however, are linear and nonpersistent, thus have serious implications on the working memory load. Compared to a visual interface, VUIs (considering Interactive Voice Response system) is slow as access is sequential, rather than random. Moreover, however robust a Speech Recognition (SR) platform may be, it can never achieve 100% accuracy. This results in an error prone interaction. In addition, speech interaction may require higher user attention, and take a longer time to complete tasks, as compared to using Graphical User Interface (GUI). Graphical browsing is more passive due to the persistence of the visual information. Voice browsing is more active since the user has to issue commands. Graphical Browsers are client-based, whereas Voice Browsers are server-based.[6]
  • 11. Voice Browser Sipna College of Engineering and Technology Page 11 ADVANTAGES  Less space requirements.  Portable voice browsers can also be implemented.  Practical interface for functionally blind users.  Users can browse web while keeping there hands and eyes for other jobs.  Voice interaction can escape the physical limitations on keypads and displays as mobile devices become ever smaller. APPLICATION
  • 12. Voice Browser Sipna College of Engineering and Technology Page 12 The speech technology is supposed to grow rapidly. The voice portal market is going to reach billions in just a few years. It is estimated by the kelsey group that voice browsing market will reach 6.5 billion dollars, while OVUM estimates a world market of 26 billion dollars. Anyone may guess the actual growth of the industry of voice technology due to variations in these figures. It is very difficult to navigate on a WAP to scroll through many lists. Hands-free interaction enables us to develop an easy communication between the user and the system. [3] Voice browsing can be used to access three kinds of information: (a)Business: information like automated telephone ordering services, support desks, order tracking, airline arrival and departure information services, cinema and theatre booking services, home banking services, etc can be retrieved using voice browsing very easily. (b)Public: voice browser can be used to access services like local , national and international news alongwith community information such as weather forecasting, traffic conditions, school closure and events. it can also be used to gather information on national and international stock market information and also business and e-commerce transactions. (c) Personal use : It is used in accessing personal information like voice mails, personal horoscope ,personal newsletter, calendars, address and telephone lists etc.
  • 13. Voice Browser Sipna College of Engineering and Technology Page 13 FUTURE SCOPE  Accuracy will become better and better by using better speech reorganization.  Dictation speech recognition will gradually become accepted  Greater use will be made of “intelligent systems” which will attempt to guess what the speaker intended to say, rather than what was actually said, as people often misspeak and make unintentional mistakes.  Microphone and sound systems will be designed to adapt more quickly to changing background noise levels, different environments, with better recognition of extraneous material to be discarded.
  • 14. Voice Browser Sipna College of Engineering and Technology Page 14 CONCLUSION In order to make technology more familiar to the user its access should be made more easier. As we know that visual internet access experiences various limitations such as people who are physically handicapped (specially blind users) cannot use keypads or touch screens for giving instructions. Above all these limitations today’s generation demands to use internet independent of PC’s and also hands free access to it. For this VOICE BROWSING is an intelligent idea. This allows user to access web even in situations like driving etc where user operate web just by listening and speaking rather than typing. Thus at last we conclude that Voice browsing provides a natural way of accessing webs. Now it is up to the developers to take up some inventory measures in order to bring this technology to us in a more colorful way.[4]
  • 15. Voice Browser Sipna College of Engineering and Technology Page 15 REFERENCE [1]L. D. Catledge and J. E. Pitkow. Characterizing browsing strategies in the World Wide Web. Computer Networks and ISDN Systems, 27(6):1065–1073, 1995. [2]M.Bruynooghe and al.From Interpretation : towards the Global Optimization of prolog Programs. In Proc. 1987 Symposium on logic programming, San Francisco,CA. [3]International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume1 Issue 2 Nov 2012. [4] Raman, T. V. (1996). Emacspeak - Direct Speech Access. ASSETS '96: The Second Annual ACM Conference on Assistive Technologies, pp. 32-36, New York, ACM SIGCAPH. [5]Beasley, R. et al.: Voice Application Development with VoiceXML. USA: Sams Publishing, August 2001. (ISBN 0-672-32138) [6] THE NEW ERA OF BROWSING -VOICE BROWSING Khushbu 1 , Manika Kapoor 2 , Ayesha Tafsir 3 paper published at www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 .