SlideShare ist ein Scribd-Unternehmen logo
1 von 23
AI for Accessibility
Improving Reading Fluency with Pronunciation Assessment
Luis Beltrán
Luis Beltrán
@darkicebeam
luis@luisbeltran.mx
luisbeltran.mx
Pronunciation Assessment in Speech Studio
https://speech.microsoft.com/portal
Developing an app
Speech SDK:
https://docs.microsoft.com/en-us/azure/cognitive-
services/speech-service/how-to-pronunciation-
assessment?pivots=programming-language-csharp
Result parameters
Demo: Pronunciation Assessment in mobile
application
Q & A
Useful links
Cool demo:
https://github.com/Azure-Samples/Cognitive-Speech-
TTS/tree/master/PronunciationAssessment/BrowserJS
Documentation:
https://docs.microsoft.com/en-us/azure/cognitive-services/speech-
service/how-to-pronunciation-assessment
Sample codes:
https://github.com/Azure-Samples/Cognitive-Speech-
TTS/tree/master/PronunciationAssessment
Thank you for your attention!
About Me:
https://about.me/luis-beltran

Weitere ähnliche Inhalte

Ähnlich wie AI for Accessibility.pptx

Ähnlich wie AI for Accessibility.pptx (20)

TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...
TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...
TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...
 
Solvion Trendwerkstatt - Microsoft Azure + Bots
Solvion Trendwerkstatt - Microsoft Azure + BotsSolvion Trendwerkstatt - Microsoft Azure + Bots
Solvion Trendwerkstatt - Microsoft Azure + Bots
 
Webinar 3 : Make your chatbot as dynamic as your employees
Webinar 3 : Make your chatbot as dynamic as your employeesWebinar 3 : Make your chatbot as dynamic as your employees
Webinar 3 : Make your chatbot as dynamic as your employees
 
Azure Global Bootcamp 2017 - Microsoft Cognitive Services
Azure Global Bootcamp 2017 - Microsoft Cognitive ServicesAzure Global Bootcamp 2017 - Microsoft Cognitive Services
Azure Global Bootcamp 2017 - Microsoft Cognitive Services
 
Roopa_software_engineering
Roopa_software_engineeringRoopa_software_engineering
Roopa_software_engineering
 
Custom vision app step by step and cognitive service quick view
Custom vision app step by step and cognitive service quick viewCustom vision app step by step and cognitive service quick view
Custom vision app step by step and cognitive service quick view
 
Da 0 all'AI conversazionale usando Microsoft Azure
Da 0 all'AI conversazionale usando Microsoft AzureDa 0 all'AI conversazionale usando Microsoft Azure
Da 0 all'AI conversazionale usando Microsoft Azure
 
Building Bots with Azure and consume anywhere.pptx
Building Bots with Azure and consume anywhere.pptxBuilding Bots with Azure and consume anywhere.pptx
Building Bots with Azure and consume anywhere.pptx
 
Resume, Rosalyn A. Collins - Technical Writer
Resume, Rosalyn A. Collins - Technical WriterResume, Rosalyn A. Collins - Technical Writer
Resume, Rosalyn A. Collins - Technical Writer
 
Integration Monday - Logic Apps: Development Experiences
Integration Monday - Logic Apps: Development ExperiencesIntegration Monday - Logic Apps: Development Experiences
Integration Monday - Logic Apps: Development Experiences
 
CV - Jaspreet Singh
CV - Jaspreet SinghCV - Jaspreet Singh
CV - Jaspreet Singh
 
Benham Gibson Resume
Benham Gibson ResumeBenham Gibson Resume
Benham Gibson Resume
 
Lavina CV
Lavina CVLavina CV
Lavina CV
 
Kumar G - AI BOT
Kumar G - AI BOTKumar G - AI BOT
Kumar G - AI BOT
 
Serverless AI - London Loft
Serverless AI - London LoftServerless AI - London Loft
Serverless AI - London Loft
 
Gowrisankar_Resume
Gowrisankar_ResumeGowrisankar_Resume
Gowrisankar_Resume
 
Microsoft Teams Development - Conversational AI
Microsoft Teams Development - Conversational AIMicrosoft Teams Development - Conversational AI
Microsoft Teams Development - Conversational AI
 
Light Side of Microsoft: AI and research projects you never heard of before
Light Side of Microsoft: AI and research projects you never heard of beforeLight Side of Microsoft: AI and research projects you never heard of before
Light Side of Microsoft: AI and research projects you never heard of before
 
Brochure - Software Development Learning Path
 Brochure - Software Development Learning Path Brochure - Software Development Learning Path
Brochure - Software Development Learning Path
 
Capture the Cloud with Azure
Capture the Cloud with AzureCapture the Cloud with Azure
Capture the Cloud with Azure
 

Mehr von Luis775803

Mehr von Luis775803 (20)

TalentLand - Entendiendo tus documentos con Azure Form Recognizer.pptx
TalentLand - Entendiendo tus documentos con Azure Form Recognizer.pptxTalentLand - Entendiendo tus documentos con Azure Form Recognizer.pptx
TalentLand - Entendiendo tus documentos con Azure Form Recognizer.pptx
 
IA Conversacional con Power Virtual Agents.pptx
IA Conversacional con Power Virtual Agents.pptxIA Conversacional con Power Virtual Agents.pptx
IA Conversacional con Power Virtual Agents.pptx
 
Colombia Cloud Bootcamp - IA y Accesibilidad Pronunciation Assessment.pptx
Colombia Cloud Bootcamp - IA y Accesibilidad Pronunciation Assessment.pptxColombia Cloud Bootcamp - IA y Accesibilidad Pronunciation Assessment.pptx
Colombia Cloud Bootcamp - IA y Accesibilidad Pronunciation Assessment.pptx
 
STEMWeek - Entendiendo tus documentos con Azure Form Recognizer.pptx
STEMWeek - Entendiendo tus documentos con Azure Form Recognizer.pptxSTEMWeek - Entendiendo tus documentos con Azure Form Recognizer.pptx
STEMWeek - Entendiendo tus documentos con Azure Form Recognizer.pptx
 
Student Summit - Conoce más sobre mi carrera en IA y Datos.pptx
Student Summit - Conoce más sobre mi carrera en IA y Datos.pptxStudent Summit - Conoce más sobre mi carrera en IA y Datos.pptx
Student Summit - Conoce más sobre mi carrera en IA y Datos.pptx
 
Gira Speaker Latam - IA y Accesibilidad con Pronunciation Assessment.pptx
Gira Speaker Latam - IA y Accesibilidad con Pronunciation Assessment.pptxGira Speaker Latam - IA y Accesibilidad con Pronunciation Assessment.pptx
Gira Speaker Latam - IA y Accesibilidad con Pronunciation Assessment.pptx
 
Build After Party Bolivia - Hugging Face on Azure.pptx
Build After Party Bolivia - Hugging Face on Azure.pptxBuild After Party Bolivia - Hugging Face on Azure.pptx
Build After Party Bolivia - Hugging Face on Azure.pptx
 
Microsoft Reactor - Creando un modelo de Regresión con Azure Machine Learnin...
Microsoft Reactor - Creando un modelo de Regresión con Azure Machine Learnin...Microsoft Reactor - Creando un modelo de Regresión con Azure Machine Learnin...
Microsoft Reactor - Creando un modelo de Regresión con Azure Machine Learnin...
 
Introduction to .NET MAUI.pdf
Introduction to .NET MAUI.pdfIntroduction to .NET MAUI.pdf
Introduction to .NET MAUI.pdf
 
SISWeek Creando un sistema de reconocimiento facial con Face API.pptx
SISWeek Creando un sistema de reconocimiento facial con Face API.pptxSISWeek Creando un sistema de reconocimiento facial con Face API.pptx
SISWeek Creando un sistema de reconocimiento facial con Face API.pptx
 
Azure Guatemala.pptx
Azure Guatemala.pptxAzure Guatemala.pptx
Azure Guatemala.pptx
 
Conoce las novedades de .NET MAUI en .NET 7.pptx
Conoce las novedades de .NET MAUI en .NET 7.pptxConoce las novedades de .NET MAUI en .NET 7.pptx
Conoce las novedades de .NET MAUI en .NET 7.pptx
 
GAIB Philippines - Tailoring OpenAI’s GPT-3 to suit your specific needs.pptx
GAIB Philippines - Tailoring OpenAI’s GPT-3 to suit your specific needs.pptxGAIB Philippines - Tailoring OpenAI’s GPT-3 to suit your specific needs.pptx
GAIB Philippines - Tailoring OpenAI’s GPT-3 to suit your specific needs.pptx
 
Power BI Summit 2023 - Embedding PowerBI reports in .NET MAUI mobile apps.pptx
Power BI Summit 2023 - Embedding PowerBI reports in .NET MAUI mobile apps.pptxPower BI Summit 2023 - Embedding PowerBI reports in .NET MAUI mobile apps.pptx
Power BI Summit 2023 - Embedding PowerBI reports in .NET MAUI mobile apps.pptx
 
Towards Responsible AI - Global AI Student Conference 2022.pptx
Towards Responsible AI - Global AI Student Conference 2022.pptxTowards Responsible AI - Global AI Student Conference 2022.pptx
Towards Responsible AI - Global AI Student Conference 2022.pptx
 
Mes de Datos Ciencia de Datos a otro nivel con Azure Machine Learning.pptx
Mes de Datos Ciencia de Datos a otro nivel con Azure Machine Learning.pptxMes de Datos Ciencia de Datos a otro nivel con Azure Machine Learning.pptx
Mes de Datos Ciencia de Datos a otro nivel con Azure Machine Learning.pptx
 
GAIB Germany - Tailoring OpenAI’s GPT-3 to suit your specific needs.pptx
GAIB Germany - Tailoring OpenAI’s GPT-3 to suit your specific needs.pptxGAIB Germany - Tailoring OpenAI’s GPT-3 to suit your specific needs.pptx
GAIB Germany - Tailoring OpenAI’s GPT-3 to suit your specific needs.pptx
 
Platzi Azure.pptx
Platzi Azure.pptxPlatzi Azure.pptx
Platzi Azure.pptx
 
GAIB Latam - Tailoring OpenAI’s GPT-3 to suit your specific needs.pptx
GAIB Latam - Tailoring OpenAI’s GPT-3 to suit your specific needs.pptxGAIB Latam - Tailoring OpenAI’s GPT-3 to suit your specific needs.pptx
GAIB Latam - Tailoring OpenAI’s GPT-3 to suit your specific needs.pptx
 
Virtual Azure Community Day - Workloads de búsqueda full-text Azure Search.pptx
Virtual Azure Community Day - Workloads de búsqueda full-text Azure Search.pptxVirtual Azure Community Day - Workloads de búsqueda full-text Azure Search.pptx
Virtual Azure Community Day - Workloads de búsqueda full-text Azure Search.pptx
 

Kürzlich hochgeladen

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Kürzlich hochgeladen (20)

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 

AI for Accessibility.pptx

Hinweis der Redaktion

  1. Practicing pronunciation and getting timely feedback are essential to improving language skills. Assessment is conventionally driven by experienced teachers, which normally takes a lot of time and great effort, and this makes high-quality assessment expensive for students. In this session we will see how we can make use of artificial intelligence to create a technological solution to support students in learning another language in a practical way and with objective feedback in real time.
  2. Microsoft's Azure Speech service provides speech-to-text, text-to-speech, and speech translation capabilities to developers. At Build 2020, Microsoft announced several new preview capabilities, including Pronunciation Assessment that can assess speech pronunciation and give speakers feedback on the accuracy and fluency of spoken audio. Microsoft is using this service for its PowerPoint Presenter Coach feature.
  3. For language learners, practicing pronunciation and getting timely feedback are essential to improving language skills.  Assessment is conventionally driven by experienced teachers, which typically takes a lot of time and great effort, making high-quality assessment expensive for students. 
  4. How to solve this problem? Pronunciation Assessment, a novel AI-powered speech capability, is able to make language assessment more engaging and accessible to students of all backgrounds.  Pronunciation Assessment, a feature of Speech in Azure Cognitive Services, provides subjective and objective feedback to language learners with computer-aided technology.  With Pronunciation Assessment, language learners can practice, get instant feedback, and improve their pronunciation. Online learning solution providers or educators can use the ability to assess the pronunciation of multiple speakers in real time.  Pronunciation Assessment provides various evaluation results in different granularities, from individual phonemes to full text input. At the phoneme level, it provides precision scores of each phoneme, helping students better understand the pronunciation details of their speech.  At the word level, it can automatically detect errors and provide an accuracy score simultaneously, providing more detailed information about the omission, repetition, insertions, and incorrect pronunciation in the given speech.  At the full-text level, it offers additional fluency and integrity scores: fluency indicates how closely speech matches a native speaker's use of silent pauses between words, and completeness indicates how many words are spoken in speech to reference text input.
  5. An aggregate overall score of Accuracy, Fluency, and Completeness is then provided to indicate the overall pronunciation quality of the given speech.  With these characteristics, students can easily know the weakness of their speech and improve with the objective objectives. 
  6. You can get pronunciation assessment scores for: Full text Words Groups of syllables Phonemes in SAPI or IPA format
  7. Pronunciation assessment can provide syllable-level assessment results. Grouping in syllables is more legible and aligned with speaking habits, as a word is typically pronounced syllable by syllable rather than phoneme by phoneme. For en-US locale, the phoneme name is provided together with the score, to help identify which phonemes were pronounced accurately or inaccurately. For other locales, you can only get the phoneme score.
  8. With spoken phonemes, you can get confidence scores indicating how likely the spoken phonemes matched the expected phonemes.
  9. When speech is recognized, you can request the pronunciation assessment results as SDK objects or a JSON string.
  10. The phoneme alphabet is IPA. The syllables are returned alongside phonemes for the same word. You can use the Offset and Duration values to align syllables with their corresponding phonemes. For example, the starting offset (11700000) of the second syllable ("loʊ") aligns with the third phoneme ("l"). There are five NBestPhonemes corresponding to the number of spoken phonemes requested. Within Phonemes, the most likely spoken phonemes was "ə" instead of the expected phoneme "ɛ". The expected phoneme "ɛ" only received a confidence score of 47. Other potential matches received confidence scores of 52, 17, and 2.