SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Easing Transcripts for MOOC
Videos with an ASR (Automated
Speech Recognition) System
Carlos Turró, Jorge Civera and Jaime Busquets
Universitat Politècnica de València
The result of not having a screwdriver
• Pain
• Frustration
• Select a different tool
How can I transcribe a video?
• Manually transcribing a video
takes 10 times the length of the
video (RTF)
• Boring
• It’s worse if you don’t know
about the topic of the video
Automated Speech Recognition (ASR)
• How good is it?
• Will it recognize my special
words?
• Will it really help me?
UPValenciaX MOOCs - Transcribing
https://media.upv.es/?id=b444d12e-db23-9a4f-9b3b-d1d9275d4cb4
UPValenciaX MOOCs - Transcribing
https://www.youtube.com/watch?v=dKrbzX5NjTs
UPValenciaX MOOCs - Transcribing
30 MOOC courses
UPValenciaX MOOCs -Transcribing
• API
• Just after
recording
ASR
• RTF 3
• Teaching
Assistants
Review
UPValenciaX MOOCs –Transcribing
• API
• Just after
recording
ASR
• RTF 3
• Teaching
Assistants
Review
70% less time
Transcription and Translation Platform
• Post-editing web interface (in HTML5)
Crowdsourcing
• We are crowdsourcing the on-campus courses using our own Paella
video player.
How to get good transcription quality
•Transcription systems learn to transcribe from examples
–At least 50 hours of videos (audio) in the source language previously transcribed
to learn the acoustic model
–Texts in millions of words to learn the language model
Language Videos (hours) Text (Mwords)
Dutch 532 628
English 620 464000
Estonian 130 410
French 88 1800
German 36 135
Portuguese 54 573
Italian 54 868
Slovene 27 224
Spanish 128 654
How to get good transcription quality (II)
•Adaptation of transcription systems to the specific videos is key for
high accuracy
•Availability of videos manually transcribed with similar acoustic conditions
•Availability of text resources related to the video in question
· Title is used to retrieve related documents
· Slides contain most of the special words used by the lecturer
· Documents: text content from the course, additional text resources (bibliography)
• Sound quality of the video has a direct relationship with quality
• No noise, no background music, please
Try yourself
http://mllp.upv.es
Our next step
Translations !!
Conclusions
• ASR technology is enough mature to help a lot in captioning
• However, there should be a review phase
• Quality can be enhanced by providing transcribed videos
• At UP Valencia we got transcribed our 30 MOOC courses with 3x TA
cost 
Thanks!
Questions?
Why transcription of MOOC video files?
• Accessibility
Why transcription of MOOC video files?
• Accessibility
• Searching into a video file
• Searching into a video repository
• Topic identification
• …and much more
Measuring Quality: Word Error rate
Where
S is the number of word substitutions,
D is the number of word deletions,
I is the number of word insertions,
N is the number of words in the reference text
Measuring Quality: Word Error Rate
Language WER
English
Dutch
20.8
24.5
Italian 17.7
Spanish 14.4
Estonian 27.1
French 22.7
Attributions
• Fingerspelling & tools Wikipedia
• Bored https://www.flickr.com/photos/left-hand/3132070992/
• Siri https://www.flickr.com/photos/smemon/8070397213/

Weitere ähnliche Inhalte

Andere mochten auch

Recruitment_Manager
Recruitment_ManagerRecruitment_Manager
Recruitment_ManagerChandru Gn
 
Türkiyeden vize istemeyen ülkelerin listesi
Türkiyeden vize istemeyen ülkelerin listesiTürkiyeden vize istemeyen ülkelerin listesi
Türkiyeden vize istemeyen ülkelerin listesiatakan555
 
Using Fundraising Data to Increase Giving
Using Fundraising Data to Increase GivingUsing Fundraising Data to Increase Giving
Using Fundraising Data to Increase GivingWest Muse
 
Agile Failure Patterns In Organisations – Leancamp Berlin 2016
Agile Failure Patterns In Organisations – Leancamp Berlin 2016Agile Failure Patterns In Organisations – Leancamp Berlin 2016
Agile Failure Patterns In Organisations – Leancamp Berlin 2016Stefan Wolpers
 
Contabilidad gubernamental
Contabilidad gubernamentalContabilidad gubernamental
Contabilidad gubernamentalthaLia_mf
 

Andere mochten auch (9)

Keith's Resume
Keith's ResumeKeith's Resume
Keith's Resume
 
Recruitment_Manager
Recruitment_ManagerRecruitment_Manager
Recruitment_Manager
 
Türkiyeden vize istemeyen ülkelerin listesi
Türkiyeden vize istemeyen ülkelerin listesiTürkiyeden vize istemeyen ülkelerin listesi
Türkiyeden vize istemeyen ülkelerin listesi
 
Using Fundraising Data to Increase Giving
Using Fundraising Data to Increase GivingUsing Fundraising Data to Increase Giving
Using Fundraising Data to Increase Giving
 
Job Fair Flyer
Job Fair FlyerJob Fair Flyer
Job Fair Flyer
 
Sslideshare
SslideshareSslideshare
Sslideshare
 
Agile Failure Patterns In Organisations – Leancamp Berlin 2016
Agile Failure Patterns In Organisations – Leancamp Berlin 2016Agile Failure Patterns In Organisations – Leancamp Berlin 2016
Agile Failure Patterns In Organisations – Leancamp Berlin 2016
 
G3 preescolar
G3 preescolarG3 preescolar
G3 preescolar
 
Contabilidad gubernamental
Contabilidad gubernamentalContabilidad gubernamental
Contabilidad gubernamental
 

Ähnlich wie Easing transcripts for mooc videos with an asr lwmoo cs

Using audio and video well in your moodle course
Using audio and video well in your moodle courseUsing audio and video well in your moodle course
Using audio and video well in your moodle courseColin Simpson
 
EMMA presentation - Alfons Juan - Language technologies for Education: recent...
EMMA presentation - Alfons Juan - Language technologies for Education: recent...EMMA presentation - Alfons Juan - Language technologies for Education: recent...
EMMA presentation - Alfons Juan - Language technologies for Education: recent...EUmoocs
 
Enriching video content for educational uses with Paella Player
Enriching video content for educational uses with Paella PlayerEnriching video content for educational uses with Paella Player
Enriching video content for educational uses with Paella PlayerCarlos Turró Ribalta
 
Video is key for Flipped Learning: the experience at UP Valencia
Video is key for Flipped Learning: the experience at UP ValenciaVideo is key for Flipped Learning: the experience at UP Valencia
Video is key for Flipped Learning: the experience at UP ValenciaCarlos Turró Ribalta
 
Multiply your reach
Multiply your reachMultiply your reach
Multiply your reachsrbhbaid
 
Survey says! Uncovering faculty support needs #DTL13
Survey says!  Uncovering faculty support needs #DTL13Survey says!  Uncovering faculty support needs #DTL13
Survey says! Uncovering faculty support needs #DTL13Tanya Joosten
 
Reinventing the lecture: how video technology and learning analytics are tran...
Reinventing the lecture: how video technology and learning analytics are tran...Reinventing the lecture: how video technology and learning analytics are tran...
Reinventing the lecture: how video technology and learning analytics are tran...John Couperthwaite
 
Newbutt podcasting to support
Newbutt podcasting to supportNewbutt podcasting to support
Newbutt podcasting to supportMEL SIG
 
Panopto workshop fall 2016
Panopto workshop fall 2016Panopto workshop fall 2016
Panopto workshop fall 2016Ashley Turner
 
REC:all Exploring the potential of lecture capture in universities and higher...
REC:all Exploring the potential of lecture capture in universities and higher...REC:all Exploring the potential of lecture capture in universities and higher...
REC:all Exploring the potential of lecture capture in universities and higher...MEDEA Awards
 
Instructional design in massive open online course (moocs)
Instructional design in massive open online course (moocs)Instructional design in massive open online course (moocs)
Instructional design in massive open online course (moocs)Eisa Rezaei
 
Personal capture.quick overview
Personal capture.quick overviewPersonal capture.quick overview
Personal capture.quick overviewCSaC
 
LITE 2016 – Administrate and Blended Learning; a Perfect Match [Rico Page & J...
LITE 2016 – Administrate and Blended Learning; a Perfect Match [Rico Page & J...LITE 2016 – Administrate and Blended Learning; a Perfect Match [Rico Page & J...
LITE 2016 – Administrate and Blended Learning; a Perfect Match [Rico Page & J...getadministrate
 

Ähnlich wie Easing transcripts for mooc videos with an asr lwmoo cs (20)

Using audio and video well in your moodle course
Using audio and video well in your moodle courseUsing audio and video well in your moodle course
Using audio and video well in your moodle course
 
Captioning Video
Captioning VideoCaptioning Video
Captioning Video
 
EMMA presentation - Alfons Juan - Language technologies for Education: recent...
EMMA presentation - Alfons Juan - Language technologies for Education: recent...EMMA presentation - Alfons Juan - Language technologies for Education: recent...
EMMA presentation - Alfons Juan - Language technologies for Education: recent...
 
Enriching video content for educational uses with Paella Player
Enriching video content for educational uses with Paella PlayerEnriching video content for educational uses with Paella Player
Enriching video content for educational uses with Paella Player
 
Video is key for Flipped Learning: the experience at UP Valencia
Video is key for Flipped Learning: the experience at UP ValenciaVideo is key for Flipped Learning: the experience at UP Valencia
Video is key for Flipped Learning: the experience at UP Valencia
 
Multiply your reach
Multiply your reachMultiply your reach
Multiply your reach
 
Survey says! Uncovering faculty support needs #DTL13
Survey says!  Uncovering faculty support needs #DTL13Survey says!  Uncovering faculty support needs #DTL13
Survey says! Uncovering faculty support needs #DTL13
 
MOOCs
MOOCsMOOCs
MOOCs
 
QFARC_14_1067
QFARC_14_1067QFARC_14_1067
QFARC_14_1067
 
Reinventing the lecture: how video technology and learning analytics are tran...
Reinventing the lecture: how video technology and learning analytics are tran...Reinventing the lecture: how video technology and learning analytics are tran...
Reinventing the lecture: how video technology and learning analytics are tran...
 
Fpvp
FpvpFpvp
Fpvp
 
Training Heritage Speakers: A Journey Worth Taking
Training Heritage Speakers: A Journey Worth TakingTraining Heritage Speakers: A Journey Worth Taking
Training Heritage Speakers: A Journey Worth Taking
 
Newbutt podcasting to support
Newbutt podcasting to supportNewbutt podcasting to support
Newbutt podcasting to support
 
Newbutt podcasting to support
Newbutt podcasting to supportNewbutt podcasting to support
Newbutt podcasting to support
 
Panopto workshop fall 2016
Panopto workshop fall 2016Panopto workshop fall 2016
Panopto workshop fall 2016
 
REC:all Exploring the potential of lecture capture in universities and higher...
REC:all Exploring the potential of lecture capture in universities and higher...REC:all Exploring the potential of lecture capture in universities and higher...
REC:all Exploring the potential of lecture capture in universities and higher...
 
Instructional design in massive open online course (moocs)
Instructional design in massive open online course (moocs)Instructional design in massive open online course (moocs)
Instructional design in massive open online course (moocs)
 
Personal capture.quick overview
Personal capture.quick overviewPersonal capture.quick overview
Personal capture.quick overview
 
LITE 2016 – Administrate and Blended Learning; a Perfect Match [Rico Page & J...
LITE 2016 – Administrate and Blended Learning; a Perfect Match [Rico Page & J...LITE 2016 – Administrate and Blended Learning; a Perfect Match [Rico Page & J...
LITE 2016 – Administrate and Blended Learning; a Perfect Match [Rico Page & J...
 
Adding audio lectures
Adding audio lecturesAdding audio lectures
Adding audio lectures
 

Mehr von Carlos Turró Ribalta

User derived videos in opencast. a first draft from upv
User derived videos in opencast. a first draft from upvUser derived videos in opencast. a first draft from upv
User derived videos in opencast. a first draft from upvCarlos Turró Ribalta
 
Hacia una nueva docencia ... caso UPV
Hacia una nueva docencia ... caso UPVHacia una nueva docencia ... caso UPV
Hacia una nueva docencia ... caso UPVCarlos Turró Ribalta
 
Pedagogical innovation at Universitat Politècnica de València
Pedagogical innovation at Universitat Politècnica de ValènciaPedagogical innovation at Universitat Politècnica de València
Pedagogical innovation at Universitat Politècnica de ValènciaCarlos Turró Ribalta
 
Paella player 4 - Presentation at Opencast Summit 2015 at Manchester
Paella player 4 - Presentation at Opencast Summit 2015 at ManchesterPaella player 4 - Presentation at Opencast Summit 2015 at Manchester
Paella player 4 - Presentation at Opencast Summit 2015 at ManchesterCarlos Turró Ribalta
 
Open edx developing x-blocks @ upvalencia (4)
Open edx   developing x-blocks @ upvalencia (4)Open edx   developing x-blocks @ upvalencia (4)
Open edx developing x-blocks @ upvalencia (4)Carlos Turró Ribalta
 

Mehr von Carlos Turró Ribalta (8)

User derived videos in opencast. a first draft from upv
User derived videos in opencast. a first draft from upvUser derived videos in opencast. a first draft from upv
User derived videos in opencast. a first draft from upv
 
Paella player and Opencast
Paella player and OpencastPaella player and Opencast
Paella player and Opencast
 
Hacia una nueva docencia ... caso UPV
Hacia una nueva docencia ... caso UPVHacia una nueva docencia ... caso UPV
Hacia una nueva docencia ... caso UPV
 
Paella player 5
Paella player 5Paella player 5
Paella player 5
 
Pedagogical innovation at Universitat Politècnica de València
Pedagogical innovation at Universitat Politècnica de ValènciaPedagogical innovation at Universitat Politècnica de València
Pedagogical innovation at Universitat Politècnica de València
 
Flipped Classroom project at UPV
Flipped Classroom project at UPVFlipped Classroom project at UPV
Flipped Classroom project at UPV
 
Paella player 4 - Presentation at Opencast Summit 2015 at Manchester
Paella player 4 - Presentation at Opencast Summit 2015 at ManchesterPaella player 4 - Presentation at Opencast Summit 2015 at Manchester
Paella player 4 - Presentation at Opencast Summit 2015 at Manchester
 
Open edx developing x-blocks @ upvalencia (4)
Open edx   developing x-blocks @ upvalencia (4)Open edx   developing x-blocks @ upvalencia (4)
Open edx developing x-blocks @ upvalencia (4)
 

Kürzlich hochgeladen

Configuration of IoT devices - Systems managament
Configuration of IoT devices - Systems managamentConfiguration of IoT devices - Systems managament
Configuration of IoT devices - Systems managamentBharaniDharan195623
 
Crushers to screens in aggregate production
Crushers to screens in aggregate productionCrushers to screens in aggregate production
Crushers to screens in aggregate productionChinnuNinan
 
Ch10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfCh10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfChristianCDAM
 
multiple access in wireless communication
multiple access in wireless communicationmultiple access in wireless communication
multiple access in wireless communicationpanditadesh123
 
Main Memory Management in Operating System
Main Memory Management in Operating SystemMain Memory Management in Operating System
Main Memory Management in Operating SystemRashmi Bhat
 
Industrial Safety Unit-IV workplace health and safety.ppt
Industrial Safety Unit-IV workplace health and safety.pptIndustrial Safety Unit-IV workplace health and safety.ppt
Industrial Safety Unit-IV workplace health and safety.pptNarmatha D
 
Indian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptIndian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptMadan Karki
 
Input Output Management in Operating System
Input Output Management in Operating SystemInput Output Management in Operating System
Input Output Management in Operating SystemRashmi Bhat
 
Transport layer issues and challenges - Guide
Transport layer issues and challenges - GuideTransport layer issues and challenges - Guide
Transport layer issues and challenges - GuideGOPINATHS437943
 
Engineering Drawing section of solid
Engineering Drawing     section of solidEngineering Drawing     section of solid
Engineering Drawing section of solidnamansinghjarodiya
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptSAURABHKUMAR892774
 
System Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingSystem Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingBootNeck1
 
welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the weldingMuhammadUzairLiaqat
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfAsst.prof M.Gokilavani
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)Dr SOUNDIRARAJ N
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleAlluxio, Inc.
 
Katarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School CourseKatarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School Coursebim.edu.pl
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 

Kürzlich hochgeladen (20)

Configuration of IoT devices - Systems managament
Configuration of IoT devices - Systems managamentConfiguration of IoT devices - Systems managament
Configuration of IoT devices - Systems managament
 
Crushers to screens in aggregate production
Crushers to screens in aggregate productionCrushers to screens in aggregate production
Crushers to screens in aggregate production
 
Ch10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfCh10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdf
 
multiple access in wireless communication
multiple access in wireless communicationmultiple access in wireless communication
multiple access in wireless communication
 
Main Memory Management in Operating System
Main Memory Management in Operating SystemMain Memory Management in Operating System
Main Memory Management in Operating System
 
Industrial Safety Unit-IV workplace health and safety.ppt
Industrial Safety Unit-IV workplace health and safety.pptIndustrial Safety Unit-IV workplace health and safety.ppt
Industrial Safety Unit-IV workplace health and safety.ppt
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
Indian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptIndian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.ppt
 
Designing pile caps according to ACI 318-19.pptx
Designing pile caps according to ACI 318-19.pptxDesigning pile caps according to ACI 318-19.pptx
Designing pile caps according to ACI 318-19.pptx
 
Input Output Management in Operating System
Input Output Management in Operating SystemInput Output Management in Operating System
Input Output Management in Operating System
 
Transport layer issues and challenges - Guide
Transport layer issues and challenges - GuideTransport layer issues and challenges - Guide
Transport layer issues and challenges - Guide
 
Engineering Drawing section of solid
Engineering Drawing     section of solidEngineering Drawing     section of solid
Engineering Drawing section of solid
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.ppt
 
System Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingSystem Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event Scheduling
 
welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the welding
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at Scale
 
Katarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School CourseKatarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School Course
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 

Easing transcripts for mooc videos with an asr lwmoo cs

  • 1. Easing Transcripts for MOOC Videos with an ASR (Automated Speech Recognition) System Carlos Turró, Jorge Civera and Jaime Busquets Universitat Politècnica de València
  • 2.
  • 3.
  • 4.
  • 5.
  • 6. The result of not having a screwdriver • Pain • Frustration • Select a different tool
  • 7. How can I transcribe a video? • Manually transcribing a video takes 10 times the length of the video (RTF) • Boring • It’s worse if you don’t know about the topic of the video
  • 8. Automated Speech Recognition (ASR) • How good is it? • Will it recognize my special words? • Will it really help me?
  • 9. UPValenciaX MOOCs - Transcribing https://media.upv.es/?id=b444d12e-db23-9a4f-9b3b-d1d9275d4cb4
  • 10. UPValenciaX MOOCs - Transcribing https://www.youtube.com/watch?v=dKrbzX5NjTs
  • 11. UPValenciaX MOOCs - Transcribing 30 MOOC courses
  • 12. UPValenciaX MOOCs -Transcribing • API • Just after recording ASR • RTF 3 • Teaching Assistants Review
  • 13. UPValenciaX MOOCs –Transcribing • API • Just after recording ASR • RTF 3 • Teaching Assistants Review 70% less time
  • 14. Transcription and Translation Platform • Post-editing web interface (in HTML5)
  • 15. Crowdsourcing • We are crowdsourcing the on-campus courses using our own Paella video player.
  • 16. How to get good transcription quality •Transcription systems learn to transcribe from examples –At least 50 hours of videos (audio) in the source language previously transcribed to learn the acoustic model –Texts in millions of words to learn the language model Language Videos (hours) Text (Mwords) Dutch 532 628 English 620 464000 Estonian 130 410 French 88 1800 German 36 135 Portuguese 54 573 Italian 54 868 Slovene 27 224 Spanish 128 654
  • 17. How to get good transcription quality (II) •Adaptation of transcription systems to the specific videos is key for high accuracy •Availability of videos manually transcribed with similar acoustic conditions •Availability of text resources related to the video in question · Title is used to retrieve related documents · Slides contain most of the special words used by the lecturer · Documents: text content from the course, additional text resources (bibliography) • Sound quality of the video has a direct relationship with quality • No noise, no background music, please
  • 20. Conclusions • ASR technology is enough mature to help a lot in captioning • However, there should be a review phase • Quality can be enhanced by providing transcribed videos • At UP Valencia we got transcribed our 30 MOOC courses with 3x TA cost 
  • 22.
  • 23. Why transcription of MOOC video files? • Accessibility
  • 24. Why transcription of MOOC video files? • Accessibility • Searching into a video file • Searching into a video repository • Topic identification • …and much more
  • 25. Measuring Quality: Word Error rate Where S is the number of word substitutions, D is the number of word deletions, I is the number of word insertions, N is the number of words in the reference text
  • 26. Measuring Quality: Word Error Rate Language WER English Dutch 20.8 24.5 Italian 17.7 Spanish 14.4 Estonian 27.1 French 22.7
  • 27. Attributions • Fingerspelling & tools Wikipedia • Bored https://www.flickr.com/photos/left-hand/3132070992/ • Siri https://www.flickr.com/photos/smemon/8070397213/