The document presents ACAV, a project that aims to make videos on the Web more accessible through collaborative annotations. The consortium includes the video-sharing company Dailymotion and research groups working on multimedia, the semantic web, and disabilities. The goals are to increase the number of accessible videos through both automatic and manual annotation, and to render annotations in multiple formats depending on users' needs. Preliminary studies explored requirements such as different annotation granularities and outputs, and the use of auditory icons to convey a video's editing rhythm to blind users. The proposed ACAV system will include annotation schemas, a social network for annotations, integrated speech technologies, and authoring/rendering tools.
Towards Collaborative Annotation for Video Accessibility
1. Towards Collaborative Annotation for Video Accessibility
Pierre-Antoine Champin, Benoît Encelle, Magali O. Beldame, Yannick Prié, Nick Evans and Raphaël Troncy <raphael.troncy@eurecom.fr>
2. The consortium
Dailymotion (Paris, FR): video sharing website
Promotes HTML5 using the video tag, http://openvideo.dailymotion.com/
LIRIS (Lyon, FR): CS research group
Silex team: expertise in semantic web, annotation models, video annotation and HCI for disabled people
EURECOM (Sophia Antipolis, FR): research center in communications systems
Multimedia team: expertise in multimedia analysis (speaker diarization/recognition, speech recognition) and semantic web
INS HEA + school (Lyon, FR)
Experience with physical disabilities: blindness, visual impairment, deafness and hearing loss
Blind and deaf high-school students
26/04/2010 - Towards Collaborative Annotations for Video Accessibility - W4A 2010, Raleigh, USA -2
3. Goals and Motivations
What is required to make video accessible on the Web?
How to increase the number of accessible videos?
Technologies:
Annotating: automatic (speech transcription) and manual (social
collaborative annotation tool)
Addressing: pointing to, retrieving, transmitting only parts of media
Rendering: video visualization for the impaired, Braille output
Expected benefits for:
disabled people, getting better access to videos
video providers, reaching a wider audience
the Web in general, through semantic annotations
4. Accessibility Features for Visually Impaired and Blind People
Annotation timeline (example):
Man's actions: puts on his shoes; walks in the street
Son's actions: looks at his mother
Characters: the mother, her son; the son, the man; the man and his friend
Scenery: in the shop; in the street
Multimodal presentation of annotations, depending on video context and user preferences
Output channels: audio track, auditory icons, audio description, Braille
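The mapping from a user profile to the output channels listed above can be sketched as follows. This is an illustrative sketch, not ACAV's actual logic; the profile keys and channel names are assumptions taken from the slide's examples.

```python
# Hypothetical sketch: choose annotation output channels from a user
# profile, following the slide's examples (audio track, auditory icons,
# audio description, Braille). All names are illustrative assumptions.

def select_modalities(profile):
    """Map a user profile to a list of output channels."""
    outputs = ["audio track"]  # the original soundtrack is always kept
    if profile.get("blind"):
        outputs += ["auditory icons", "audio description"]
        if profile.get("braille_reader"):
            outputs.append("braille")
    return outputs

print(select_modalities({"blind": True, "braille_reader": True}))
```

The same dispatch pattern would apply to the deaf-user channels on the next slide (subtitles, surtitles), with different profile keys.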
5. Accessibility Features for Deaf People
Annotation timeline (example):
Mother's dialogues: How are you?
Son's dialogues: Hi mom; Fine and you?
Sound: car horn
Presentation of annotations, depending on video context and user preferences
Output channels: video track, subtitles, surtitles
6. Producing Video Annotations
Automatic annotations:
Speaker diarization: who spoke and when?
Speech recognition: transcription
Example output:
Mother: How are you ?
Son: Ho mom Fine (recognition errors)
Social annotations:
Annotation corrections and enhancement
Audio description (for visually impaired)
Example output (after correction):
Mother: How are you ?
Son: Hi mom Fine and you ?
Sound: car horn
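The pipeline on this slide, automatic transcription followed by social correction, can be sketched with a minimal timed-annotation structure. The field names are assumptions for illustration, not ACAV's actual annotation schema.

```python
# Illustrative sketch: speech recognition yields timed, speaker-attributed
# annotations; users then correct them collaboratively. Field names and
# timestamps are assumptions, not ACAV's schema.

ASR_OUTPUT = [
    {"speaker": "Mother", "begin": 0.0, "end": 1.2, "text": "How are you ?"},
    {"speaker": "Son",    "begin": 1.3, "end": 2.4, "text": "Ho mom Fine"},  # ASR error
]

def apply_correction(annotations, index, new_text):
    """Replace one annotation's text, as a social correction would."""
    corrected = [dict(a) for a in annotations]  # keep the automatic output intact
    corrected[index]["text"] = new_text
    return corrected

fixed = apply_correction(ASR_OUTPUT, 1, "Hi mom Fine and you ?")
print(fixed[1]["text"])
```

Keeping the automatic output immutable and storing corrections separately is one plausible design for reconciling many contributors' edits.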
7. Braille Rendering
The Advene prototype emulation views:
Enriched media player
Timeline with typed annotations
8. Preliminary study (1/2)
Semi-structured interviews with blind users (n=2):
Participants' habits when watching programs with audio description
The audio description process
Multimodal presentations of descriptions
Requirements:
R1: generate additional descriptions and provide unobtrusive access to them (tactile access for blind Braille readers)
R2: provide descriptions at various levels of granularity and verbosity
R3: use the system's multimodal output to provide two or more descriptions (e.g. speech synthesis and Braille display)
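Requirement R2 above can be sketched as storing each description at several verbosity levels and picking the richest one a user accepts. The levels and example texts are illustrative assumptions, not study data.

```python
# Sketch for requirement R2: descriptions stored at several verbosity
# levels; pick the most detailed one within the user's setting.
# Levels and texts are illustrative assumptions.

DESCRIPTIONS = {
    1: "A man in a street.",
    2: "A man puts on his shoes and walks in the street.",
    3: "A man hurriedly puts on his shoes and walks out into the street.",
}

def pick_description(descriptions, max_verbosity):
    """Return the richest description not exceeding the user's verbosity setting."""
    usable = [level for level in descriptions if level <= max_verbosity]
    return descriptions[max(usable)] if usable else None

print(pick_description(DESCRIPTIONS, 2))
```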
9. Preliminary study (2/2)
Goal: see whether auditory icons can convey the rhythm of a movie's editing to blind users
e.g. the sound of a locomotive arriving from the right to convey a tracking shot (travelling) from right to left
Experiment and questionnaires (n=16+9):
Viewing 5 minutes of Ratatouille through headsets, http://www.imdb.com/title/tt0382932/
Results:
Rhythm and movie dynamics were better perceived
Auditory icons are useful, but their number must be limited (5 max) and they must be clearly distinct from the movie's main soundtrack
Editing cues: scene changes, camera movements, flashbacks (e.g. NCIS)
Audio zoom (e.g. Survivor)
10. ACAV Architecture
Benchmarking of speech recognition engines: Sphinx, HTK, Julius
11. Media Fragments URI
Provide URI-based mechanisms for uniquely identifying fragments of media objects on the Web, such as video, audio, and images.
Photo credit: Robert Freund
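In the W3C Media Fragments URI syntax, a temporal fragment such as `http://example.com/video.ogv#t=10,20` addresses seconds 10 to 20 of a video. Below is a minimal sketch of a parser for the common `t=start,end` case with plain NPT seconds; the full specification also covers clock times, other time formats, and spatial (`xywh=`) and track fragments, which this sketch does not handle.

```python
# Minimal sketch: extract (start, end) seconds from a Media Fragments
# temporal fragment of the form #t=start,end (NPT seconds only).
from urllib.parse import urlparse

def parse_temporal_fragment(uri):
    """Return (start, end) in seconds for a #t= fragment, else None.

    A missing start defaults to 0.0; a missing end is returned as None.
    """
    fragment = urlparse(uri).fragment
    for part in fragment.split("&"):
        if part.startswith("t="):
            start, _, end = part[2:].partition(",")
            return (float(start) if start else 0.0,
                    float(end) if end else None)
    return None

print(parse_temporal_fragment("http://example.com/video.ogv#t=10,20"))  # (10.0, 20.0)
```

Such fragment addressing is what lets an annotation point at, retrieve, or transmit only the relevant part of a media object, as motivated on the "Goals" slide.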
13. Conclusion
ACAV will bring:
Dedicated annotation schemas for video accessibility
Social network model for video annotations
Web integration of state-of-the-art speech technologies
GUI models for authoring and rendering video
annotations
Media Fragments reference implementation
Open source Braille plugin for the most widely used Web browsers
http://www.acavideo.fr/