New Life for Old Media
Investigations into Speech Synthesis and Deep Learning-based Colorization for
Audiovisual Archive
Rudy Marsman, Victor de Boer, Themistoklis Karavellas, Johan Oomen
Netherlands Institute for Sound and Vision (NISV)
70% audio-visual heritage material
More than 1,000,000 hrs of
TV (public broadcasters)
Radio, Music, Documentaries, Film, Commercials,
etc.
Photographs, objects, …
CC BY-SA as preferred license
3000 items “Internet Quality”
Polygoon newsreels
Supporting a National and
European Audiovisual Commons
Public outreach by embracing
new technologies and
‘participatory culture’
Openbeelden.nl / openimages.eu
Explore AI techniques to enrich this archival
material to allow for new types of engagement
1. Text-to-speech engine based on a limited corpus of a single narrator
2. Colorization of old black-and-white video footage
Philip Bloemendal
Famous anchorman
Iconic voice
tiny.cc/voiceNL
(not a virus)
Limited Domain Speech Synthesis
Can the current corpus of audio recordings
of Bloemendal be used to construct a TTS
engine?
• What percentage of the Dutch language can be
generated with the current corpus?
• What can we do to improve?
• How recognizable is the text-to-speech output
as Philip Bloemendal?
• How understandable are the constructed
audio files?
Slot-and-filler Text-to-speech
Text: The Dutch football team played Germany
Audio: the.wav dutch.wav football.wav
Spoken Language Elements Repository (35,000 words),
built from 3,300 newsreels using speech recognition
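The lookup step of the slot-and-filler approach can be sketched roughly as follows. This is a minimal illustration, not the project's code: the `index` dictionary, the clip file names beyond those on the slide, and the function name `plan_utterance` are assumptions made for the example.

```python
from pathlib import Path


def plan_utterance(sentence, index):
    """Pair each word of the sentence with its recorded clip, or None if no clip exists."""
    words = sentence.lower().split()
    return [(word, index.get(word)) for word in words]


# Hypothetical word -> clip index; the real repository holds about 35,000 words.
index = {
    "the": Path("the.wav"),
    "dutch": Path("dutch.wav"),
    "football": Path("football.wav"),
    "played": Path("played.wav"),    # assumed for illustration
    "germany": Path("germany.wav"),  # assumed for illustration
}

plan = plan_utterance("The Dutch football team played Germany", index)
missing = [word for word, clip in plan if clip is None]
```

Words that end up in `missing` are the ones the coverage-expansion strategies would need to handle before the clips can be concatenated.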
How to expand the coverage of the index?
• Many (contemporary) words have not
been pronounced by Philip Bloemendal
• Multiple strategies
– Change format (lowercase, diaeresis)
– Numbers
– Finding synonyms
– Decompounding
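The "change format" strategy might look like the sketch below, which tries progressively normalised spellings of a word against the index. It covers only lowercasing and diaeresis removal; the function names and the sample index are assumptions for illustration.

```python
import unicodedata


def strip_diacritics(word):
    """Remove combining marks, so that e.g. 'ö' becomes 'o'."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def lookup_variants(word):
    """Yield candidate spellings, from most to least faithful to the input."""
    seen = set()
    candidates = (
        word,
        word.lower(),
        strip_diacritics(word),
        strip_diacritics(word.lower()),
    )
    for candidate in candidates:
        if candidate not in seen:
            seen.add(candidate)
            yield candidate


def find_in_index(word, index):
    """Return the first variant of `word` present in the index, or None."""
    for candidate in lookup_variants(word):
        if candidate in index:
            return candidate
    return None


# Hypothetical index of recorded words.
recorded = {"coordinatie", "belgie"}
match = find_in_index("Coördinatie", recorded)
```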
Finding Synonyms
• Open Dutch Wordnet
Dutch lexical semantic database
(Postma et al. 2016)
• Yields synsets
(e.g. Hoofdmeester -> Rector, Schoolhoofd)
• Computationally expensive lookup
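A synonym fallback could be sketched as below. The `synsets` dictionary stands in for a lookup in Open Dutch Wordnet; it is a hypothetical, hand-written mapping and does not reflect that database's actual API or contents beyond the example on the slide.

```python
def synonym_fallback(word, index, synsets):
    """Return the word if it is recorded, else the first recorded synonym, else None."""
    if word in index:
        return word
    for synonym in synsets.get(word, ()):
        if synonym in index:
            return synonym
    return None


# Hypothetical data: one synset taken from the slide, plus an assumed index.
synsets = {"hoofdmeester": ["rector", "schoolhoofd"]}
recorded = {"schoolhoofd", "school", "de"}

replacement = synonym_fallback("hoofdmeester", recorded, synsets)
```

Caching the result per word would be one way to limit the cost of repeated lookups.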
Decompounding
• Dutch allows words to be compounded;
each compound counts as a distinct
word in the corpus
• Decompounding is
computationally expensive (for
large corpora, long words)
• Constructed bigrams and trigrams
School, hoofd -> Schoolhoofd
Regen, water -> regenwater
Staat, hoofd -> StaatShoofd
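One way to split a compound into recorded words is a memoised recursive search that also allows a linking "s" between parts, as in Staat + s + hoofd. The sketch below is an assumption about how this could be done; it is the reverse of the bigram and trigram construction named on the slide, and the vocabulary and `min_part` threshold are invented for the example.

```python
from functools import lru_cache


def decompound(word, vocabulary, min_part=3):
    """Split `word` into known parts; return the list of parts, or None if no split exists."""
    word = word.lower()

    @lru_cache(maxsize=None)
    def split(start):
        if start == len(word):
            return ()
        # Try the longest candidate part first.
        for end in range(len(word), start + min_part - 1, -1):
            part = word[start:end]
            if part not in vocabulary:
                continue
            next_starts = [end]
            # Allow a linking 's' only when more of the word follows it.
            if end + 1 < len(word) and word[end] == "s":
                next_starts.append(end + 1)
            for next_start in next_starts:
                rest = split(next_start)
                if rest is not None:
                    return (part,) + rest
        return None

    parts = split(0)
    return list(parts) if parts is not None else None


# Hypothetical vocabulary of recorded words.
vocabulary = frozenset({"school", "hoofd", "regen", "water", "staat"})
```

Memoising on the start position keeps the search from re-exploring the same suffix, which matters for long words.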
Evaluation
4 corpora to test against
• News articles (same domain, different time) | 50 articles | 2,743 unique words
• 1970s news articles (same domain, same time) | 50 articles | 16,191 words
• E-books (different domain, various times) | 6 books | 2,657 words
• Tweets (different domain, different time) | 1,000 tweets | 27,180 words
• Evaluation
– Number of distinct words
– Number of sentences
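The two evaluation measures can be computed along these lines. This is a sketch under assumptions: the tokeniser, the function names, and the sample data are illustrative, and a sentence counts as covered only when every one of its words is in the index.

```python
import re


def tokenize(sentence):
    """Lowercase the sentence and keep runs of letters (including accented ones)."""
    return re.findall(r"[^\W\d_]+", sentence.lower())


def coverage(sentences, index):
    """Return the share of distinct words and of whole sentences found in the index."""
    tokenized = [tokenize(s) for s in sentences]
    distinct = {word for words in tokenized for word in words}
    covered_words = distinct & set(index)
    covered_sentences = [
        words for words in tokenized
        if words and all(word in index for word in words)
    ]
    return {
        "word_coverage": len(covered_words) / len(distinct) if distinct else 0.0,
        "sentence_coverage": len(covered_sentences) / len(tokenized) if tokenized else 0.0,
    }


# Hypothetical test corpus and index.
sample = ["De school is open", "De rector belt"]
recorded = {"de", "school", "is", "open"}
scores = coverage(sample, recorded)
```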
Results (words): Coverage
How recognisable are sentences?
• 8 people tested the software
• Philip was recognized (or ‘that news guy’)
• Words with more consonants were easier to recognize
• Recognition was better when users entered their own sentences
• Recognition was worse when sentences were played without subtitles
• Speed of software / GUI limited testing capabilities
The use of Deep Neural Networks in colorizing video
Neural Networks
Recent progress in computational power has made the implementation
of Deep Neural Nets possible
Neural Networks trained on a large training set can accurately
make predictions on real-world examples
Existing Literature
Zhang et al. (2016) trained a neural net
on over a million images for colorization
http://richzhang.github.io/colorization/
Implementation on Video
• Extract individual frames from video using FFmpeg
• Colorize each individual frame
• Re-compile video and attach original audio file
Pipeline: Extract 200x200 frames at 24 fps (ffmpeg) -> Zhang et al. implemented in TensorFlow -> Combine into videos (ffmpeg) -> Outcome
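The three steps above can be sketched as a small driver around the ffmpeg command line. The ffmpeg options used here are standard ones, but the exact settings used in the project are not given on the slides, so the encoder choice (`libx264`), the file naming pattern, and the `colorize_frame` callable (a stand-in for the TensorFlow model) are all assumptions.

```python
import subprocess
from pathlib import Path


def extract_cmd(video, frames_dir, size=200, fps=24):
    """Build the ffmpeg call that writes resized frames as numbered PNG files."""
    pattern = str(Path(frames_dir) / "%06d.png")
    return ["ffmpeg", "-i", str(video), "-vf", f"fps={fps},scale={size}:{size}", pattern]


def combine_cmd(colored_dir, original_video, output, fps=24):
    """Build the ffmpeg call that joins colorized frames with the original audio."""
    pattern = str(Path(colored_dir) / "%06d.png")
    return [
        "ffmpeg",
        "-framerate", str(fps), "-i", pattern,    # input 0: colorized frames
        "-i", str(original_video),                # input 1: source of the audio
        "-map", "0:v", "-map", "1:a?",            # take audio only if it exists
        "-c:v", "libx264", "-pix_fmt", "yuv420p",
        "-shortest", str(output),
    ]


def colorize_video(video, workdir, output, colorize_frame):
    """Extract frames, colorize each one independently, and rebuild the video."""
    frames = Path(workdir) / "frames"
    colored = Path(workdir) / "colored"
    frames.mkdir(parents=True, exist_ok=True)
    colored.mkdir(parents=True, exist_ok=True)
    subprocess.run(extract_cmd(video, frames), check=True)
    for frame in sorted(frames.glob("*.png")):
        # colorize_frame(src, dst) is a hypothetical wrapper around the model.
        colorize_frame(frame, colored / frame.name)
    subprocess.run(combine_cmd(colored, video, output), check=True)
```

Because each frame is passed to `colorize_frame` on its own, nothing in this loop enforces consistency between consecutive frames.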
Outcome
• Colorized videos are more ‘tangible’ and ‘alive’ than black-and-white ones
• Showing colorized Polygoonjournaals can augment the TTS engine
• Generally positive responses to the technology may increase attention to the NISV collection
Challenges
• Each frame is treated as independent
and colorized as such
--> artifacts appear between frames
• Slow performance without an
Nvidia GPU
• Low resolution
• Predicted colors are still far from perfect
www.openbeelden.nl/tags/ingekleurd
Hosted on Openbeelden
platform
One of the colorized videos
received 61,000+ views, 1,700
likes and was shared 521 times,
illustrating the potential to
engage new audiences.
tiny.cc/colorNL
Take home
• Collection-specific TTS systems can provide audio enrichments for archive
material or multimedia applications.
• Colorization of old media allows for a new view on existing images.
• NISV will continue investigating these emerging technologies to
enable new types of interaction and to further engage new
audiences with archival material in unexpected ways:
– In the media museum
– On its public-facing online channels
Thank you
Annex: Results (sentences)
Dataset | Unique sentences | Unique sentences found | After synsets | After decompounding
Contemporary news | 1022 | 106 | 110 | 186
Old news | 2626 | 183 | 190 | 301
Tweets | 8937 | 174 | 181 | 296
Books | 56106 | 9387 | 11385 | 18271
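The counts in the annex are easier to compare as percentages of each dataset's unique sentences. The sketch below only restates the table's figures and derives the rates; the dictionary layout and function name are assumptions for the example.

```python
# (unique sentences, found, after synsets, after decompounding), copied from the annex
results = {
    "Contemporary news": (1022, 106, 110, 186),
    "Old news": (2626, 183, 190, 301),
    "Tweets": (8937, 174, 181, 296),
    "Books": (56106, 9387, 11385, 18271),
}


def rates(total, found, after_synsets, after_decompounding):
    """Express each count as a percentage of the total, rounded to one decimal."""
    counts = (found, after_synsets, after_decompounding)
    return tuple(round(100 * count / total, 1) for count in counts)


for dataset, row in results.items():
    base, synsets, decompounded = rates(*row)
    print(f"{dataset}: {base}% -> {synsets}% -> {decompounded}%")
```

For contemporary news, for example, this gives roughly 10.4% of sentences before expansion and 18.2% after decompounding.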

New Life for Old Media (NEM presentation)
