IBM confidential ©IBM Corporation
Watson Beat:
New Cognitive Era
Future | Current
IBM Confidential | Do Not Distribute © 2015 IBM Corporation
Motivation:
▪ Why music?
▪ Why unsupervised learning?
▪ Why Watson Beat?
▪ Describing music in terms of parameters is not intuitive for most people
▪ For example: I like songs at 120 bpm in B# major, etc.
▪ Watson Beat provides a new way to query and compose music based on reference tracks
▪ Ease of use: no prior knowledge of music theory is required
▪ Provide input track(s) to the model, and it returns new output track(s)
Learning Music: Basic Idea
▪ Demo of potential application
▪ Example: I am an amateur video game developer who needs music for a game, and I like how “Game of Thrones” sounds
▪ Feed the track through Watson Beat and get hundreds of variations on the same music
▪ Ability to steer compositions based on intent: slow, sad, fast, happy, vibrant
Diagram:
▪ Input track(s)
▪ Extract musical characteristics (pitch, rhythm, dynamics, etc.)
▪ Perturb the model using creativity genes (known influences)
▪ Reconstruct the track by iteratively learning the input musical characteristics with added perturbation
▪ Output track(s)
Demos: Original “Game Of Thrones”; Learned “Game Of Thrones”
Deep Belief Networks (DBN):
▪ RBM: a stochastic neural network with
▪ one layer of visible units
▪ one layer of hidden units
▪ Learning RBMs: contrastive divergence
▪ DBN: a stack of multiple Restricted Boltzmann Machines (RBMs)
RBM diagram: x: visible layer; h: hidden layer; W: weights
Flow: Input vector -> Perturbed input vector -> Unsupervised learning of NNs (RBMs, autoencoders, etc.) -> Output vector
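As a rough illustration of the RBM bullets above, here is a minimal sketch of a binary RBM trained with one step of contrastive divergence (CD-1). It uses only the Python standard library. The class name `TinyRBM`, the bias vectors `b` and `c`, the learning rate, and the weight initialisation are assumptions made for this example; the slides do not describe how Watson Beat implements its RBMs.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class TinyRBM:
    """Binary RBM with visible x, hidden h and weights W (names follow the slide)."""

    def __init__(self, n_visible, n_hidden, seed=0):
        self.rng = random.Random(seed)
        # Small random weights; biases start at zero (assumed initialisation).
        self.W = [[self.rng.gauss(0.0, 0.1) for _ in range(n_hidden)]
                  for _ in range(n_visible)]
        self.b = [0.0] * n_visible  # visible biases (assumption, not on the slide)
        self.c = [0.0] * n_hidden   # hidden biases (assumption, not on the slide)

    def p_h_given_x(self, x):
        """p(h_j = 1 | x) for every hidden unit."""
        return [sigmoid(self.c[j] + sum(x[i] * self.W[i][j] for i in range(len(x))))
                for j in range(len(self.c))]

    def p_x_given_h(self, h):
        """p(x_i = 1 | h) for every visible unit."""
        return [sigmoid(self.b[i] + sum(h[j] * self.W[i][j] for j in range(len(h))))
                for i in range(len(self.b))]

    def sample(self, probs):
        """Draw a binary state for each unit from its activation probability."""
        return [1.0 if self.rng.random() < p else 0.0 for p in probs]

    def cd1_step(self, x, lr=0.1):
        """One CD-1 update on a single example; returns the squared reconstruction error."""
        ph0 = self.p_h_given_x(x)
        h0 = self.sample(ph0)
        x1 = self.p_x_given_h(h0)   # reconstruction, kept as probabilities
        ph1 = self.p_h_given_x(x1)
        for i in range(len(x)):
            for j in range(len(ph0)):
                self.W[i][j] += lr * (x[i] * ph0[j] - x1[i] * ph1[j])
            self.b[i] += lr * (x[i] - x1[i])
        for j in range(len(ph0)):
            self.c[j] += lr * (ph0[j] - ph1[j])
        return sum((a - r) ** 2 for a, r in zip(x, x1))
```

Calling `cd1_step` repeatedly on the same input vectors should drive the reconstruction error down. A production system would more likely use a vectorised library than nested Python loops.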
Applications: Cloud-based cognitive music service
▪ Watson Beat Pandora station
▪ Suggest recreated versions of songs that you like or have been listening to
▪ Producers, composers, and music engineers create music based on intent (slow, fast, happy, vibrant)
▪ Retail stores and small businesses can create their own music based on original tracks
▪ Loop pedal mixing: pedal -> DJ Watson mixer -> amp
▪ https://www.youtube.com/watch?v=qX2eJsj9MiQ
Backup Slides
Training RBMs
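The body of this slide did not survive extraction. For reference, the standard textbook formulation of a binary RBM and its contrastive-divergence weight update is given below. The bias terms $b_i$, $c_j$ and the learning rate $\eta$ are notation introduced here, not taken from the slides; $x$, $h$ and $W$ follow the deck, and $\sigma(z) = 1/(1 + e^{-z})$.

```latex
\begin{align}
E(x, h) &= -\sum_i b_i x_i - \sum_j c_j h_j - \sum_{i,j} x_i W_{ij} h_j \\
p(h_j = 1 \mid x) &= \sigma\Big(c_j + \sum_i x_i W_{ij}\Big) \\
p(x_i = 1 \mid h) &= \sigma\Big(b_i + \sum_j W_{ij} h_j\Big) \\
\Delta W_{ij} &= \eta \big( \langle x_i h_j \rangle_{\text{data}} - \langle x_i h_j \rangle_{\text{recon}} \big)
\end{align}
```

The first expectation is taken with the visible units clamped to a training example, the second after one reconstruction step (CD-1).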
Recreate original music using RBMs
▪ x: Visible layer (holds pitch information); pitches over time in 1/16 steps: C#, E, B
▪ h: Hidden layer (holds extracted features of the visible layer)
▪ x~: Learned visible layer (holds learned pitch information); pitches over time in 1/16 steps: C, E#, B
▪ Mappings: p(h|x) from visible to hidden, p(x|h) from hidden back to visible
Demo: Recreated “Mary had a little lamb”
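The slide does not say how pitch information is laid out in the visible layer. One plausible layout, sketched below purely as an assumption, is a one-hot frame of 12 pitch classes for each 1/16 time step, flattened into a single binary vector. The names `PITCHES`, `encode` and `decode` are hypothetical.

```python
# Twelve pitch classes spelled with sharps (an assumed vocabulary; E# would map to F).
PITCHES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def encode(melody):
    """Flatten a melody (one pitch name per 1/16 step) into a binary visible vector x."""
    x = []
    for name in melody:
        frame = [0.0] * len(PITCHES)
        frame[PITCHES.index(name)] = 1.0
        x.extend(frame)
    return x


def decode(x_tilde):
    """Pick the most probable pitch in each 1/16 frame of a reconstructed vector x~."""
    n = len(PITCHES)
    melody = []
    for start in range(0, len(x_tilde), n):
        frame = x_tilde[start:start + n]
        melody.append(PITCHES[frame.index(max(frame))])
    return melody


x = encode(["C#", "E", "B"])   # 3 steps x 12 pitch classes = 36 visible units
print(decode(x))               # ['C#', 'E', 'B']
```

With this layout, a reconstruction whose probabilities peak on a different unit decodes to a different note. The slide shows such a difference between the input (C#, E, B) and the learned layer (C, E#, B).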
Example 1: Create new music by adding perturbation
▪ x: Visible layer (holds initial pitch information); pitches over time: C#, E, B
▪ x: Visible layer (holds perturbed pitch information); pitches over time: C#, rand, B
▪ h, h1: Hidden layers (hold extracted features of the visible layer)
▪ x~: Learned visible layer (holds learned pitch information)
▪ Mappings: p(h|x), p1(h|x), p(x|h)
▪ Further diagram labels: E rand, E# rand`, B, A rand`
Demos: Original “Mary had a little lamb”; Learned “Mary” (less perturbation); Learned “Mary” (more perturbation)
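The "rand" entries in this example suggest that some notes in the visible layer are swapped for random values before learning, and the two demos differ in how much is swapped. The sketch below shows one simple way to do that. It assumes that perturbation means replacing a fraction of the time steps with a uniformly random pitch, which the slides do not confirm. The function name `perturb` and its `amount` parameter are hypothetical.

```python
import random

# Assumed pitch vocabulary; the slides only show a few note names.
PITCHES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def perturb(melody, amount, seed=0):
    """Replace each step with a random pitch with probability `amount` (0.0 to 1.0)."""
    rng = random.Random(seed)
    return [rng.choice(PITCHES) if rng.random() < amount else note
            for note in melody]


original = ["C#", "E", "B"]
less = perturb(original, amount=0.2)   # mostly the original notes
more = perturb(original, amount=0.8)   # mostly random notes
```

The perturbed melody would then be encoded into the visible layer and passed through p(h|x) and p(x|h), so that the model pulls the random notes back toward patterns it has learned.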
Example 2: Create new music by steering learning based on emotional intent
▪ x: Visible layer (holds initial pitch information); pitches over time: C#, E, B
▪ x: Visible layer (holds perturbed pitch information); pitches over time: C#, Oct, B
▪ h, h1: Hidden layers (hold extracted features of the visible layer)
▪ x~: Learned visible layer (holds learned pitch information)
▪ Mappings: p(h|x), p1(h|x), p(x|h)
▪ Further diagram labels: E Oct, E# Oct`, B, A Oct`
Demos: Original “String Quartet”; Spooky String Quartet
Example 3: Create new music by learning from two songs and adding perturbation
▪ Visible layer (holds pitch + bias information)
▪ Hidden layer (holds extracted features of the visible layer)
▪ Weights
▪ Diagram labels: C#, B, E, D# (pitch over time); minor; Song A; Song B
Demo: Learned “Willie Nelson” and “Miley Cyrus”
Example 4: Create new music by steering learning based on emotional intent
Demos: Original Adele; Learned Adele “Vibrant version”; Learned Adele “Mellow version”
▪ RBM1: x1: visible layer; h1: hidden layer; W1: weights
▪ RBM2: x2: visible layer; h2: hidden layer; W2: weights
▪ RBM3: x3: visible layer; h3: hidden layer; W3: weights
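The three RBMs listed above (x1/h1/W1 through x3/h3/W3) match the usual way a DBN is built: each RBM is trained in turn, and its hidden activations become the visible input of the next. The sketch below illustrates that greedy layer-wise scheme with the standard library only. It uses a simplified CD-1 that works on probabilities rather than sampled states. All function names, the bias vectors, and the hyperparameters are assumptions for illustration, not Watson Beat's actual training code.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def hidden_probs(x, W, c):
    """p(h_j = 1 | x) for every hidden unit j."""
    return [sigmoid(c[j] + sum(x[i] * W[i][j] for i in range(len(x))))
            for j in range(len(c))]


def visible_probs(h, W, b):
    """p(x_i = 1 | h) for every visible unit i."""
    return [sigmoid(b[i] + sum(h[j] * W[i][j] for j in range(len(h))))
            for i in range(len(b))]


def train_rbm(data, n_hidden, epochs=50, lr=0.1, seed=0):
    """Train one RBM with CD-1 on probabilities (no sampling); returns (W, b, c)."""
    rng = random.Random(seed)
    n_visible = len(data[0])
    W = [[rng.gauss(0.0, 0.1) for _ in range(n_hidden)] for _ in range(n_visible)]
    b, c = [0.0] * n_visible, [0.0] * n_hidden
    for _ in range(epochs):
        for x in data:
            h0 = hidden_probs(x, W, c)
            x1 = visible_probs(h0, W, b)
            h1 = hidden_probs(x1, W, c)
            for i in range(n_visible):
                for j in range(n_hidden):
                    W[i][j] += lr * (x[i] * h0[j] - x1[i] * h1[j])
                b[i] += lr * (x[i] - x1[i])
            for j in range(n_hidden):
                c[j] += lr * (h0[j] - h1[j])
    return W, b, c


def train_dbn(data, layer_sizes):
    """Greedy layer-wise stacking: hidden activations of RBM k feed RBM k+1."""
    layers, inputs = [], data
    for k, n_hidden in enumerate(layer_sizes):
        W, b, c = train_rbm(inputs, n_hidden, seed=k)
        layers.append((W, b, c))
        inputs = [hidden_probs(x, W, c) for x in inputs]  # x_{k+1} := h_k
    return layers
```

For example, `train_dbn(data, [8, 4, 2])` would return three `(W, b, c)` tuples corresponding to W1, W2 and W3 in the list above.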

Watson DevCon 2016 - Watson Beat: Making Music Cognitive
