Comparison of Transfer-Learning Approaches for Response Selection in Multi-Turn Conversations
Jesse Vig and Kalai Ramea
DSTC 7 Workshop, AAAI '19
Honolulu, HI
Jan. 27, 2019
© 2019 PARC All Rights Reserved
Recent Trends in Transfer Learning for NLP
Two components of an NLP model: Language Representation, Task Model
Language representation is common across tasks, enabling transfer learning.
[Diagram: one Language Representation shared by Task 1 Model and Task 2 Model]
Deeper language representations support greater transfer
[Diagram: Language Representation and Task Model, 2013 vs. Present]
Bag of Words (<2013)
Language Representation Model (unigrams, bigrams)
Input text: This film really dragged
What model sees: words as symbols (aardvark, abstain, abstract, ablation, …)
Task Model output (Sentiment): 92% negative, 3% neutral, 5% positive
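As a rough illustration of the bag-of-words representation above, the sketch below counts unigrams and bigrams for the slide's example sentence. The function names and the whitespace tokenization are assumptions made for this example, not something the slides specify.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all n-grams in a token list as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def bag_of_words(text):
    """Count unigrams and bigrams; each distinct n-gram is an opaque symbol."""
    tokens = text.lower().split()  # assumption: naive whitespace tokenization
    return Counter(ngrams(tokens, 1) + ngrams(tokens, 2))


features = bag_of_words("This film really dragged")
print(sorted(features))
```

Because every n-gram is just a symbol, "dragged" and "boring" share nothing in this representation unless both happen to appear in the training data.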
Word embeddings (2013-)
Word2Vec / GloVe / FastText
Context / Center Word / Context: [I] [pet] [the] dog [and] [it] [barked]
Input text: This film really dragged
What model sees: (This) (film) (really) (dragged)
Task Model output (Sentiment): 92% negative, 3% neutral, 5% positive
Input text: This film really dragged / The movie was boring
What model sees: (This) (film) (really) (dragged) / (The) (movie) (was) (boring)
Similar words are close in embedding space
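One common way to make "close in embedding space" concrete is cosine similarity. The sketch below uses hand-picked 3-dimensional vectors that are purely hypothetical (real Word2Vec, GloVe, or FastText vectors have hundreds of dimensions and are learned from data).

```python
import math

# Hypothetical toy embeddings, chosen by hand for illustration only.
toy_embeddings = {
    "dragged": [0.9, 0.1, 0.2],
    "boring": [0.8, 0.2, 0.1],
    "film": [0.1, 0.9, 0.3],
    "movie": [0.2, 0.8, 0.4],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


sim_dragged_boring = cosine(toy_embeddings["dragged"], toy_embeddings["boring"])
sim_film_movie = cosine(toy_embeddings["film"], toy_embeddings["movie"])
sim_dragged_film = cosine(toy_embeddings["dragged"], toy_embeddings["film"])
print(sim_dragged_boring, sim_film_movie, sim_dragged_film)
```

With these toy vectors, the related pairs (dragged/boring, film/movie) score higher than the unrelated pair (dragged/film), which is the property the slide describes.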
Contextual embeddings (2017-)
Word: ELMo, CoVe (Sentence: USE, GPDS, InferSent)
[ELMo diagram for "This film really dragged", from “NLP’s ImageNet Moment Has Arrived”]
Input text: This film really dragged
What model sees: (This) (film) (really) (dragged)
Word embedding dependent on other words in sentence
Task Model output: 92% negative, 3% neutral, 5% positive
Input text: This film really dragged / The movie was boring
What model sees: (This) (film) (really) (dragged) / (The) (movie) (was) (boring)
In this context, boring and dragged are semantically similar
Fine-tuning (2018)
ULMFiT, OpenAI GPT, BERT, LM-LSTM (2015)
LM head: The quick brown ? → The quick brown fox
Task head: This film really dragged → 92% negative, 3% neutral, 5% positive
Almost the entire model is pre-trained and fine-tuned, with a thin task-specific layer.
The result is higher accuracy and reduced training data requirements.
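To give a sense of how thin a task-specific layer can be, here is a minimal sketch of a task head as a single linear map followed by softmax. The feature vector stands in for the output of the pre-trained model, and all numbers, names, and shapes are hypothetical. The sketch shows only the forward pass; in actual fine-tuning, both the pre-trained body and this head would be updated by gradient descent.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def task_head(features, weights, bias):
    """Thin task-specific layer: one linear map followed by softmax."""
    logits = [
        sum(w * f for w, f in zip(row, features)) + b
        for row, b in zip(weights, bias)
    ]
    return softmax(logits)


# Hypothetical stand-in for the pre-trained model's sentence representation.
sentence_features = [0.5, -1.2, 0.3, 0.8]
# Hypothetical head parameters: 3 classes x 4 features.
head_weights = [
    [1.0, -1.0, 0.5, 0.5],   # negative
    [0.0, 0.0, 0.0, 0.0],    # neutral
    [-1.0, 1.0, 0.0, -0.5],  # positive
]
head_bias = [0.0, 0.0, 0.0]
labels = ["negative", "neutral", "positive"]

probs = task_head(sentence_features, head_weights, head_bias)
prediction = labels[probs.index(max(probs))]
print(dict(zip(labels, probs)), prediction)
```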
Comparison of Transfer Learning Approaches for DSTC7, Track 1
Partial dialog
Student: I’m looking for an engineering course
Advisor: How about CS481?
Student: I already have a heavy course load
Candidate responses (Advisor)
I suggest CS425
Nice talking to you
CS221 is not too demanding
Hi, how can I help you?
What are you interested in?
+ 95 more
Ranking model
Ranked responses
1 I suggest CS425
2 CS221 is not too demanding
…
100 Hi, how can I help you?
Response ranking as classification problem
Partial dialog
Student: I’m looking for an engineering course
Advisor: How about CS481?
Student: I already have a heavy course load
Candidate response
Advisor: I suggest CS425
Other candidates: Nice talking to you; CS221 is not too demanding; Hi, how can I help you?; What are you interested in?; + 95 more
Classifier: Correct 0.87, Incorrect 0.13
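Framing ranking as classification can be sketched as follows: score each (context, candidate) pair with the classifier's probability of "Correct", then sort candidates by that score. The function names are assumptions, and the score table is a hypothetical stand-in for a trained classifier; only the 0.87 value appears on the slide.

```python
def rank_responses(context, candidates, score_fn):
    """Sort candidates by the classifier's P(correct | context, candidate)."""
    scored = [(score_fn(context, c), c) for c in candidates]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [candidate for _, candidate in scored]


# Hypothetical scores standing in for a trained classifier's output.
toy_scores = {
    "I suggest CS425": 0.87,
    "CS221 is not too demanding": 0.41,
    "What are you interested in?": 0.08,
    "Nice talking to you": 0.05,
    "Hi, how can I help you?": 0.02,
}


def toy_classifier(context, candidate):
    """Look up a fixed score; a real model would read the context as well."""
    return toy_scores[candidate]


context = [
    "I'm looking for an engineering course",
    "How about CS481?",
    "I already have a heavy course load",
]
ranked = rank_responses(context, list(toy_scores), toy_classifier)
print(ranked)
```

Each candidate is scored independently, so the same binary classifier can rank any number of candidates.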
Models Evaluated
Feature-based: Multi-Turn ESIM*, Multi-Turn ESIM+ELMo (MT-EE)
Pre-trained, Fine-tuned: OpenAI GPT**, BERT***
Submitted Model
*Enhanced Sequential Inference Model
**Generative Pretrained Transformer
***Bidirectional Encoder Representations from Transformers
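Multi-Turn ESIM+ELMo (MT-EE)
[Architecture diagram: each context utterance (utterance 0 … utterance T-1) is paired with the response and passed through ELMo, ESIM, and a speaker-specific feedforward layer; the per-utterance outputs are combined by a position-based weighted sum with weights α0 … αT-1 and passed to a classifier.]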
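The position-based weighted sum can be illustrated with a small sketch: one weight per utterance position, applied to the per-utterance vectors and summed. Normalizing the weights with a softmax is an assumption made here for illustration; the slide only shows weights α0 … αT-1, and all names and numbers below are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def position_weighted_sum(utterance_vectors, position_logits):
    """Combine per-utterance vectors using one weight per position.

    Assumption: weights are softmax-normalized position parameters.
    """
    alphas = softmax(position_logits)
    dim = len(utterance_vectors[0])
    combined = [
        sum(a * vec[d] for a, vec in zip(alphas, utterance_vectors))
        for d in range(dim)
    ]
    return combined, alphas


# Hypothetical per-utterance outputs for T = 3 context utterances.
vectors = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
uniform, uniform_alphas = position_weighted_sum(vectors, [0.0, 0.0, 0.0])
recent, recent_alphas = position_weighted_sum(vectors, [0.0, 1.0, 2.0])
print(uniform, recent_alphas)
```

With equal position parameters the result is a plain average; larger parameters for later positions shift the weight toward the most recent utterances.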
OpenAI GPT
• Unidirectional self-attention
• Standard language model pretraining
• BooksCorpus
• Designed for arbitrary text inputs
  • Single sentence, two sentences, multiple choice, etc.
  • Accomplished through delimiter tokens
BERT
• Bidirectional self-attention
• “Masked” language model pretraining
• BooksCorpus + Wikipedia
• Optimizations for sentence pairs
  • Architecture: segment embedding
  • Pre-training: next sentence prediction
Dialog Context
Student: Do I need to study any math courses?
Candidate response
Advisor: You have completed all math required for your degree
BERT → ?
[CLS] do I need to study any math courses ? [SEP] you have completed all math required for your degree . [SEP]
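The sentence-pair input above can be assembled as in the sketch below: the context is sentence A, the response is sentence B, and segment ids mark which sentence each token belongs to. The function name and the whitespace tokenization are assumptions for illustration; BERT itself uses WordPiece tokenization.

```python
def build_bert_input(context, response):
    """Build [CLS] A [SEP] B [SEP] tokens and matching segment ids."""
    # Assumption: lowercase + whitespace split instead of real WordPiece.
    tokens_a = context.lower().split()
    tokens_b = response.lower().split()
    tokens = ["[CLS]"] + tokens_a + ["[SEP]"] + tokens_b + ["[SEP]"]
    # Segment 0 covers [CLS], sentence A, and the first [SEP];
    # segment 1 covers sentence B and the final [SEP].
    segment_ids = [0] * (len(tokens_a) + 2) + [1] * (len(tokens_b) + 1)
    return tokens, segment_ids


tokens, segment_ids = build_bert_input(
    "Do I need to study any math courses ?",
    "You have completed all math required for your degree .",
)
print(" ".join(tokens))
print(segment_ids)
```

For multi-turn dialog, one simple option is to concatenate the context utterances into sentence A; whether the authors did so is not stated on this slide.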
[Attention visualization, BERT-Base: SentenceA (context) and SentenceB (response)]
BERT-Base pretrained model (not fine-tuned). Already knows to attend between sentences, likely due to NSP pretraining task.
Resources
BERT visualization tool:
https://github.com/jessevig/bertviz
Tutorials
http://jalammar.github.io/illustrated-bert/
http://jalammar.github.io/illustrated-transformer/
https://t.co/ZB9049OPwH
Code:
Blog post:
Paper
http://workshop.colips.org/dstc7/papers/17.pdf
