SlideShare ist ein Scribd-Unternehmen logo
1 von 11
Downloaden Sie, um offline zu lesen
Building a dialogue system
using a generative model
2020/06/25 1
M1 Kento Tanaka
(⽣成モデルに基づく対話システムの構築)
Background
2020/06/25 2
Introduction
▶ Users are relying on systems able to support an interaction for
searching information (Siri, Alexa, etc.) [Zhou+, 2018]
▶ The use of NN has led to a flurry of research on large-scale,
non-task-oriented DS. [Sordoni+, 2015]
Goal
Create a smooth and sociable dialogue system
2020/06/25 3
Introduction (What is ‘good’ chatbot?)
▶ One crucial step in the development of DS is evaluation.
[Deriu, 2019]
Human evaluations:
・High accuracy but expensive
Automatic evaluations:
・Low accuracy but cheap
・Hard to scale
・Metrics from MT (to compare a generated response to a target.)
Very weakly correlation with human judgements.
2020/06/25 4
Related works (Word overlap-based Metrics)
▶ BLEU-N [Liu, 2016]
▶ ROUGE-L [Liu, 2016]
・Analyze the co-occurrences of n-grams
- tgt : I work on machine learning.
- pred : He works on machine learning.
・BP : Penalizing sentences that are
too short
・It is a F-measure based on the
LCS(Longest Common Subsequence)
2020/06/25 5
Related works (Embedding-based Metrics)
▶ Embedding Average [Liu, 2016]
▶ Vector Extrema [Liu, 2016]
▶ Greedy Matching [Liu, 2016]
・Calculate sentence-level embedding.
・Calculate sentence-level embedding.
・Average of the cosine similarity of the
words with the highest cosine similarity.
2020/06/25 6
Method.1
▶ OpenNMT
- is an open source ecosystem for neural machine translation
and neural sequence learning.
▶ Dataset
- Training data : twitter 1.2M pairs.
- Test data : twitter 100 pairs.
2020/06/25 7
Method.2 (without proper noun)
▶ OpenNMT
- is an open source ecosystem for neural machine translation
and neural sequence learning.
▶ Dataset
- Training data : twitter 0.7M pairs.
- Test data : twitter 100 pairs.
▶ Preprocessing
- Removing proper nouns from a dataset.
- Conversion to Kansai-ben based on rules at the end of words.
2020/06/25 8
Method.3 (Considering context)
▶ OpenNMT
- is an open source ecosystem for neural machine translation
and neural sequence learning.
▶ Dataset
- Training data : twitter 1.5M triple sets.
- Test data : twitter 100 triple sets.
▶ Preprocessing
- Removing proper nouns from a dataset.
- Conversion to Kansai-ben based on rules at the end of words.
▶ Context
- Learning with three sets of data.
「晩ご飯どうする?」 & 「ハンバーグはどう?」Input:
Output: 「昨日も食べたやん!カレーがええなぁ。」
2020/06/25 9
Evaluation
Human evaluations:
Automatic evaluations:
Grice’s Maxims Conversation
[Grice, 1975]
1. Quality
2. Quantity
3. Relation
4. Manner
・Adaptability of dialogue
・Informative
・Completeness of utterance
・Context considerations
Evaluation criteria
Embedding Average
▶ Grading on a 5-point scale
▶ Grading by 4 people
2020/06/25 10
Result
Human evaluations
Automatic
evaluations
Adaptability Informative Completeness Context Embedding Average
Model1 3.045 2.185 3.195 2.45 0.51555
Model2 2.94 2.05 2.97 2.285 0.52623
Model3 3.11 3.18 2.92 2.58 0.93575
Table1. Human evaluations and automatic evaluations
・Increased input have anything to do with it.
・Model3 is the best in embedding avg.
2020/06/25 11
Conclusion
▶ Created a generation-based dialogue system.
▶ Low adaptability
Increase the amount of good quality data
▶ Yielded commonplace responses.
Ideal: more diverse, interesting, and appropriate responses.
▶ Automatic evaluations that are highly correlated with human
judgment are needed.

Weitere ähnliche Inhalte

Was ist angesagt?

A Friendly Introduction to Machine Learning
A Friendly Introduction to Machine LearningA Friendly Introduction to Machine Learning
A Friendly Introduction to Machine LearningHaptik
 
Modular Multitask Reinforcement Learning with Policy Sketches
Modular Multitask Reinforcement Learning with Policy SketchesModular Multitask Reinforcement Learning with Policy Sketches
Modular Multitask Reinforcement Learning with Policy SketchesYoonho Lee
 
Hot machine learning topics
Hot machine learning topicsHot machine learning topics
Hot machine learning topicsWriteMyThesis
 
Anoop_Dobhal_Resume
Anoop_Dobhal_ResumeAnoop_Dobhal_Resume
Anoop_Dobhal_ResumeAnoop Dobhal
 
Hot Topics in Machine Learning for Research and Thesis
Hot Topics in Machine Learning for Research and ThesisHot Topics in Machine Learning for Research and Thesis
Hot Topics in Machine Learning for Research and ThesisWriteMyThesis
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningEng Teong Cheah
 
Hot Topics in Machine Learning For Research and thesis
Hot Topics in Machine Learning For Research and thesisHot Topics in Machine Learning For Research and thesis
Hot Topics in Machine Learning For Research and thesisWriteMyThesis
 
AI Chatbot Service Framework based on Backpropagation Network for Predicting ...
AI Chatbot Service Framework based on Backpropagation Network for Predicting ...AI Chatbot Service Framework based on Backpropagation Network for Predicting ...
AI Chatbot Service Framework based on Backpropagation Network for Predicting ...資彥 解
 
Continual Learning with Deep Architectures - Tutorial ICML 2021
Continual Learning with Deep Architectures - Tutorial ICML 2021Continual Learning with Deep Architectures - Tutorial ICML 2021
Continual Learning with Deep Architectures - Tutorial ICML 2021Vincenzo Lomonaco
 

Was ist angesagt? (14)

A Friendly Introduction to Machine Learning
A Friendly Introduction to Machine LearningA Friendly Introduction to Machine Learning
A Friendly Introduction to Machine Learning
 
Modular Multitask Reinforcement Learning with Policy Sketches
Modular Multitask Reinforcement Learning with Policy SketchesModular Multitask Reinforcement Learning with Policy Sketches
Modular Multitask Reinforcement Learning with Policy Sketches
 
A study on meta learning
A study on meta learningA study on meta learning
A study on meta learning
 
Meta-Learning Presentation
Meta-Learning PresentationMeta-Learning Presentation
Meta-Learning Presentation
 
Hot machine learning topics
Hot machine learning topicsHot machine learning topics
Hot machine learning topics
 
Anoop_Dobhal_Resume
Anoop_Dobhal_ResumeAnoop_Dobhal_Resume
Anoop_Dobhal_Resume
 
Machine learning
Machine learningMachine learning
Machine learning
 
Hot Topics in Machine Learning for Research and Thesis
Hot Topics in Machine Learning for Research and ThesisHot Topics in Machine Learning for Research and Thesis
Hot Topics in Machine Learning for Research and Thesis
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Hot Topics in Machine Learning For Research and thesis
Hot Topics in Machine Learning For Research and thesisHot Topics in Machine Learning For Research and thesis
Hot Topics in Machine Learning For Research and thesis
 
Techniques Machine Learning
Techniques Machine LearningTechniques Machine Learning
Techniques Machine Learning
 
AI Chatbot Service Framework based on Backpropagation Network for Predicting ...
AI Chatbot Service Framework based on Backpropagation Network for Predicting ...AI Chatbot Service Framework based on Backpropagation Network for Predicting ...
AI Chatbot Service Framework based on Backpropagation Network for Predicting ...
 
Continual Learning with Deep Architectures - Tutorial ICML 2021
Continual Learning with Deep Architectures - Tutorial ICML 2021Continual Learning with Deep Architectures - Tutorial ICML 2021
Continual Learning with Deep Architectures - Tutorial ICML 2021
 
CV
CVCV
CV
 

Ähnlich wie Dialogue system②

A survey on Machine Learning and Artificial Neural Networks
A survey on Machine Learning and Artificial Neural NetworksA survey on Machine Learning and Artificial Neural Networks
A survey on Machine Learning and Artificial Neural NetworksIRJET Journal
 
ML-Chapter_one.pptx
ML-Chapter_one.pptxML-Chapter_one.pptx
ML-Chapter_one.pptxbelay41
 
IRJET- Semantic Question Matching
IRJET- Semantic Question MatchingIRJET- Semantic Question Matching
IRJET- Semantic Question MatchingIRJET Journal
 
Machine Learning: Need of Machine Learning, Its Challenges and its Applications
Machine Learning: Need of Machine Learning, Its Challenges and its ApplicationsMachine Learning: Need of Machine Learning, Its Challenges and its Applications
Machine Learning: Need of Machine Learning, Its Challenges and its ApplicationsArpana Awasthi
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learningshivani saluja
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE
 
The Generative AI System Shock, and some thoughts on Collective Intelligence ...
The Generative AI System Shock, and some thoughts on Collective Intelligence ...The Generative AI System Shock, and some thoughts on Collective Intelligence ...
The Generative AI System Shock, and some thoughts on Collective Intelligence ...Simon Buckingham Shum
 
IRJET- Sentiment Analysis to Segregate Attributes using Machine Learning Tech...
IRJET- Sentiment Analysis to Segregate Attributes using Machine Learning Tech...IRJET- Sentiment Analysis to Segregate Attributes using Machine Learning Tech...
IRJET- Sentiment Analysis to Segregate Attributes using Machine Learning Tech...IRJET Journal
 
IRJET- Chatbot using NLP and Deep Learning
IRJET-  	  Chatbot using NLP and Deep LearningIRJET-  	  Chatbot using NLP and Deep Learning
IRJET- Chatbot using NLP and Deep LearningIRJET Journal
 
IRJET - E-Assistant: An Interactive Bot for Banking Sector using NLP Process
IRJET -  	  E-Assistant: An Interactive Bot for Banking Sector using NLP ProcessIRJET -  	  E-Assistant: An Interactive Bot for Banking Sector using NLP Process
IRJET - E-Assistant: An Interactive Bot for Banking Sector using NLP ProcessIRJET Journal
 
Semi-Supervised Insight Generation from Petabyte Scale Text Data
Semi-Supervised Insight Generation from Petabyte Scale Text DataSemi-Supervised Insight Generation from Petabyte Scale Text Data
Semi-Supervised Insight Generation from Petabyte Scale Text DataTech Triveni
 
IRJET- NEEV: An Education Informational Chatbot
IRJET-  	  NEEV: An Education Informational ChatbotIRJET-  	  NEEV: An Education Informational Chatbot
IRJET- NEEV: An Education Informational ChatbotIRJET Journal
 
Sentiment Analysis: A comparative study of Deep Learning and Machine Learning
Sentiment Analysis: A comparative study of Deep Learning and Machine LearningSentiment Analysis: A comparative study of Deep Learning and Machine Learning
Sentiment Analysis: A comparative study of Deep Learning and Machine LearningIRJET Journal
 
IRJET- Multimedia Chatbot using Classification
IRJET- Multimedia Chatbot using ClassificationIRJET- Multimedia Chatbot using Classification
IRJET- Multimedia Chatbot using ClassificationIRJET Journal
 
Analyzing Sentiment Of Movie Reviews In Bangla By Applying Machine Learning T...
Analyzing Sentiment Of Movie Reviews In Bangla By Applying Machine Learning T...Analyzing Sentiment Of Movie Reviews In Bangla By Applying Machine Learning T...
Analyzing Sentiment Of Movie Reviews In Bangla By Applying Machine Learning T...Andrew Parish
 
Intelligent Career Guidance System.pptx
Intelligent Career Guidance System.pptxIntelligent Career Guidance System.pptx
Intelligent Career Guidance System.pptxAnonymous366406
 

Ähnlich wie Dialogue system② (20)

Vitriol
VitriolVitriol
Vitriol
 
A survey on Machine Learning and Artificial Neural Networks
A survey on Machine Learning and Artificial Neural NetworksA survey on Machine Learning and Artificial Neural Networks
A survey on Machine Learning and Artificial Neural Networks
 
ML-Chapter_one.pptx
ML-Chapter_one.pptxML-Chapter_one.pptx
ML-Chapter_one.pptx
 
IRJET- Semantic Question Matching
IRJET- Semantic Question MatchingIRJET- Semantic Question Matching
IRJET- Semantic Question Matching
 
Machine Learning: Need of Machine Learning, Its Challenges and its Applications
Machine Learning: Need of Machine Learning, Its Challenges and its ApplicationsMachine Learning: Need of Machine Learning, Its Challenges and its Applications
Machine Learning: Need of Machine Learning, Its Challenges and its Applications
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
 
The Generative AI System Shock, and some thoughts on Collective Intelligence ...
The Generative AI System Shock, and some thoughts on Collective Intelligence ...The Generative AI System Shock, and some thoughts on Collective Intelligence ...
The Generative AI System Shock, and some thoughts on Collective Intelligence ...
 
IRJET- Sentiment Analysis to Segregate Attributes using Machine Learning Tech...
IRJET- Sentiment Analysis to Segregate Attributes using Machine Learning Tech...IRJET- Sentiment Analysis to Segregate Attributes using Machine Learning Tech...
IRJET- Sentiment Analysis to Segregate Attributes using Machine Learning Tech...
 
IRJET- Chatbot using NLP and Deep Learning
IRJET-  	  Chatbot using NLP and Deep LearningIRJET-  	  Chatbot using NLP and Deep Learning
IRJET- Chatbot using NLP and Deep Learning
 
IRJET - E-Assistant: An Interactive Bot for Banking Sector using NLP Process
IRJET -  	  E-Assistant: An Interactive Bot for Banking Sector using NLP ProcessIRJET -  	  E-Assistant: An Interactive Bot for Banking Sector using NLP Process
IRJET - E-Assistant: An Interactive Bot for Banking Sector using NLP Process
 
Semi-Supervised Insight Generation from Petabyte Scale Text Data
Semi-Supervised Insight Generation from Petabyte Scale Text DataSemi-Supervised Insight Generation from Petabyte Scale Text Data
Semi-Supervised Insight Generation from Petabyte Scale Text Data
 
IRJET- NEEV: An Education Informational Chatbot
IRJET-  	  NEEV: An Education Informational ChatbotIRJET-  	  NEEV: An Education Informational Chatbot
IRJET- NEEV: An Education Informational Chatbot
 
Learning for Big Data-林軒田
Learning for Big Data-林軒田Learning for Big Data-林軒田
Learning for Big Data-林軒田
 
Machine learning
Machine learningMachine learning
Machine learning
 
Sentiment Analysis: A comparative study of Deep Learning and Machine Learning
Sentiment Analysis: A comparative study of Deep Learning and Machine LearningSentiment Analysis: A comparative study of Deep Learning and Machine Learning
Sentiment Analysis: A comparative study of Deep Learning and Machine Learning
 
Machine learning
 Machine learning Machine learning
Machine learning
 
IRJET- Multimedia Chatbot using Classification
IRJET- Multimedia Chatbot using ClassificationIRJET- Multimedia Chatbot using Classification
IRJET- Multimedia Chatbot using Classification
 
Analyzing Sentiment Of Movie Reviews In Bangla By Applying Machine Learning T...
Analyzing Sentiment Of Movie Reviews In Bangla By Applying Machine Learning T...Analyzing Sentiment Of Movie Reviews In Bangla By Applying Machine Learning T...
Analyzing Sentiment Of Movie Reviews In Bangla By Applying Machine Learning T...
 
Intelligent Career Guidance System.pptx
Intelligent Career Guidance System.pptxIntelligent Career Guidance System.pptx
Intelligent Career Guidance System.pptx
 

Kürzlich hochgeladen

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 

Kürzlich hochgeladen (20)

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Dialogue system②

  • 1. Building a dialogue system using a generative model 2020/06/25 1 M1 Kento Tanaka (⽣成モデルに基づく対話システムの構築)
  • 2. Background 2020/06/25 2 Introduction ▶ Users are relying on systems able to support an interaction for searching information (Siri, Alexa, etc.) [Zhou+, 2018] ▶ The use of NN has led to a flurry of research on large-scale, non-task-oriented DS. [Sordoni+, 2015] Goal Create a smooth and sociable dialogue system
  • 3. 2020/06/25 3 Introduction (What is ‘good’ chatbot?) ▶ One crucial step in the development of DS is evaluation. [Deriu, 2019] Human evaluations: ・High accuracy but expensive Automatic evaluations: ・Low accuracy but cheap ・Hard to scale ・Metrics from MT (to compare a generated response to a target.) Very weakly correlation with human judgements.
  • 4. 2020/06/25 4 Related works (Word overlap-based Metrics) ▶ BLEU-N [Liu, 2016] ▶ ROUGE-L [Liu, 2016] ・Analyze the co-occurrences of n-grams - tgt : I work on machine learning. - pred : He works on machine learning. ・BP : Penalizing sentences that are too short ・It is a F-measure based on the LCS(Longest Common Subsequence)
  • 5. 2020/06/25 5 Related works (Embedding-based Metrics) ▶ Embedding Average [Liu, 2016] ▶ Vector Extrema [Liu, 2016] ▶ Greedy Matching [Liu, 2016] ・Calculate sentence-level embedding. ・Calculate sentence-level embedding. ・Average of the cosine similarity of the words with the highest cosine similarity.
  • 6. 2020/06/25 6 Method.1 ▶ OpenNMT - is an open source ecosystem for neural machine translation and neural sequence learning. ▶ Dataset - Training data : twitter 1.2M pairs. - Test data : twitter 100 pairs.
  • 7. 2020/06/25 7 Method.2 (without proper noun) ▶ OpenNMT - is an open source ecosystem for neural machine translation and neural sequence learning. ▶ Dataset - Training data : twitter 0.7M pairs. - Test data : twitter 100 pairs. ▶ Preprocessing - Removing proper nouns from a dataset. - Conversion to Kansai-ben based on rules at the end of words.
  • 8. 2020/06/25 8 Method.3 (Considering context) ▶ OpenNMT - is an open source ecosystem for neural machine translation and neural sequence learning. ▶ Dataset - Training data : twitter 1.5M triple sets. - Test data : twitter 100 triple sets. ▶ Preprocessing - Removing proper nouns from a dataset. - Conversion to Kansai-ben based on rules at the end of words. ▶ Context - Learning with three sets of data. 「晩ご飯どうする?」 & 「ハンバーグはどう?」Input: Output: 「昨日も食べたやん!カレーがええなぁ。」
  • 9. 2020/06/25 9 Evaluation Human evaluations: Automatic evaluations: Grice’s Maxims Conversation [Grice, 1975] 1. Quality 2. Quantity 3. Relation 4. Manner ・Adaptability of dialogue ・Informative ・Completeness of utterance ・Context considerations Evaluation criteria Embedding Average ▶ Grading on a 5-point scale ▶ Grading by 4 people
  • 10. 2020/06/25 10 Result Human evaluations Automatic evaluations Adaptability Informative Completeness Context Embedding Average Model1 3.045 2.185 3.195 2.45 0.51555 Model2 2.94 2.05 2.97 2.285 0.52623 Model3 3.11 3.18 2.92 2.58 0.93575 Table1. Human evaluations and automatic evaluations ・Increased input have anything to do with it. ・Model3 is the best in embedding avg.
  • 11. 2020/06/25 11 Conclusion ▶ Created a generation-based dialogue system. ▶ Low adaptability Increase the amount of good quality data ▶ Yielded commonplace responses. Ideal: more diverse, interesting, and appropriate responses. ▶ Automatic evaluations that are highly correlated with human judgment are needed.