1. Building a dialogue system
using a generative model
2020/06/25 1
M1 Kento Tanaka
(⽣成モデルに基づく対話システムの構築)
2. Background
Introduction
▶ Users increasingly rely on systems that can support an interaction for
information seeking (Siri, Alexa, etc.) [Zhou+, 2018]
▶ The use of neural networks (NN) has led to a flurry of research on
large-scale, non-task-oriented dialogue systems (DS). [Sordoni+, 2015]
Goal
Create a smooth and sociable dialogue system
3. Introduction (What is a ‘good’ chatbot?)
▶ One crucial step in the development of DS is evaluation.
[Deriu, 2019]
Human evaluations:
・High accuracy, but expensive and hard to scale
Automatic evaluations:
・Cheap, but low accuracy
・Metrics borrowed from MT (comparing a generated response to a target)
correlate only very weakly with human judgments.
4. Related works (Word overlap-based Metrics)
▶ BLEU-N [Liu, 2016]
・Analyzes the co-occurrences of n-grams between target and prediction.
・BP (brevity penalty): penalizes sentences that are too short.
▶ ROUGE-L [Liu, 2016]
・An F-measure based on the
LCS (Longest Common Subsequence).
Example:
- tgt : I work on machine learning.
- pred : He works on machine learning.
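These two metrics can be sketched in a few lines of Python. The snippet below (my illustration, not the slides' implementation; whitespace tokenization and the F-measure weight β are assumptions) computes clipped unigram precision, as in BLEU's modified precision, and ROUGE-L on the example pair above:

```python
from collections import Counter

def ngram_precision(pred, tgt, n=1):
    """Fraction of prediction n-grams that also appear in the target
    (clipped counts, as in BLEU's modified n-gram precision)."""
    p = Counter(tuple(pred[i:i + n]) for i in range(len(pred) - n + 1))
    t = Counter(tuple(tgt[i:i + n]) for i in range(len(tgt) - n + 1))
    overlap = sum(min(c, t[g]) for g, c in p.items())
    return overlap / max(sum(p.values()), 1)

def lcs_len(a, b):
    """Length of the longest common subsequence (dynamic programming)."""
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            dp[i + 1][j + 1] = dp[i][j] + 1 if x == y else max(dp[i][j + 1], dp[i + 1][j])
    return dp[len(a)][len(b)]

def rouge_l_f(pred, tgt, beta=1.2):
    """ROUGE-L: F-measure over LCS-based precision and recall."""
    lcs = lcs_len(pred, tgt)
    if lcs == 0:
        return 0.0
    r, p = lcs / len(tgt), lcs / len(pred)
    return (1 + beta ** 2) * p * r / (r + beta ** 2 * p)

tgt = "I work on machine learning .".split()
pred = "He works on machine learning .".split()
print(ngram_precision(pred, tgt, n=1))  # 4 of 6 unigrams overlap -> 0.666...
print(rouge_l_f(pred, tgt))             # LCS is "on machine learning ."
```

With the slide's example, the overlapping unigrams are "on", "machine", "learning", and the period, so both metrics land around 2/3.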
5. Related works (Embedding-based Metrics)
▶ Embedding Average [Liu, 2016]
・Sentence-level embedding computed by averaging word embeddings.
▶ Vector Extrema [Liu, 2016]
・Sentence-level embedding computed by taking the most extreme value
in each dimension.
▶ Greedy Matching [Liu, 2016]
・Average of the cosine similarity of each word with its most similar
word in the other sentence.
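A minimal sketch of all three embedding-based metrics, using made-up 3-dimensional word vectors (real implementations use pretrained embeddings such as word2vec; the toy vectors and sentences here are my assumptions):

```python
import math

# Toy 3-d word vectors (illustrative values, not trained embeddings)
VEC = {
    "good":  [0.9, 0.1, 0.0],
    "great": [0.8, 0.2, 0.1],
    "movie": [0.1, 0.9, 0.3],
    "film":  [0.2, 0.8, 0.4],
}

def cos(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def embedding_average(sent):
    """Sentence vector = mean of its word vectors."""
    vs = [VEC[w] for w in sent]
    return [sum(col) / len(vs) for col in zip(*vs)]

def vector_extrema(sent):
    """Sentence vector = per dimension, the value with the largest magnitude."""
    vs = [VEC[w] for w in sent]
    return [max(col, key=abs) for col in zip(*vs)]

def greedy_match(s1, s2):
    """For each word, take the best cosine match in the other sentence;
    average, and symmetrize over both directions."""
    def one_way(a, b):
        return sum(max(cos(VEC[w], VEC[v]) for v in b) for w in a) / len(a)
    return (one_way(s1, s2) + one_way(s2, s1)) / 2

ref, hyp = ["good", "movie"], ["great", "film"]
print(cos(embedding_average(ref), embedding_average(hyp)))
print(cos(vector_extrema(ref), vector_extrema(hyp)))
print(greedy_match(ref, hyp))
```

All three compare a generated response to a target in embedding space rather than by surface word overlap, which is why semantically close but lexically different pairs still score high.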
6. Method.1
▶ OpenNMT
- An open-source ecosystem for neural machine translation
and neural sequence learning.
▶ Dataset
- Training data : 1.2M Twitter pairs.
- Test data : 100 Twitter pairs.
7. Method.2 (without proper nouns)
▶ OpenNMT (same as Method.1)
▶ Dataset
- Training data : 0.7M Twitter pairs.
- Test data : 100 Twitter pairs.
▶ Preprocessing
- Removing proper nouns from the dataset.
- Converting word endings to Kansai dialect (Kansai-ben) using rules.
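The slides do not list the actual conversion rules, so the sketch below is purely illustrative: it applies a small table of well-known standard-Japanese-to-Kansai sentence-ending correspondences (the rule table and function name are my assumptions, not the presenter's rule set):

```python
# Hypothetical word-ending rules (standard Japanese -> Kansai-ben).
# Longer patterns first, so the most specific ending wins.
KANSAI_RULES = [
    ("だよね", "やんな"),
    ("でしょう", "やろ"),
    ("だね", "やな"),
    ("だよ", "やで"),
    ("だ", "や"),
]

def to_kansai(utterance: str) -> str:
    """Rewrite the sentence ending using the first matching rule;
    leave the utterance unchanged if no rule matches."""
    for std, kansai in KANSAI_RULES:
        if utterance.endswith(std):
            return utterance[: -len(std)] + kansai
    return utterance

print(to_kansai("今日はいい天気だね"))  # -> 今日はいい天気やな ("Nice weather today, isn't it")
print(to_kansai("おはよう"))            # no rule matches -> unchanged
```

Ordering the rules from longest to shortest ending matters: otherwise the bare "だ" rule would fire before the more specific "だね" or "だよね" patterns.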
8. Method.3 (Considering context)
▶ OpenNMT (same as Method.1)
▶ Dataset
- Training data : 1.5M Twitter triple sets.
- Test data : 100 Twitter triple sets.
▶ Preprocessing
- Removing proper nouns from the dataset.
- Converting word endings to Kansai dialect (Kansai-ben) using rules.
▶ Context
- Learning with triples: two preceding turns as input, reply as output.
Input: 「晩ご飯どうする?」 & 「ハンバーグはどう?」
("What should we have for dinner?" & "How about hamburger steak?")
Output: 「昨日も食べたやん!カレーがええなぁ。」
("We had that yesterday! Curry sounds good." — in Kansai-ben)
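A sketch of how a dialogue triple (u1, u2, u3) can be turned into a (source, target) training pair for a seq2seq model: the two preceding turns are concatenated as the source. The slides do not state the concatenation format, so the "<sep>" separator token and the function name here are my assumptions:

```python
# Assumed separator token between context turns; the actual format
# used in the slides is not stated.
SEP = " <sep> "

def triple_to_pair(u1: str, u2: str, u3: str) -> tuple[str, str]:
    """Source = first two turns joined by SEP; target = third turn."""
    return (u1 + SEP + u2, u3)

src, tgt = triple_to_pair(
    "晩ご飯どうする?",                    # "What should we have for dinner?"
    "ハンバーグはどう?",                  # "How about hamburger steak?"
    "昨日も食べたやん!カレーがええなぁ。",  # "We had that yesterday! Curry sounds good."
)
print(src)
print(tgt)
```

Flattening the context into the source string this way lets a standard encoder-decoder (such as the OpenNMT models above) condition on two turns of history without any architectural change.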
9. Evaluation
Evaluation criteria

Human evaluations:
▶ Four criteria based on Grice’s maxims of conversation [Grice, 1975]
(1. Quality, 2. Quantity, 3. Relation, 4. Manner):
・Adaptability of dialogue
・Informativeness
・Completeness of utterance
・Consideration of context
▶ Grading on a 5-point scale
▶ Grading by 4 people

Automatic evaluations:
▶ Embedding Average
10. Result
Table 1. Human evaluations (5-point scale) and automatic evaluation

| Model  | Adaptability | Informative | Completeness | Context | Embedding Average |
|--------|--------------|-------------|--------------|---------|-------------------|
| Model1 | 3.045        | 2.185       | 3.195        | 2.45    | 0.51555           |
| Model2 | 2.94         | 2.05        | 2.97         | 2.285   | 0.52623           |
| Model3 | 3.11         | 3.18        | 2.92         | 2.58    | 0.93575           |

(Adaptability, Informative, Completeness, and Context are human evaluations;
Embedding Average is the automatic evaluation.)
・The increased input (two turns of context) may have contributed to the gains.
・Model3 scores best on Embedding Average.
11. Conclusion
▶ Created a generation-based dialogue system.
▶ Adaptability was low.
- Increase the amount of good-quality training data.
▶ The system yielded commonplace responses.
- Ideal: more diverse, interesting, and appropriate responses.
▶ Automatic evaluations that correlate highly with human judgment
are needed.