SlideShare ist ein Scribd-Unternehmen logo
1 von 46
Downloaden Sie, um offline zu lesen
DIALOG STRATEGY MANAGEMENT IN
MULTISKILL AI ASSISTANTS WITH
DEEPPAVLOV
Daniel Kornev, CPO & Dilyara Baymurzina, Researcher @ DeepPavlov.ai
I’M AVAILABLE TO CHAT DURING THIS SESSION
Click on “1:1 Chat,” then “Ask the Presenter/Moderator” button to submit your question.
After the session is over, connect with me via attendee chat by searching for my name.
Skills for consumer
AI Assistants
Simple chatbots
Skills for consumer
AI Assistants
?
Simple chatbots
Skills for consumer
AI Assistants
?
Simple chatbots
Skills for consumer
AI Assistants
DeepPavlov.ai
© Copyright PresentationGO.com
Pre-Purchase
Post-Purchase
Surveys
Promotions
Campaigns
Customer Service
Technical Support
Product Usage
Billing & Payment
Account Management
Logistics
▪ Customer experience spans
multiple domains
• Surveys
• Promotions
• Campaigns
• Customer Service
• Technical Support
• …
▪ Every domain requires specific
skill
Goal-oriented Bot
+
Chit-Chat Mode
+
Response Selector
Simple chatbots
Skills for consumer
AI Assistants
Vendor X
Simple chatbots
Skills for consumer
AI Assistants
Simple chatbots
Skills for consumer
AI Assistants
Simple chatbots
Skills for consumer
AI Assistants
Simple chatbots
Skills for consumer
AI Assistants
Multiskill AI Assistants:
DeepPavlov
Grand challenge: create a socialbot that can engage in a fun, high
quality conversation on popular societal topics for 20 minutes and
achieve an average rating of at least 4.0/5.0.
Alexa Prize 3 Winners:
1.Emora - $500K, 3.8/5.0, 7 min, 32 sec, Emora University
2.Chirpy Cardinal -- $100K, 3.17/5.0, Stanford University
3.Alquist -- 50k$, (2nd in ‘17, 3rd in ‘19, ‘20), Czech Technical University
DeepPavlov.ai
ngc.nvidia.com/catalog/containers/partners:deeppavlov
• Open repository of NLP models
and pipelines
• easy to find and reuse NLP
components for development of
new skills or extension of
existing
• Open repository of
conversational skills
• alternative implementations of
the most popular skills
• Open hub for AI Assistant
distributions
• general and domainindustry
specific distributions of skill sets
DeepPavlov.ai +
DeepPavlov.ai
MEET GERTY 3000
Can help Sam with problems on
the Moon Base Sarang?
Can entertain Sam? Yes ✔
Yes ✔
Can we emulate it with
DeepPavlov DREAM?
Yes ✔
Main Question
Functionality Analysis
© Copyright Sony Pictures Classics
Text OR
Voice
Input
TTS
(NeMo)
Spell
Checking
NeMo ASR Harvesters
Status
Chit-Chat
(AIML)
Emotion
BUILT-IN
SKILL
SELECTOR
RULE-BASED
RESPONSE
SELECTOR
DeepPavlov.ai
services:
agent:
[..]
depends_on:
- mongo
harvesters_maintenance_skill:
[..]
mongo:
[..]
rule_based_response_selector:
[..]
nemo:
[..]
depends_on:
- agent
emotion_classification:
[..]
program_y:
[..]
spell_checking:
[..]
Spell Checking
Annotators
Emotion Classification
Harvesters Status Skill
Skills
Chit-Chat Skill
NeMo ASR & TTS
Other Services
Rule-Based Response Selector
DeepPavlov.ai
services:
agent:
[..]
depends_on:
- mongo
harvesters_maintenance_skill:
[..]
mongo:
[..]
rule_based_response_selector:
[..]
nemo:
[..]
depends_on:
- agent
emotion_classification:
[..]
program_y:
[..]
clone_tts:
[..]
Annotators
Services Groups
Skills
Response Annotators
Depend on groups (e.g., “skills”)
Services Are Isolated
Limited in what they see in dialog
Invoke Agent’s State Manager
Can run via HTTP or be Python-based
"skills": {
"harvesters_maintenance_skill": {
"connector": {
"protocol": "http",
"url": "http://harvesters_maintenance
_skill:3002/respond"
},
"dialog_formatter": "dp_formatters:ful
l_dialog",
"response_formatter": "dp_formatters:b
ase_skill_formatter",
"state_manager_method": "add_hypothesi
s",
"previous_services": ["annotators"]
},
Response Selectors
DeepPavlov.ai
What is (all) harvesters’ status?
Intents
What is harvester status?
Prepare rover for a trip
domain.yml
intents:
- all_statuses_request
- status_request
[..]
- trip_request
responses:
utter_status_request:
- text: "The harvester {harv_id} is {harv_status}.“
[..]
nlu.md
## intent:all_statuses_request
- What is the harvesters status?
- What is the combines status?
[..]
stories.md
## harv_status + prepare_trip
* status_request
- utter_status_request
stories.md – training for dialogs
RASA Configs
nlu.md – training for intents & slots
domain.yml – basic ontology for skill
Simple and easy to use
DeepPavlov.ai
Works with GoBot
GoBotWrapper
Obtains data from DB
Generates NLG
stories.md – training for dialogs
Tutorial in Google Colab
nlu.md – training for intents & slots
domain.yml – basic ontology for skill
Full sample: use it to train your GoBot
and save its output to your Skill
GoBotWrapper
[..]
@app.route("/respond", methods=["POST"])
def respond():
[..]
dialogs = request.json["dialogs"]
for dialog in dialogs:
sentence = dialog['human_utterances'][-1]
['annotations'].get("spelling_preprocessing")
[..]
uttr_resp, conf = gobot(sentence)
response = gobot.getNlg(uttr_resp)
responses.append(response)
confidences.append(conf)
return jsonify(list(zip(responses, confidences)))
DeepPavlov.ai
<?xml version="1.0" encoding="UTF-8"?>
<aiml version="2.0">
<category>
<pattern>I AM ^ TIRED</pattern>
<template>
🙁
<random>
<li>Get some sleep<get name="name"/>.
You're very tired.</li>
<li>Have a rest and be happy! How can
I help you?</li>
</random>
</template>
</category>
[..]
</aiml>
</xml>
Assistant Profile (Name, Place, etc.)
Patterns
Greeting scenario
Topics
Looks up for patterns
Dialog Processing
Picks random pre-defined response
If not sure, confidence is low (0.2)
Returns response + confidence
DEMO TIME
Deepy as a Multiskill AI Assistant
DeepPavlov.ai +
> docker-compose up --build
> curl --location --request POST 'localhost:4242' --header 'Content-
Type: application/json' --data-raw '{"user_id": "name", "payload":
“what do I do here?"}'
DeepPavlov.ai
Dream AI Assistant Demo
• Own scenarios + Wikidata & ODQA
Amazon Alexa
• Own scenarios (w/ 3rd Party integrations) + Evi
+ 3rd Party Skills
Yandex Alice
• Own scenarios (w/ 3rd Party integrations) +
Yandex Search + 3rd Party Skills
DeepPavlov.ai
Step 1
Dialog Management is complex
Correctly identify user’s utterance’s goal
Step 2 Correctly generate response(s)
Step 3 Correctly pick the best response
Alice: what’s happened in city hall?
Bot: [Which city hall [Entity Disambiguation]? Where (NYC | Local)? When (Today | Yesterday | at some time)?]
Domain:News: Occupy City Hall happened in July 2020 | Confidence: 0.85
Domain:Factoid: City Hall is the seat of New York City government | Confidence: 0.95
Retrieval: By 1969 City Hall was described as badly crowded because of Bellevue growing … | Confidence: 0.75
Alice: What’s happened in city hall?
Bot: City Hall is the seat of New York City government
DeepPavlov.ai
At Skill Selector Level you may have more than skill to choose from
Inside each the Skill you may have more than one context to choose from:
• Story/Scenario (in goal-oriented systems)
• Scenario/Dialog Tree (in chat-based systems)
Challenge: How to make sure that the response we’ve
chosen addresses the user’s goal the best?
At Response Selector Level you may have more than one hypothesis to choose
from
DeepPavlov.ai
* DREAM technical report for the Alexa Prize 2019, Yuri Kuratov, et al. http://bit.ly/DP_Dream_TR_2020
Skill Selector
● Based on annotated human utterance and dialog state, in particular, topics and dialog
acts, picks up list of skills to try to generate response hypotheses
Skills
● Different types of skills: template-based, AIML, retrieval skills
● Each of selected skills may generate zero/one/several hypotheses
Response Selector
● Based on toxicity and blacklisted words annotations filters inappropriate hypotheses
● Based on dialog state, annotated hypotheses, their confidences and evaluation scores
picks up the best hypothesis
DeepPavlov.ai
Filtration
● Based on toxicity and blacklisted words annotations filters inappropriate hypotheses
Evaluation
● Calculates final single-value scores for hypotheses from confidences, conversation
evaluation scores
Hand-written Heuristics
● Gives priority to special cases, like high-priority intents, significantly increasing final score
Penalties
● Decreases final scores for repetitions
Prompts
● Adds link-to questions to final response if short with no requests reply from some
particular skills
* DREAM technical report for the Alexa Prize 2019, Yuri Kuratov, et al. http://bit.ly/DP_Dream_TR_2020
DeepPavlov.ai
● Final score depends on confidence which is assigned by hands/rules in template-
based skills
● Final score formula is empirically created
● No dependency on dialog acts
● Almost only single-turn dialog management
● Latency **
* Latency is partially solved with setting up timeout management and software optimization, but in
Conversational AI it ultimately requires powerful AI-optimized hardware like NVIDIA GPU clusters
DeepPavlov.ai
● Final score should not depend on confidence
● Final score should be calculated by one ranking model
● Some dialog acts require responses with particular dialog acts
● Priority to multi-turn scripted skills
* DREAM technical report for the Alexa Prize 2019, Yuri Kuratov, et al. http://bit.ly/DP_Dream_TR_2020
DeepPavlov.ai
There are at least 2 ways* look at a Conversation:
* S. Eggins & D. Slade, Analysing Casual Conversation.London: Cassell, 1997
Pragmatic Conversation
• Motivated by clear pragmatic purpose. Aka task-
oriented. Usually very short. Formal.
Casual Conversation
• NOT motivated by clear pragmatic purpose. Can
and often are lengthy. Informal, can have humor.
Aka chit-chat.
There are at least 4 different approaches to classify utterances & sentences:
Speech Acts*
• Work at utterance level. Hearer interprets
speaker’s intentions and tries to interpret
desired actions from hearer.
Dialog Acts
• Work at sentence level. Ascribe each sentence’s
dialog function to the entire utterance.
Speech Functions
• Work at utterance level. Similar to Speech Acts
but they produce utterance’s through its role in
Discourse.
Utterance Acts**
• Work at utterance level but
include body movements.
**Not applicable for us as we can’t see the person
*Original authors were not concerned with
Discourse
Open.Initiate
Sustain.Continue
React.Respond.Support
Sustain.Continue
Open.Initiate
React.Respond.Support
React.Rejoinder.Support
React.Respond.Support
Give.Fact
Append.Elaborate
Reply.Acknowledge
Prolong.Elaborate
Demand.Closed.Opinion
Reply.Affirm
Track.Confirm
Reply.Affirm
Discourse Move Speech Act
DeepPavlov.ai
Eggins and Martin
(1997)
Casual Conversation is about people, not facts
Discourse Strategy Advice:
In a conversation w/ user:
Explore their interpersonal relations through
confronting moves
Eggins and Slade
(1997)
Speech Functions control Discourse:
Give
information
Demand
information
Speech Acts
Discourse Moves
Speech Function Example:
open:initiate:give_opinion
Text OR
Voice
Input
TTS
(NeMo)
Spell
Checking
NeMo ASR Harvesters
Status
Chit-Chat
(AIML)
Emotion
BUILT-IN
SKILL
SELECTOR
RULE-BASED
RESPONSE
SELECTOR
Speech Function Classifier
Discourse Management
Speech Function Predictor
Speech
Function
Classifier
Discourse Mgmt
Speech
Function
Predictor
DeepPavlov.ai
Step 1
Part I: Use Speech Function to understand user’s goal
Classify user utterance’s Speech Function
Step 2 Predict the Speech Function for the best response
Step 3 Identify whether you understand user’s goal
Alice: what’s happened in city hall?
Speech Function: React.Rejoinder.Support.Track.Clarify
Speech Function Predictor: React.Rejoinder.Support.Track.Clarify | Confidence: 0.85
Speech Function Predictor: React.Respond.Support.Reply.Answer | Confidence: 0.79
Speech Function Predictor: React.Rejoinder.Confront.Response.Rechallenge | Confidence: 0.75
Speech Function Predictor: Predicts that it is a good idea to React.Rejoinder.Support.Track.Clarify
Bot: [Which city hall [Entity Disambiguation]? Where (NYC | Local)? When (Today | Yesterday | at some time)?
DeepPavlov.ai
Step 4 In each Skill, generate relevant Speech Function response
Step 5 For each hypothesis, predict Speech Function for user response
Domain:Factoid: Which City Hall? | React.Rejoinder.Support.Track.Clarify | Confidence: 0.85
Domain:News: When? | React.Rejoinder.Support.Track.Clarify | Confidence: 0.95
…
Speech Function Predictor: React.Rejoinder.Support.Track.Clarify | Confidence: 0.83
Speech Function Predictor: React.Respond.Support.Reply.Answer | Confidence: 0.73
Speech Function Predictor: React.Resoind.Confront.Reply.Disawow | Confidence: 0.65
Domain:Factoid:
Which City Hall?
Speech Function Predictor: React.Rejoinder.Support.Track.Clarify | Confidence: 0.81
Speech Function Predictor: React.Respond.Support.Reply.Answer | Confidence: 0.75
Speech Function Predictor: React.Resoind.Confront.Reply.Disawow | Confidence: 0.64
Domain:News:
When?
Part II: Use Speech Function Predictor to predict user’s response
DeepPavlov.ai
Step 6
Part III: Use Speech Function to understand user’s goal
In Response Selector, ignore irrelevant hypotheses
Step 7 In Response Selector, identify what Conversation path you’re in
Step 8 In Response Selector, give greenlight to hypothesis that is best
for the recognized Conversation type
Domain:Factoid: Which City Hall? | React.Rejoinder.Support.Track.Clarify | Confidence: 0.85
Domain:News: When? | React.Rejoinder.Support.Track.Clarify | Confidence: 0.95
User’s Utterance: React.Rejoinder.Support.Track.Clarify | Conversation Type: Casual
Domain:Factoid: Which City Hall? | React.Rejoinder.Support.Track.Clarify | Confidence: 0.85
Domain:News: When? | React.Rejoinder.Support.Track.Clarify | Confidence: 0.95
C. Type: Pragmatic
C. Type: Casual
DEMO TIME
Sneak Peek at Discourse-Driven Dialog Management in DeepPavlov Deepy
Multiskill
orchestration
Conversa-
tionalskills
NLP
frameworks
ML platforms
Proprietary Open Source
▪ Multiskill orchestration
• DeepPavlov Agent is an engine for
conversational skill deployment and
orchestration
▪ Conversational skills
• DeepPavlov Dream is a collection of pre-
build conversational skills and a default
distribution package for Dream AI
Assistant
▪ NLP frameworks
• DeepPavlov Library provides pretrained
models and simple declarative approach
to build NLP processing pipelines
▪ ML platforms
• TensorFlow and PyTorch as backends
demo.deeppavlov.ai select [Deepy]
Web Demo
@deeppavlov_dream_ai_bot
TG Bot
github.com/deepmipt/deepy
Build Your AI Assistant:
Clone and build your own!
medium.com/deeppavlov
Read us:
forum.deeppavlov.ai
Talk to us: @DeepPavlovDreamDiscussio
ns
TG:
@DeepPavlov
Twitter/TG:
DeepPavlov.ai +

Weitere ähnliche Inhalte

Ähnlich wie Managing Dialog Strategy In Multiskill AI Assistant.pdf

Digital Assistant embedded with Banking Process
Digital Assistant embedded with Banking ProcessDigital Assistant embedded with Banking Process
Digital Assistant embedded with Banking ProcessTamalDatta4
 
Ignify_RubyMehto_5year
Ignify_RubyMehto_5yearIgnify_RubyMehto_5year
Ignify_RubyMehto_5yearruby mehto
 
Multiskill Conversational AI
Multiskill Conversational AIMultiskill Conversational AI
Multiskill Conversational AIDaniel Kornev
 
Intelligent Decision Dimensions SAP certified Add-on
Intelligent Decision Dimensions SAP certified Add-onIntelligent Decision Dimensions SAP certified Add-on
Intelligent Decision Dimensions SAP certified Add-onTatiana Kojar
 
Using Chatbots in Extension Programming
Using Chatbots in Extension ProgrammingUsing Chatbots in Extension Programming
Using Chatbots in Extension ProgrammingAmy Cole
 
SAP Conversational AI as a Service managed by Skybuffer (SAP chatbot)
SAP Conversational AI as a Service managed by Skybuffer (SAP chatbot)SAP Conversational AI as a Service managed by Skybuffer (SAP chatbot)
SAP Conversational AI as a Service managed by Skybuffer (SAP chatbot)Tatiana Kojar
 
May MarTech Madness - London MUG May 18th
May MarTech Madness - London MUG May 18thMay MarTech Madness - London MUG May 18th
May MarTech Madness - London MUG May 18thAdele Miller
 
'FAST' Ai based Contact Center
'FAST' Ai based Contact Center'FAST' Ai based Contact Center
'FAST' Ai based Contact CenterTaejoon Yoo
 
Building products people actually can use – why all developers need to unders...
Building products people actually can use – why all developers need to unders...Building products people actually can use – why all developers need to unders...
Building products people actually can use – why all developers need to unders...Cyber-Duck
 
Regatta VR Customer Service Case Study
Regatta VR Customer Service Case StudyRegatta VR Customer Service Case Study
Regatta VR Customer Service Case StudyWilliam West
 
lijo_resume_singapore
lijo_resume_singaporelijo_resume_singapore
lijo_resume_singaporeLijo George
 
Intelligent Decision Dimensions for Banking in English and Azerbaijani
Intelligent Decision Dimensions for Banking in English and AzerbaijaniIntelligent Decision Dimensions for Banking in English and Azerbaijani
Intelligent Decision Dimensions for Banking in English and AzerbaijaniTatiana Kojar
 
deloitte-nl-chatbots-moving-beyond-the-hype.pdf
deloitte-nl-chatbots-moving-beyond-the-hype.pdfdeloitte-nl-chatbots-moving-beyond-the-hype.pdf
deloitte-nl-chatbots-moving-beyond-the-hype.pdfSandeep Ram
 
London MUG | May MarTech Madness
London MUG | May MarTech MadnessLondon MUG | May MarTech Madness
London MUG | May MarTech MadnessEmilyPoulton2
 
Watson assistant chatbot webinar
Watson assistant chatbot webinarWatson assistant chatbot webinar
Watson assistant chatbot webinarAnam Mahmood
 
deloitte-nl-chatbots-moving-beyond-the-hype.pdf
deloitte-nl-chatbots-moving-beyond-the-hype.pdfdeloitte-nl-chatbots-moving-beyond-the-hype.pdf
deloitte-nl-chatbots-moving-beyond-the-hype.pdfShujatAli47
 

Ähnlich wie Managing Dialog Strategy In Multiskill AI Assistant.pdf (20)

Digital Assistant embedded with Banking Process
Digital Assistant embedded with Banking ProcessDigital Assistant embedded with Banking Process
Digital Assistant embedded with Banking Process
 
SRE_Chatbot_workflow.pptx
SRE_Chatbot_workflow.pptxSRE_Chatbot_workflow.pptx
SRE_Chatbot_workflow.pptx
 
Ignify_RubyMehto_5year
Ignify_RubyMehto_5yearIgnify_RubyMehto_5year
Ignify_RubyMehto_5year
 
Multiskill Conversational AI
Multiskill Conversational AIMultiskill Conversational AI
Multiskill Conversational AI
 
Intelligent Decision Dimensions SAP certified Add-on
Intelligent Decision Dimensions SAP certified Add-onIntelligent Decision Dimensions SAP certified Add-on
Intelligent Decision Dimensions SAP certified Add-on
 
Using Chatbots in Extension Programming
Using Chatbots in Extension ProgrammingUsing Chatbots in Extension Programming
Using Chatbots in Extension Programming
 
SAP Conversational AI as a Service managed by Skybuffer (SAP chatbot)
SAP Conversational AI as a Service managed by Skybuffer (SAP chatbot)SAP Conversational AI as a Service managed by Skybuffer (SAP chatbot)
SAP Conversational AI as a Service managed by Skybuffer (SAP chatbot)
 
May MarTech Madness - London MUG May 18th
May MarTech Madness - London MUG May 18thMay MarTech Madness - London MUG May 18th
May MarTech Madness - London MUG May 18th
 
Redefining the relationship with clients
Redefining the relationship with clientsRedefining the relationship with clients
Redefining the relationship with clients
 
Mobile Strategy 101
Mobile Strategy 101Mobile Strategy 101
Mobile Strategy 101
 
'FAST' Ai based Contact Center
'FAST' Ai based Contact Center'FAST' Ai based Contact Center
'FAST' Ai based Contact Center
 
Building products people actually can use – why all developers need to unders...
Building products people actually can use – why all developers need to unders...Building products people actually can use – why all developers need to unders...
Building products people actually can use – why all developers need to unders...
 
Regatta VR Customer Service Case Study
Regatta VR Customer Service Case StudyRegatta VR Customer Service Case Study
Regatta VR Customer Service Case Study
 
lijo_resume_singapore
lijo_resume_singaporelijo_resume_singapore
lijo_resume_singapore
 
Intelligent Decision Dimensions for Banking in English and Azerbaijani
Intelligent Decision Dimensions for Banking in English and AzerbaijaniIntelligent Decision Dimensions for Banking in English and Azerbaijani
Intelligent Decision Dimensions for Banking in English and Azerbaijani
 
deloitte-nl-chatbots-moving-beyond-the-hype.pdf
deloitte-nl-chatbots-moving-beyond-the-hype.pdfdeloitte-nl-chatbots-moving-beyond-the-hype.pdf
deloitte-nl-chatbots-moving-beyond-the-hype.pdf
 
London MUG | May MarTech Madness
London MUG | May MarTech MadnessLondon MUG | May MarTech Madness
London MUG | May MarTech Madness
 
Watson assistant chatbot webinar
Watson assistant chatbot webinarWatson assistant chatbot webinar
Watson assistant chatbot webinar
 
deloitte-nl-chatbots-moving-beyond-the-hype.pdf
deloitte-nl-chatbots-moving-beyond-the-hype.pdfdeloitte-nl-chatbots-moving-beyond-the-hype.pdf
deloitte-nl-chatbots-moving-beyond-the-hype.pdf
 
Kalpesh jain
Kalpesh jain Kalpesh jain
Kalpesh jain
 

Mehr von Daniel Kornev

Building AI Assistants with DeepPavlov - truly final.pdf
Building AI Assistants with DeepPavlov - truly final.pdfBuilding AI Assistants with DeepPavlov - truly final.pdf
Building AI Assistants with DeepPavlov - truly final.pdfDaniel Kornev
 
Multimodality at Dialogue 2022 by DeepPavlov.pdf
Multimodality at Dialogue 2022 by DeepPavlov.pdfMultimodality at Dialogue 2022 by DeepPavlov.pdf
Multimodality at Dialogue 2022 by DeepPavlov.pdfDaniel Kornev
 
Daniel Kornev's Slides for Working in Digital Media and Tech Services event
Daniel Kornev's Slides for Working in Digital Media and Tech Services eventDaniel Kornev's Slides for Working in Digital Media and Tech Services event
Daniel Kornev's Slides for Working in Digital Media and Tech Services eventDaniel Kornev
 
Functional Iliteracy
Functional IliteracyFunctional Iliteracy
Functional IliteracyDaniel Kornev
 
Digital Work Environments - History and What's Next after Siri and Cortana?
Digital Work Environments - History and What's Next after Siri and Cortana?Digital Work Environments - History and What's Next after Siri and Cortana?
Digital Work Environments - History and What's Next after Siri and Cortana?Daniel Kornev
 
Cortana - The Internals
Cortana - The InternalsCortana - The Internals
Cortana - The InternalsDaniel Kornev
 
Palantir, Quid, RecordedFuture: Augmented Intelligence Frontier
Palantir, Quid, RecordedFuture: Augmented Intelligence FrontierPalantir, Quid, RecordedFuture: Augmented Intelligence Frontier
Palantir, Quid, RecordedFuture: Augmented Intelligence FrontierDaniel Kornev
 
Augmented Intelligence 2.0
Augmented Intelligence 2.0Augmented Intelligence 2.0
Augmented Intelligence 2.0Daniel Kornev
 
Developer Relations, Google Russia - VC & Startups Outreach Program
Developer Relations, Google Russia - VC & Startups Outreach ProgramDeveloper Relations, Google Russia - VC & Startups Outreach Program
Developer Relations, Google Russia - VC & Startups Outreach ProgramDaniel Kornev
 
Project Universe – Context-aware Project Management System
Project Universe – Context-aware Project Management SystemProject Universe – Context-aware Project Management System
Project Universe – Context-aware Project Management SystemDaniel Kornev
 
Brave New World of Computer Science - Part I
Brave New World of Computer Science - Part IBrave New World of Computer Science - Part I
Brave New World of Computer Science - Part IDaniel Kornev
 
Brave New World of Computer Science - Part II
Brave New World of Computer Science - Part IIBrave New World of Computer Science - Part II
Brave New World of Computer Science - Part IIDaniel Kornev
 
Ubiquitous Computing
Ubiquitous ComputingUbiquitous Computing
Ubiquitous ComputingDaniel Kornev
 

Mehr von Daniel Kornev (14)

Building AI Assistants with DeepPavlov - truly final.pdf
Building AI Assistants with DeepPavlov - truly final.pdfBuilding AI Assistants with DeepPavlov - truly final.pdf
Building AI Assistants with DeepPavlov - truly final.pdf
 
Multimodality at Dialogue 2022 by DeepPavlov.pdf
Multimodality at Dialogue 2022 by DeepPavlov.pdfMultimodality at Dialogue 2022 by DeepPavlov.pdf
Multimodality at Dialogue 2022 by DeepPavlov.pdf
 
Daniel Kornev's Slides for Working in Digital Media and Tech Services event
Daniel Kornev's Slides for Working in Digital Media and Tech Services eventDaniel Kornev's Slides for Working in Digital Media and Tech Services event
Daniel Kornev's Slides for Working in Digital Media and Tech Services event
 
Functional Iliteracy
Functional IliteracyFunctional Iliteracy
Functional Iliteracy
 
Digital Work Environments - History and What's Next after Siri and Cortana?
Digital Work Environments - History and What's Next after Siri and Cortana?Digital Work Environments - History and What's Next after Siri and Cortana?
Digital Work Environments - History and What's Next after Siri and Cortana?
 
Cortana - The Internals
Cortana - The InternalsCortana - The Internals
Cortana - The Internals
 
Palantir, Quid, RecordedFuture: Augmented Intelligence Frontier
Palantir, Quid, RecordedFuture: Augmented Intelligence FrontierPalantir, Quid, RecordedFuture: Augmented Intelligence Frontier
Palantir, Quid, RecordedFuture: Augmented Intelligence Frontier
 
Augmented Intelligence 2.0
Augmented Intelligence 2.0Augmented Intelligence 2.0
Augmented Intelligence 2.0
 
Developer Relations, Google Russia - VC & Startups Outreach Program
Developer Relations, Google Russia - VC & Startups Outreach ProgramDeveloper Relations, Google Russia - VC & Startups Outreach Program
Developer Relations, Google Russia - VC & Startups Outreach Program
 
Project Universe – Context-aware Project Management System
Project Universe – Context-aware Project Management SystemProject Universe – Context-aware Project Management System
Project Universe – Context-aware Project Management System
 
Brave New World of Computer Science - Part I
Brave New World of Computer Science - Part IBrave New World of Computer Science - Part I
Brave New World of Computer Science - Part I
 
Brave New World of Computer Science - Part II
Brave New World of Computer Science - Part IIBrave New World of Computer Science - Part II
Brave New World of Computer Science - Part II
 
Context In UX
Context In UXContext In UX
Context In UX
 
Ubiquitous Computing
Ubiquitous ComputingUbiquitous Computing
Ubiquitous Computing
 

Kürzlich hochgeladen

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 

Kürzlich hochgeladen (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

Managing Dialog Strategy In Multiskill AI Assistant.pdf

  • 1. DIALOG STRATEGY MANAGEMENT IN MULTISKILL AI ASSISTANTS WITH DEEPPAVLOV Daniel Kornev, CPO & Dilyara Baymurzina, Researcher @ DeepPavlov.ai
  • 2. I’M AVAILABLE TO CHAT DURING THIS SESSION Click on “1:1 Chat,” then “Ask the Presenter/Moderator” button to submit your question. After the session is over, connect with me via attendee chat by searching for my name.
  • 3.
  • 5. Simple chatbots Skills for consumer AI Assistants
  • 6. ? Simple chatbots Skills for consumer AI Assistants
  • 7. ? Simple chatbots Skills for consumer AI Assistants
  • 8. DeepPavlov.ai © Copyright PresentationGO.com Pre-Purchase Post-Purchase Surveys Promotions Campaigns Customer Service Technical Support Product Usage Billing & Payment Account Management Logistics ▪ Customer experience spans multiple domains • Surveys • Promotions • Campaigns • Customer Service • Technical Support • … ▪ Every domain requires specific skill
  • 9. Goal-oriented Bot + Chit-Chat Mode + Response Selector Simple chatbots Skills for consumer AI Assistants Vendor X
  • 10. Simple chatbots Skills for consumer AI Assistants
  • 11. Simple chatbots Skills for consumer AI Assistants
  • 12. Simple chatbots Skills for consumer AI Assistants
  • 13. Simple chatbots Skills for consumer AI Assistants Multiskill AI Assistants: DeepPavlov
  • 14. Grand challenge: create a socialbot that can engage in a fun, high quality conversation on popular societal topics for 20 minutes and achieve an average rating of at least 4.0/5.0. Alexa Prize 3 Winners: 1.Emora - $500K, 3.8/5.0, 7 min, 32 sec, Emora University 2.Chirpy Cardinal -- $100K, 3.17/5.0, Stanford University 3.Alquist -- 50k$, (2nd in ‘17, 3rd in ‘19, ‘20), Czech Technical University
  • 15.
  • 17. • Open repository of NLP models and pipelines • easy to find and reuse NLP components for development of new skills or extension of existing • Open repository of conversational skills • alternative implementations of the most popular skills • Open hub for AI Assistant distributions • general and domainindustry specific distributions of skill sets DeepPavlov.ai +
  • 18. DeepPavlov.ai MEET GERTY 3000 Can help Sam with problems on the Moon Base Sarang? Can entertain Sam? Yes ✔ Yes ✔ Can we emulate it with DeepPavlov DREAM? Yes ✔ Main Question Functionality Analysis © Copyright Sony Pictures Classics
  • 19. Text OR Voice Input TTS (NeMo) Spell Checking NeMo ASR Harvesters Status Chit-Chat (AIML) Emotion BUILT-IN SKILL SELECTOR RULE-BASED RESPONSE SELECTOR
  • 21. DeepPavlov.ai services: agent: [..] depends_on: - mongo harvesters_maintenance_skill: [..] mongo: [..] rule_based_response_selector: [..] nemo: [..] depends_on: - agent emotion_classification: [..] program_y: [..] clone_tts: [..] Annotators Services Groups Skills Response Annotators Depend on groups (e.g., “skills”) Services Are Isolated Limited in what they see in dialog Invoke Agent’s State Manager Can run via HTTP or be Python-based "skills": { "harvesters_maintenance_skill": { "connector": { "protocol": "http", "url": "http://harvesters_maintenance _skill:3002/respond" }, "dialog_formatter": "dp_formatters:ful l_dialog", "response_formatter": "dp_formatters:b ase_skill_formatter", "state_manager_method": "add_hypothesi s", "previous_services": ["annotators"] }, Response Selectors
  • 22. DeepPavlov.ai What is (all) harvesters’ status? Intents What is harvester status? Prepare rover for a trip domain.yml intents: - all_statuses_request - status_request [..] - trip_request responses: utter_status_request: - text: "The harvester {harv_id} is {harv_status}.“ [..] nlu.md ## intent:all_statuses_request - What is the harvesters status? - What is the combines status? [..] stories.md ## harv_status + prepare_trip * status_request - utter_status_request stories.md – training for dialogs RASA Configs nlu.md – training for intents & slots domain.yml – basic ontology for skill Simple and easy to use
  • 23. DeepPavlov.ai Works with GoBot GoBotWrapper Obtains data from DB Generates NLG stories.md – training for dialogs Tutorial in Google Colab nlu.md – training for intents & slots domain.yml – basic ontology for skill Full sample: use it to train your GoBot and save its output to your Skill GoBotWrapper [..] @app.route("/respond", methods=["POST"]) def respond(): [..] dialogs = request.json["dialogs"] for dialog in dialogs: sentence = dialog['human_utterances'][-1] ['annotations'].get("spelling_preprocessing") [..] uttr_resp, conf = gobot(sentence) response = gobot.getNlg(uttr_resp) responses.append(response) confidences.append(conf) return jsonify(list(zip(responses, confidences)))
  • 24. DeepPavlov.ai <?xml version="1.0" encoding="UTF-8"?> <aiml version="2.0"> <category> <pattern>I AM ^ TIRED</pattern> <template> 🙁 <random> <li>Get some sleep<get name="name"/>. You're very tired.</li> <li>Have a rest and be happy! How can I help you?</li> </random> </template> </category> [..] </aiml> </xml> Assistant Profile (Name, Place, etc.) Patterns Greeting scenario Topics Looks up for patterns Dialog Processing Picks random pre-defined response If not sure, confidence is low (0.2) Returns response + confidence
  • 25. DEMO TIME Deepy as a Multiskill AI Assistant DeepPavlov.ai + > docker-compose up --build > curl --location --request POST 'localhost:4242' --header 'Content- Type: application/json' --data-raw '{"user_id": "name", "payload": “what do I do here?"}'
  • 26.
  • 27. DeepPavlov.ai Dream AI Assistant Demo • Own scenarios + Wikidata & ODQA Amazon Alexa • Own scenarios (w/ 3rd Party integrations) + Evi + 3rd Party Skills Yandex Alice • Own scenarios (w/ 3rd Party integrations) + Yandex Search + 3rd Party Skills
  • 28. DeepPavlov.ai Step 1 Dialog Management is complex Correctly identify user’s utterance’s goal Step 2 Correctly generate response(s) Step 3 Correctly pick the best response Alice: what’s happened in city hall? Bot: [Which city hall [Entity Disambiguation]? Where (NYC | Local)? When (Today | Yesterday | at some time)?] Domain:News: Occupy City Hall happened in July 2020 | Confidence: 0.85 Domain:Factoid: City Hall is the seat of New York City government | Confidence: 0.95 Retrieval: By 1969 City Hall was described as badly crowded because of Bellevue growing … | Confidence: 0.75 Alice: What’s happened in city hall? Bot: City Hall is the seat of New York City government
  • 29. DeepPavlov.ai At Skill Selector Level you may have more than skill to choose from Inside each the Skill you may have more than one context to choose from: • Story/Scenario (in goal-oriented systems) • Scenario/Dialog Tree (in chat-based systems) Challenge: How to make sure that the response we’ve chosen addresses the user’s goal the best? At Response Selector Level you may have more than one hypothesis to choose from
  • 30.
  • 31. DeepPavlov.ai * DREAM technical report for the Alexa Prize 2019, Yuri Kuratov, et al. http://bit.ly/DP_Dream_TR_2020 Skill Selector ● Based on annotated human utterance and dialog state, in particular, topics and dialog acts, picks up list of skills to try to generate response hypotheses Skills ● Different types of skills: template-based, AIML, retrieval skills ● Each of selected skills may generate zero/one/several hypotheses Response Selector ● Based on toxicity and blacklisted words annotations filters inappropriate hypotheses ● Based on dialog state, annotated hypotheses, their confidences and evaluation scores picks up the best hypothesis
  • 32. DeepPavlov.ai Filtration ● Based on toxicity and blacklisted words annotations filters inappropriate hypotheses Evaluation ● Calculates final single-value scores for hypotheses from confidences, conversation evaluation scores Hand-written Heuristics ● Gives priority to special cases, like high-priority intents, significantly increasing final score Penalties ● Decreases final scores for repetitions Prompts ● Adds link-to questions to final response if short with no requests reply from some particular skills * DREAM technical report for the Alexa Prize 2019, Yuri Kuratov, et al. http://bit.ly/DP_Dream_TR_2020
  • 33. DeepPavlov.ai ● Final score depends on confidence which is assigned by hands/rules in template- based skills ● Final score formula is empirically created ● No dependency on dialog acts ● Almost only single-turn dialog management ● Latency ** * Latency is partially solved with setting up timeout management and software optimization, but in Conversational AI it ultimately requires powerful AI-optimized hardware like NVIDIA GPU clusters
  • 34. DeepPavlov.ai ● Final score should not depend on confidence ● Final score should be calculated by one ranking model ● Some dialog acts require responses with particular dialog acts ● Priority to multi-turn scripted skills * DREAM technical report for the Alexa Prize 2019, Yuri Kuratov, et al. http://bit.ly/DP_Dream_TR_2020
  • 35. DeepPavlov.ai There are at least 2 ways* look at a Conversation: * S. Eggins & D. Slade, Analysing Casual Conversation.London: Cassell, 1997 Pragmatic Conversation • Motivated by clear pragmatic purpose. Aka task- oriented. Usually very short. Formal. Casual Conversation • NOT motivated by clear pragmatic purpose. Can and often are lengthy. Informal, can have humor. Aka chit-chat.
  • 36. There are at least 4 different approaches to classify utterances & sentences: Speech Acts* • Work at utterance level. Hearer interprets speaker’s intentions and tries to interpret desired actions from hearer. Dialog Acts • Work at sentence level. Ascribe each sentence’s dialog function to the entire utterance. Speech Functions • Work at utterance level. Similar to Speech Acts but they produce utterance’s through its role in Discourse. Utterance Acts** • Work at utterance level but include body movements. **Not applicable for us as we can’t see the person *Original authors were not concerned with Discourse
  • 38. DeepPavlov.ai Eggins and Martin (1997) Casual Conversation is about people, not facts Discourse Strategy Advice: In a conversation w/ user: Explore their interpersonal relations through confronting moves
  • 39. Eggins and Slade (1997) Speech Functions control Discourse: Give information Demand information Speech Acts Discourse Moves Speech Function Example: open:initiate:give_opinion
  • 40. Text OR Voice Input TTS (NeMo) Spell Checking NeMo ASR Harvesters Status Chit-Chat (AIML) Emotion BUILT-IN SKILL SELECTOR RULE-BASED RESPONSE SELECTOR Speech Function Classifier Discourse Management Speech Function Predictor Speech Function Classifier Discourse Mgmt Speech Function Predictor
  • 41. DeepPavlov.ai Step 1 Part I: Use Speech Function to understand user’s goal Classify user utterance’s Speech Function Step 2 Predict the Speech Function for the best response Step 3 Identify whether you understand user’s goal Alice: what’s happened in city hall? Speech Function: React.Rejoinder.Support.Track.Clarify Speech Function Predictor: React.Rejoinder.Support.Track.Clarify | Confidence: 0.85 Speech Function Predictor: React.Respond.Support.Reply.Answer | Confidence: 0.79 Speech Function Predictor: React.Rejoinder.Confront.Response.Rechallenge | Confidence: 0.75 Speech Function Predictor: Predicts that it is a good idea to React.Rejoinder.Support.Track.Clarify Bot: [Which city hall [Entity Disambiguation]? Where (NYC | Local)? When (Today | Yesterday | at some time)?
  • 42. DeepPavlov.ai Step 4 In each Skill, generate relevant Speech Function response Step 5 For each hypothesis, predict Speech Function for user response Domain:Factoid: Which City Hall? | React.Rejoinder.Support.Track.Clarify | Confidence: 0.85 Domain:News: When? | React.Rejoinder.Support.Track.Clarify | Confidence: 0.95 … Speech Function Predictor: React.Rejoinder.Support.Track.Clarify | Confidence: 0.83 Speech Function Predictor: React.Respond.Support.Reply.Answer | Confidence: 0.73 Speech Function Predictor: React.Resoind.Confront.Reply.Disawow | Confidence: 0.65 Domain:Factoid: Which City Hall? Speech Function Predictor: React.Rejoinder.Support.Track.Clarify | Confidence: 0.81 Speech Function Predictor: React.Respond.Support.Reply.Answer | Confidence: 0.75 Speech Function Predictor: React.Resoind.Confront.Reply.Disawow | Confidence: 0.64 Domain:News: When? Part II: Use Speech Function Predictor to predict user’s response
  • 43. DeepPavlov.ai Step 6 Part III: Use Speech Function to understand user’s goal In Response Selector, ignore irrelevant hypotheses Step 7 In Response Selector, identify what Conversation path you’re in Step 8 In Response Selector, give greenlight to hypothesis that is best for the recognized Conversation type Domain:Factoid: Which City Hall? | React.Rejoinder.Support.Track.Clarify | Confidence: 0.85 Domain:News: When? | React.Rejoinder.Support.Track.Clarify | Confidence: 0.95 User’s Utterance: React.Rejoinder.Support.Track.Clarify | Conversation Type: Casual Domain:Factoid: Which City Hall? | React.Rejoinder.Support.Track.Clarify | Confidence: 0.85 Domain:News: When? | React.Rejoinder.Support.Track.Clarify | Confidence: 0.95 C. Type: Pragmatic C. Type: Casual
  • 44. DEMO TIME Sneak Peek at Discourse-Driven Dialog Management in DeepPavlov Deepy
  • 45. Multiskill orchestration Conversa- tionalskills NLP frameworks ML platforms Proprietary Open Source ▪ Multiskill orchestration • DeepPavlov Agent is an engine for conversational skill deployment and orchestration ▪ Conversational skills • DeepPavlov Dream is a collection of pre- build conversational skills and a default distribution package for Dream AI Assistant ▪ NLP frameworks • DeepPavlov Library provides pretrained models and simple declarative approach to build NLP processing pipelines ▪ ML platforms • TensorFlow and PyTorch as backends
  • 46. demo.deeppavlov.ai select [Deepy] Web Demo @deeppavlov_dream_ai_bot TG Bot github.com/deepmipt/deepy Build Your AI Assistant: Clone and build your own! medium.com/deeppavlov Read us: forum.deeppavlov.ai Talk to us: @DeepPavlovDreamDiscussio ns TG: @DeepPavlov Twitter/TG: DeepPavlov.ai +