Human-AI communication for human-human communication:
Applying interpretable unsupervised anomaly detection to executive coaching
CHAI Workshop @ IJCAI '22
July 24, 2022
Riku Arakawa† (Carnegie Mellon University, USA)
Hiromu Yakura† (University of Tsukuba, Japan)
† equal contribution

Background: Deep-learning-based human behavior analysis
Advancement in human behavior analysis techniques:
・Facial expression recognition [1]
・Posture estimation [2]
It is expected that we can analyze and support human communication by applying these techniques.
[1] I. Çugu, et al., 2017. MicroExpNet: An Extremely Small and Fast Model For Expression Recognition From Frontal Face Images. arXiv.
[2] S.-E. Wei, et al., 2016. Convolutional Pose Machines. IEEE CVPR.

Background: A tool for helping public speaking with feedback
Speech-feature-based feedback [3]
Show feedback such as “louder” and “faster” on a Google Glass based on speech speed or volume.
Posture-based feedback [4]
Alert a speaker when they cross their arms for a long time, based on posture estimation.
[3] M. I. Tanveer, et al., 2015. A Real-Time In-Situ Intelligent Interface to Help People With Public Speaking. ACM IUI.
[4] I. Damian, et al., 2015. Measuring the impact of multimodal behavioural feedback loops on social interactions. ACM ICMI.

Our perspective: Limitation of heuristic approach
Human-to-human communication is very contextual:
Defensive attitude [5] or deep thinking [6]
Thus, we need a new framework of human-AI communication:
・Heuristic approach
・Supervised approach w/ training data of numerous classes
・Unsupervised approach w/o rules or training data
[5] J. Navarro and M. Karlins, 2008. What Every BODY Is Saying: An Ex-FBI Agent’s Guide to Speed Reading People. HarperCollins, New York.
[6] R. Friedman and A. J. Elliot, 2008. The effect of arm crossing on persistence and performance. Europ. J. Soc. Psych.

Research object: Executive coaching
• It consists of one-on-one conversations, in which coaches are required to observe the nonverbal behavior of coachees [7].
• The importance of observing nonverbal behavior is emphasized in terms of reading the nuance of what the coachee said [8].
We hypothesized that AI can help novice coaches in the observation process.
However, notifying coaches of the detection of specific postures (e.g., crossed arms) or emotions (e.g., confusion) without context was not appreciated.
[7] E. Cox, et al., 2009. The Complete Handbook of Coaching. SAGE Publications, Los Angeles.
[8] D. B. Drake, 2009. Narrative coaching. In The Complete Handbook of Coaching. SAGE Publications, Los Angeles.

Key idea: Separating observation and judgment
Coaches ignored the outputs once the outputs contradicted their observation or intuition.
They found it difficult to rely on outputs based on simplified classes that are indifferent to subtle context.
This guided us to reframe the way of human-AI communication:
Humans
Pros: Good at understanding context
Cons: Difficult to keep a stable perspective due to their skills or mental load
AIs
Pros: Stable performance
Cons: Not good at dealing with context
Separation of observation and judgment would be an alternative way of human-AI communication.

REsCUE: Real-time feedback using anomaly detection
We developed a supporting system that observes the nonverbal behavior of coachees using unsupervised anomaly detection.
It detects informative cues of the behavior and notifies the coach in real-time.
Detailed workflow
1. Extract posture and gaze information of the coachee.
2. Calculate outlierness score using anomaly detection algorithm.
3. Notify the coach in real-time with an interpretive visualization.

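The three workflow steps can be sketched as a simple streaming loop. This is only an illustration of the extract, score, notify structure, not the REsCUE code: the names `run_feedback_loop`, `Cue`, and `RunningMeanScorer` are hypothetical, the scorer is a deliberately simple stand-in (distance from an exponential moving average) rather than the GMM-based detector, and the fixed notification threshold is an assumption the slides do not state.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, Sequence


@dataclass
class Cue:
    """A frame flagged for the coach (hypothetical structure)."""
    frame_index: int
    score: float


class RunningMeanScorer:
    """Stand-in scorer: distance from an exponential moving average."""

    def __init__(self, rate: float = 0.1):
        self.rate = rate
        self.mean = None

    def __call__(self, features: Sequence[float]) -> float:
        if self.mean is None:
            self.mean = list(features)
            return 0.0
        # Score against the state BEFORE adapting to the new frame.
        dist = sum((f - m) ** 2 for f, m in zip(features, self.mean)) ** 0.5
        self.mean = [(1 - self.rate) * m + self.rate * f
                     for f, m in zip(features, self.mean)]
        return dist


def run_feedback_loop(
    frames: Iterable[object],
    extract_features: Callable[[object], Sequence[float]],
    score: Callable[[Sequence[float]], float],
    threshold: float,
) -> Iterator[Cue]:
    for index, frame in enumerate(frames):
        features = extract_features(frame)   # step 1: posture and gaze features
        outlierness = score(features)        # step 2: outlierness score
        if outlierness > threshold:          # step 3: notify the coach
            yield Cue(index, outlierness)


# Toy stream: a stable posture, one abrupt change, then back to stable.
frames = [(0.0, 0.0)] * 5 + [(3.0, 3.0)] + [(0.0, 0.0)]
cues = list(run_feedback_loop(frames, lambda f: f, RunningMeanScorer(), threshold=1.0))
print([cue.frame_index for cue in cues])
```

Keeping the scorer behind a plain callable means the stand-in can be swapped for a mixture-model detector without touching the loop.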
REsCUE: How anomaly detection algorithm works
We use an algorithm based on a time-adaptive Gaussian mixture model (GMM) [9].
Time series behavior data of the coachee taken from webcam:
The parameters of the GMM (e.g., mean and covariance) are updated with a forgetting rate r.
• The GMM gradually adapts to newly obtained nonverbal behavior data.
• When the trend of the input data suddenly changes, it is detected by the spike of negative log-likelihood.
[9] Kenji Yamanishi, et al. 2004. On-Line Unsupervised Outlier Detection Using Finite Mixtures with Discounting Learning Algorithms. Data Mining and Knowledge Discovery.

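To make the role of the forgetting rate r concrete, here is a minimal sketch of an online GMM whose statistics are exponentially discounted, loosely following the discounting idea of [9]. It is not the REsCUE implementation: the class name `DiscountingGMM`, the diagonal covariances, the smoothing constant `alpha`, the variance floor, and the random initialization are all assumptions made for illustration.

```python
import numpy as np


class DiscountingGMM:
    """Online diagonal-covariance GMM with exponential forgetting (illustrative)."""

    def __init__(self, n_components, dim, r=0.05, alpha=1.0, seed=0):
        rng = np.random.default_rng(seed)
        self.r = r          # forgetting rate: larger r forgets the past faster
        self.alpha = alpha  # responsibility smoothing (assumed value)
        self.k = n_components
        self.weights = np.full(n_components, 1.0 / n_components)
        self.means = rng.normal(size=(n_components, dim))
        self.vars = np.ones((n_components, dim))
        # Discounted sufficient statistics: weight * E[x] and weight * E[x^2].
        self.s1 = self.weights[:, None] * self.means
        self.s2 = self.weights[:, None] * (self.vars + self.means ** 2)

    def _log_joint(self, x):
        # log(weight_i) + log N(x | mean_i, diag(var_i)) for every component i
        log_pdf = -0.5 * np.sum(
            np.log(2 * np.pi * self.vars) + (x - self.means) ** 2 / self.vars,
            axis=1,
        )
        return np.log(self.weights) + log_pdf

    def score_and_update(self, x):
        x = np.asarray(x, dtype=float)
        log_joint = self._log_joint(x)
        log_lik = np.logaddexp.reduce(log_joint)
        score = -log_lik  # negative log-likelihood BEFORE adapting to x

        # Smoothed responsibilities keep every component alive.
        resp = np.exp(log_joint - log_lik)
        resp = (1 - self.alpha * self.r) * resp + self.alpha * self.r / self.k

        # Discounted updates: old statistics decay by (1 - r).
        r = self.r
        self.weights = (1 - r) * self.weights + r * resp
        self.s1 = (1 - r) * self.s1 + r * resp[:, None] * x
        self.s2 = (1 - r) * self.s2 + r * resp[:, None] * x ** 2
        self.means = self.s1 / self.weights[:, None]
        self.vars = np.maximum(
            self.s2 / self.weights[:, None] - self.means ** 2, 1e-3
        )
        return score


# Toy stream: stable behavior followed by one abrupt change.
rng = np.random.default_rng(1)
model = DiscountingGMM(n_components=2, dim=2, r=0.05)
calm_scores = [model.score_and_update(rng.normal(0.0, 0.1, size=2)) for _ in range(300)]
jump_score = model.score_and_update([5.0, 5.0])
print(f"typical score: {np.median(calm_scores[100:]):.2f}, score at jump: {jump_score:.2f}")
```

Scoring each sample before the update is what makes a sudden change show up as a spike: the model has not yet adapted to the new behavior when it is asked how likely that behavior is.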
REsCUE: Visualization based on GMM
The GMM allows us to provide an interpretable visualization.
In the GMM, each component fits a representative past state.
The most anomalous frames can be identified by sorting the frames by their likelihoods.
Just by arranging these frames, the coach can compare them and understand the change easily even during the session.

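The frame selection behind such a visualization can be sketched as follows. This is a hedged illustration, not the system's actual code: the function names are hypothetical, and picking the past frame nearest to each component mean as that component's "representative" frame is an assumption about how a component could be shown to the coach.

```python
import numpy as np


def most_anomalous(neg_log_liks, top_k=3):
    """Indices of the top_k frames with the highest negative log-likelihood."""
    order = np.argsort(neg_log_liks)[::-1]  # descending outlierness
    return order[:top_k].tolist()


def representative_frames(features, component_means):
    """For each GMM component, the index of the past frame closest to its mean."""
    features = np.asarray(features, dtype=float)      # (n_frames, dim)
    means = np.asarray(component_means, dtype=float)  # (n_components, dim)
    dists = np.linalg.norm(features[None, :, :] - means[:, None, :], axis=2)
    return dists.argmin(axis=1).tolist()


# Toy data: five frames of 2-D features and their per-frame scores.
features = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0], [0.1, 0.0], [4.8, 5.1]]
scores = [1.0, 0.5, 9.0, 2.0, 7.0]
means = [[0.0, 0.0], [5.0, 5.0]]

print(most_anomalous(scores, top_k=2))         # frames to show as "changes"
print(representative_frames(features, means))  # frames to show as "usual states"
```

Placing the two sets of frames side by side is what lets the coach judge at a glance whether a detected change matters.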
REsCUE: Detection results
These behaviors were detected without any rules or heuristics and regarded as informative by professional coaches.
The algorithm sometimes detected apparent behavioral changes (e.g., taking a personal organizer out of a bag).
The visualization allows the coach to interpret why the scene is detected, which avoids destroying their trust.
Now, REsCUE is practically deployed as a supporting system.

Lens of Parasuraman’s framework of automation
The design of our approach can be explained using Parasuraman's framework.
Information acquisition / Information analysis / Decision & action selection / Action implementation
Realm of automation
10: the computer decides everything, acts autonomously, ignoring the human
1: the computer offers no assistance; human must take all decisions and actions
Trade-off between
・human performance
・automation reliability
・cost of consequences

Lens of Parasuraman’s framework of automation
Observation
This characteristic plot of our approach came from ...
Low human performance:
• Dependency on the skills or mental load
High automation reliability:
• No dependency on heuristics or training data
Low cost of consequence:
• Interpretable visualization to discern uninformative cues

Lens of Parasuraman’s framework of automation
Interpretation
This characteristic plot of our approach came from ...
High human performance:
• Good at dealing with context
Low automation reliability:
• Automatic interpretation can be insensitive to subtle context
High cost of consequence:
• Risk of asking irrelevant questions that disturb the session

Application: Supporting skill transfer
The informativeness of the detected cues depends on the coach's skill:
• A skillful coach gains information from trifling behaviors.
• A novice coach often disregards such behaviors.
The difference in how each coach interprets the cues reveals the difference in their skills.
This can be utilized for skill transfer by helping novice coaches learn how skillful coaches gain information from various behaviors.

Application: Supporting skill transfer
Annotation phase:
A skillful coach and a novice coach each classify whether each detected cue is informative or not.
Discussion phase:
Through the discussion about the discrepancies, the novice coach can learn the way of interpretation.
The transparency of the results and the design of allowing open-ended interpretation make this application possible.

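As a small illustration of how the annotation phase could feed the discussion phase, the sketch below lists the cues on which the two coaches disagree. The function name, the cue identifiers, and the dictionary format (cue id mapped to an "informative" flag) are hypothetical; the slides do not describe a data format.

```python
def find_discrepancies(skilled, novice):
    """Return cue ids (sorted) that both coaches labeled but labeled differently."""
    shared = sorted(set(skilled) & set(novice))
    return [cue for cue in shared if skilled[cue] != novice[cue]]


# Hypothetical annotations: True means the coach judged the cue informative.
skilled_labels = {"cue-01": True, "cue-02": True, "cue-03": False}
novice_labels = {"cue-01": True, "cue-02": False, "cue-03": False}

# Cues worth discussing: the skillful coach saw something the novice did not.
print(find_discrepancies(skilled_labels, novice_labels))
```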
Conclusion & On-going work
• We introduced a new framework of human-AI communication that is based on the unsupervised anomaly detection algorithm.
• Its design of separating observation and interpretation enables human-AI collaboration in highly contextual situations, such as executive coaching.
• Its interpretable visualization enabled by GMM provides transparency in its detection results, which helps maintain trust with humans.
We remark that REsCUE does not require any prior knowledge or rules and can be used in various domains.
Now, we are working on applying this to analyzing sales communication.
Read our paper!

Human-AI communication for human-human communication / CHAI Workshop @ IJCAI '22

  • 1. Human-AI communication for human-human communication: 
 Applying interpretable unsupervised anomaly detection to executive coaching (equal contribution) CHAI Workshop @ IJCAI '22 July 24, 2022 Riku Arakawa† Carnegie Mellon University, USA Hiromu Yakura† University of Tsukuba, Japan
  • 2. Background: Deep-learning-based human behavior analysis Advancement in human behavior analysis techniques: ・Facial expression recognition [1] ・Posture estimation [2] [1] I. Çugu, et al., 2017. MicroExpNet: An Extremely Small and Fast Model For Expression Recognition From Frontal Face Images. arXiv. [2] S.-E. Wei, et al., 2016. Convolutional Pose Machines. IEEE CVPR. It is expected that we can analyze and support human communication by applying these techniques. 2
  • 3. Background: A tool for helping public speaking with feedback [3] M. I. Tanveer, et al., 2015. A Real-Time In-Situ Intelligent Interface to Help People With Public Speaking. ACM IUI. [4] I. Damian, et al., 2015. Measuring the impact of multimodal behavioural feedback loops on social interactions.. ACM ICMI. Speech-feature-based feedback [3] Show feedback such as “louder” 
 and “faster” on a Google Glass 
 based on speech speed or volume. Posture-based feedback [4] Alert a speaker when they cross 
 their arm for a long time 
 based on posture estimation.
  • 4. Our perspective: Limitation of heuristic approach
    Human-to-human communication is very contextual: a posture may reflect a defensive attitude [5] or deep thinking [6].
    Thus, we need a new framework of human-AI communication:
    • Heuristic approach
    • Supervised approach w/ training data of numerous classes
    • Unsupervised approach w/o rules or training data
    [5] J. Navarro and M. Karlins, 2008. What Every BODY Is Saying: An Ex-FBI Agent’s Guide to Speed Reading People. HarperCollins, New York.
    [6] R. Friedman and A. J. Elliot, 2008. The effect of arm crossing on persistence and performance. Europ. J. Soc. Psych.
  • 5. Research object: Executive coaching
    • It consists of one-on-one conversations, in which coaches are required to observe the nonverbal behavior of coachees [7].
    • The importance of observing nonverbal behavior is emphasized in terms of reading the nuance of what the coachee said [8].
    We hypothesized that AI can help novice coaches in the observation process.
    However, notifying coaches of detected postures (e.g., crossing arms) or emotions (e.g., confusion) without context was not appreciated.
    [7] E. Cox, et al., 2009. The Complete Handbook of Coaching. SAGE Publications, Los Angeles.
    [8] D. B. Drake, 2009. Narrative coaching. In The Complete Handbook of Coaching. SAGE Publications, Los Angeles.
  • 6. Key idea: Separating observation and judgment
    Coaches ignored the outputs once the outputs contradicted their observation or intuition. They found it difficult to rely on outputs based on simplified classes that are indifferent to subtle context.
    This guided us to reframe the way of human-AI communication:
    Humans
    Pros: Good at understanding context
    Cons: Difficult to keep a stable perspective due to their skills or mental load
    AIs
    Pros: Stable performance
    Cons: Not good at dealing with context
    Separation of observation and judgment would be an alternative way of human-AI communication.
  • 7. REsCUE: Real-time feedback using anomaly detection
    We developed a supporting system that observes the nonverbal behavior of coachees using unsupervised anomaly detection. It detects informative cues in the behavior and notifies the coach in real time.
    Detailed workflow:
    1. Extract posture and gaze information of the coachee.
    2. Calculate an outlierness score using an anomaly detection algorithm.
    3. Notify the coach in real time with an interpretable visualization.
  • 8. REsCUE: How anomaly detection algorithm works
    We use an algorithm based on a time-adaptive Gaussian mixture model (GMM) [9]. Its input is time-series behavior data of the coachee taken from a webcam.
    • The GMM gradually adapts to newly obtained nonverbal behavior data. The parameters of the GMM (e.g., mean and covariance) are updated with a forgetting rate r.
    • When the trend of the input data suddenly changes, the change is detected as a spike in the negative log-likelihood.
    [9] Kenji Yamanishi, et al. 2004. On-Line Unsupervised Outlier Detection Using Finite Mixtures with Discounting Learning Algorithms. Data Mining and Knowledge Discovery.
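The update with a forgetting rate and the negative log-likelihood score described on slide 8 can be sketched roughly as follows. This is a simplified illustration, not the REsCUE implementation or the exact algorithm of [9]: the class name `DiscountingGMM`, the diagonal covariance, the `smooth` term, and the synthetic two-dimensional data are all assumptions made for the example.

```python
import numpy as np


class DiscountingGMM:
    """Online diagonal-covariance GMM with forgetting rate r (illustrative sketch)."""

    def __init__(self, n_components, dim, r=0.05, smooth=1e-3, seed=0):
        rng = np.random.default_rng(seed)
        self.r = r            # forgetting rate: larger values forget the past faster
        self.smooth = smooth  # assumed stabilizer so unused components never vanish
        self.weights = np.full(n_components, 1.0 / n_components)
        self.means = rng.normal(0.0, 1.0, size=(n_components, dim))
        self.vars = np.ones((n_components, dim))

    def score_and_update(self, x):
        """Return the negative log-likelihood of x, then adapt the model to x."""
        x = np.asarray(x, dtype=float)
        delta = x - self.means
        log_pdf = -0.5 * (delta**2 / self.vars
                          + np.log(2 * np.pi * self.vars)).sum(axis=1)
        log_joint = np.log(self.weights) + log_pdf
        log_lik = np.logaddexp.reduce(log_joint)
        score = -log_lik  # computed with the parameters *before* the update

        # Responsibilities, mixed with a small uniform term for stability.
        resp = np.exp(log_joint - log_lik)
        resp = (1 - self.smooth) * resp + self.smooth / len(self.weights)

        # Discounted updates: old statistics decay by (1 - r), new data enters with r.
        self.weights = (1 - self.r) * self.weights + self.r * resp
        step = (self.r * resp / self.weights)[:, None]
        self.means = self.means + step * delta
        self.vars = np.maximum((1 - step) * (self.vars + step * delta**2), 1e-6)
        return float(score)


# Synthetic stand-in for behavior features: a calm phase, then a sudden change.
rng = np.random.default_rng(1)
calm = rng.normal(0.0, 0.3, size=(200, 2))
shifted = rng.normal(8.0, 0.3, size=(20, 2))

model = DiscountingGMM(n_components=2, dim=2, r=0.05)
scores = [model.score_and_update(x) for x in np.vstack([calm, shifted])]
print("score at the change point:", scores[200])
print("largest score just before:", max(scores[100:200]))
```

In this toy run the score spikes at the first shifted frame and then falls again as the model adapts, which is the behavior the slide describes. A larger `r` would make the model adapt, and forget, more quickly.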
  • 9. REsCUE: Visualization based on GMM
    The GMM allows us to provide an interpretable visualization.
    In the GMM, each component fits a representative past state.
    The most anomalous frames can be identified by sorting the frames by their likelihoods.
    Just by arranging these frames, the coach can compare them and easily understand the change, even during the session.
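Selecting the most anomalous frames by sorting on likelihood could look like the short sketch below. The function name `most_anomalous_frames`, the `top_k` parameter, and the sample log-likelihood values are hypothetical and chosen only for illustration.

```python
import numpy as np


def most_anomalous_frames(log_likelihoods, top_k=3):
    """Return indices of the top_k frames with the lowest log-likelihood."""
    order = np.argsort(log_likelihoods)  # ascending: least likely frames first
    return order[:top_k].tolist()


# Made-up per-frame log-likelihoods; frames 2 and 4 are the least likely.
frame_log_likelihoods = [-1.2, -0.8, -9.5, -1.0, -4.1]
selected = most_anomalous_frames(frame_log_likelihoods, top_k=2)
print(selected)  # [2, 4]
```

The returned indices could then be used to pull the corresponding webcam frames and show them side by side.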
  • 10. REsCUE: Detection results
    The algorithm sometimes detected apparent behavioral changes (e.g., taking a personal organizer out of a bag).
    These behaviors were detected without any rules or heuristics and were regarded as informative by professional coaches.
    The visualization allows the coach to interpret why the scene was detected, which avoids destroying their trust.
    REsCUE is now deployed in practice as a supporting system.
  • 11. Lens of Parasuraman’s framework of automation
    The design of our approach can be explained using Parasuraman's framework.
    Realm of automation: Information acquisition, Information analysis, Decision & action selection, Action implementation
    Levels of automation:
    1: the computer offers no assistance; human must take all decisions and actions
    10: the computer decides everything, acts autonomously, ignoring the human
    Trade-off between: human performance, automation reliability, cost of consequences
  • 13. Lens of Parasuraman’s framework of automation
    The design of our approach can be explained using Parasuraman's framework.
    Realm of automation: Information acquisition, Information analysis, Decision & action selection, Action implementation
    This characteristic plot of our approach came from ... observation
    Low human performance:
    • Dependency on the skills or mental load
    High automation reliability:
    • No dependency on heuristics or training data
    Low cost of consequence:
    • Interpretable visualization to discern uninformative cues
  • 14. Lens of Parasuraman’s framework of automation
    The design of our approach can be explained using Parasuraman's framework.
    Realm of automation: Information acquisition, Information analysis, Decision & action selection, Action implementation
    This characteristic plot of our approach came from ... interpretation
    High human performance:
    • Good at dealing with context
    Low automation reliability:
    • Automatic interpretation can be insensitive to subtle context
    High cost of consequence:
    • Risk of asking irrelevant questions that disturb the session
  • 15. Application: Supporting skill transfer
    The informativeness of the detected cues depends on the coach's skill:
    • A skillful coach gains information from trifling behaviors.
    • A novice coach often disregards such behaviors.
    The difference in how each coach interprets the cues reveals the difference in their skills.
    This can be utilized for skill transfer by helping novice coaches learn how skillful coaches gain information from various behaviors.
  • 16. Application: Supporting skill transfer
    Annotation phase: A skillful coach and a novice coach each classify whether each detected cue is informative or not.
    Discussion phase: Through discussion of the discrepancies, the novice coach can learn the way of interpretation.
    The transparency of the results and the design of allowing open-ended interpretation enable this use of the tool.
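The step between the annotation and discussion phases, finding the cues on which the two coaches disagree, might be sketched as below. The function name `find_discrepancies`, the cue identifiers, and the boolean labels are hypothetical; the slides do not specify how annotations are stored.

```python
def find_discrepancies(skillful_labels, novice_labels):
    """Return the cue ids that both coaches annotated but labeled differently."""
    shared = skillful_labels.keys() & novice_labels.keys()
    return sorted(cue for cue in shared
                  if skillful_labels[cue] != novice_labels[cue])


# Hypothetical annotations: True means "informative", False means "not informative".
skillful = {"cue1": True, "cue2": True, "cue3": False}
novice = {"cue1": True, "cue2": False, "cue3": False}

to_discuss = find_discrepancies(skillful, novice)
print(to_discuss)  # ['cue2']
```

Each cue in the resulting list would be a candidate topic for the discussion phase.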
  • 17. Conclusion & On-going work
    • We introduced a new framework of human-AI communication that is based on an unsupervised anomaly detection algorithm.
    • Its design of separating observation and interpretation enables human-AI collaboration in highly contextual situations, such as executive coaching.
    • Its interpretable visualization, enabled by the GMM, provides transparency in its detection results, which helps maintain trust with humans.
    We remark that REsCUE does not require any prior knowledge or rules and can be used in various domains. We are now working on applying it to the analysis of sales communication.
    Read our paper!