TO FUSE OR NOT TO FUSE:
COGNITIVE DIVERSITY FOR COMBINING
MULTIPLE SCORING SYSTEMS (MSS)
Frank Hsu
Fordham University
 IBM Cognitive System Institute Group (CSIG),
Dec. 17, 2015
1
2
To rank a list of choices (subjects, objects, items, options, …)
Domains: Biomedical and Health; STEM Areas; Society and Social Choices; Business and Finance
• Genes, ligands, or DNA fragments in Biomedical Science
• Targets, documents, trajectories, or host names in Technology or Engineering
• Movies, books, apartments, skaters, or sports teams in Social Network or Social Choices
• Labels and degree of stress in classification and affective computing, respectively
• Customers, vendors, corporate risks, or stocks in Business and Finance
3
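Each choice (or option) has (or can be described by) a set of variables: attributes, criteria, cues, features, indicators, judges, parameters, …
Variables (scoring systems): A, B, C, and D, where C = SC(A, B) is the score combination and D = RC(A, B) is the rank combination of A and B.
Each scoring system X gives every item d1, d2, …, di, …, dn a score sX and a rank rX (table columns: sA rA sB rB sC rC sD rD).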
4
Domain Examples:
• Active Search in Chemical Space
• Internet Search Strategy
• Figure Skating Judgment
• Crossing the street
5
Combining Multiple Scoring Systems (MSS) to rank a group of skaters:
(a) Scores and Score Combination (SC)
      J1     J2     J3      SC       Final Rank
d1    8.5    7      9.7     25.2     4
d2    7.6    8.4    9.6     25.6     3
d3    8.3    5.6    9.75    23.65    7
d4    6.4    5.4    9.81    21.61    8
d5    9.4    7.8    9.68    26.88    2
d6    9.5    8.5    9.2     27.2     1
d7    7.9    6.3    10      24.2     6
d8    10     10     5.1     25.1     5
(b) Ranks and Rank Combination (RC)
      J1     J2     J3      RC       Final Rank
d1    4      5      4       13       4.5
d2    7      3      6       16       7
d3    5      7      3       15       6
d4    8      8      2       18       8
d5    3      4      5       12       3
d6    2      2      7       11       2
d7    6      6      1       13       4.5
d8    1      1      8       10       1
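The two tables can be reproduced with a short sketch. It assumes that score combination (SC) is the plain sum of the judges' scores, that rank combination (RC) is the plain sum of the judges' ranks, and that tied totals share the average of their positions, which is what the 4.5 entries suggest. The helper name `average_ranks` is hypothetical. The J2 score for d4 is entered as 5.4, the value consistent with the listed SC of 21.61 and the listed J2 rank of 8.

```python
def average_ranks(values, higher_is_better=True):
    """Rank values 1..n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i],
                   reverse=higher_is_better)
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the block while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        shared = (pos + 1 + end + 1) / 2  # mean of 1-based positions pos+1..end+1
        for k in range(pos, end + 1):
            ranks[order[k]] = shared
        pos = end + 1
    return ranks


skaters = ["d1", "d2", "d3", "d4", "d5", "d6", "d7", "d8"]
J1 = [8.5, 7.6, 8.3, 6.4, 9.4, 9.5, 7.9, 10]
J2 = [7, 8.4, 5.6, 5.4, 7.8, 8.5, 6.3, 10]  # d4 taken as 5.4 (assumption, see above)
J3 = [9.7, 9.6, 9.75, 9.81, 9.68, 9.2, 10, 5.1]

# (a) Score combination: sum the scores, highest total is ranked first.
sc = [round(a + b + c, 2) for a, b, c in zip(J1, J2, J3)]
sc_rank = average_ranks(sc, higher_is_better=True)

# (b) Rank combination: sum each judge's ranks, lowest total is ranked first.
judge_ranks = [average_ranks(j) for j in (J1, J2, J3)]
rc = [sum(r) for r in zip(*judge_ranks)]
rc_rank = average_ranks(rc, higher_is_better=False)

for name, s, sr, r, rr in zip(skaters, sc, sc_rank, rc, rc_rank):
    print(name, s, sr, r, rr)
```

Under these assumptions the two methods disagree about the winner: score combination places d6 first, while rank combination places d8 first.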
6
Similarity between two scoring systems, d(A, B):
(a) Data correlation (1885 - )
• Pearson's correlation coefficient (P).
• Spearman's footrule (F).
• Kendall's rank correlation tau (T).
• Spearman's rank correlation rho (R).
(b) Information Diversity
■ Cognitive Diversity d(A, B) between two scoring systems A and B is based on the rank-score characteristic (RSC) functions of A and B (fA and fB).
■ RSC functions fJ1, fJ2, fJ3 (score at each rank):
Rank   fJ1    fJ2    fJ3
1      1      1      1
2      0.86   0.75   0.97
3      0.71   0.63   0.93
4      0.57   0.5    0.9
5      0.43   0.38   0.86
6      0.28   0.25   0.83
7      0.14   0.13   0.8
8      0      0      0
[Figure: RSC functions fJ1, fJ2, and fJ3, plotted as score (y-axis, 0 to 1) against rank (x-axis, 0 to 8).]
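As a rough illustration of the data-correlation measures in (a), the sketch below computes them for pairs of rank lists. The function names are hypothetical. The footrule is taken as the unnormalized sum of absolute rank differences, Kendall's tau is the tau-a form (which assumes no ties), and Spearman's rho uses the no-ties shortcut formula. The inputs are the judges' ranks from the skating example.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson's correlation coefficient between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def footrule(ra, rb):
    """Spearman's footrule: total absolute displacement between two rankings."""
    return sum(abs(a - b) for a, b in zip(ra, rb))


def kendall_tau(ra, rb):
    """Kendall's tau-a: (concordant - discordant) / number of pairs."""
    concordant = discordant = 0
    for i, j in combinations(range(len(ra)), 2):
        sign = (ra[i] - ra[j]) * (rb[i] - rb[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    pairs = len(ra) * (len(ra) - 1) / 2
    return (concordant - discordant) / pairs


def spearman_rho(ra, rb):
    """Spearman's rho via 1 - 6 * sum(d^2) / (n * (n^2 - 1)); valid without ties."""
    n = len(ra)
    d2 = sum((a - b) ** 2 for a, b in zip(ra, rb))
    return 1 - 6 * d2 / (n * (n * n - 1))


# Ranks given by judges J1, J2, J3 to skaters d1..d8.
r_j1 = [4, 7, 5, 8, 3, 2, 6, 1]
r_j2 = [5, 3, 7, 8, 4, 2, 6, 1]
r_j3 = [4, 6, 3, 2, 5, 7, 1, 8]

for name, other in (("J1 vs J2", r_j2), ("J1 vs J3", r_j3)):
    print(name, footrule(r_j1, other), kendall_tau(r_j1, other),
          spearman_rho(r_j1, other))
```

All of these measures compare the two systems item by item. The RSC-based diversity in (b) instead compares how each system spreads its scores across ranks, regardless of which item holds each rank.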
7
Combinatorial Fusion Algorithm (CFA):
D = a set of classes, documents, genes, or molecules, with |D| = n.
N = the set {1, 2, …, n}.
R = the set of real numbers.
(a) Multiple Scoring Systems (MSS)
Each scoring system A has a score function sA: D → R, a rank function rA: D → N, and a rank-score characteristic (RSC) function fA: N → R, where
f(i) = (s ∘ r^{-1})(i) = s(r^{-1}(i)).
(b) Diversity (or similarity) between two scoring systems A and B, d(A, B), can be defined using score functions, rank functions, or RSC functions:
d(A, B) = d(sA, sB), or d(rA, rB), or d(fA, fB).
Ref: Hsu et al. in Advanced Data Mining Technologies in Bioinformatics, Idea Group Inc., 2006.
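A minimal sketch of the RSC function and of a diversity measure built on it follows. The composition f(i) = s(r^{-1}(i)) comes from the slide. The specific form of d(fA, fB) does not: the slides leave it open, and the root-mean-square difference used here is an assumption made for illustration. The function names are hypothetical, and the three RSC lists are the values tabulated earlier for judges J1, J2, and J3.

```python
import math


def rsc_function(scores):
    """Return [f(1), ..., f(n)] with f(i) = s(r^{-1}(i)).

    r^{-1}(i) is the item holding rank i; rank 1 goes to the highest score,
    so f is simply the scores read off in rank order.
    """
    order = sorted(range(len(scores)), key=lambda d: scores[d], reverse=True)
    return [scores[d] for d in order]  # order[i - 1] plays the role of r^{-1}(i)


def cognitive_diversity(f_a, f_b):
    """Assumed form of d(fA, fB): root-mean-square gap between two RSC functions."""
    n = len(f_a)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(f_a, f_b)) / n)


# RSC values at ranks 1..8 for the three judges, as tabulated earlier.
f_j1 = [1, 0.86, 0.71, 0.57, 0.43, 0.28, 0.14, 0]
f_j2 = [1, 0.75, 0.63, 0.5, 0.38, 0.25, 0.13, 0]
f_j3 = [1, 0.97, 0.93, 0.9, 0.86, 0.83, 0.8, 0]

print("d(J1, J2) =", cognitive_diversity(f_j1, f_j2))
print("d(J1, J3) =", cognitive_diversity(f_j1, f_j3))
print("d(J2, J3) =", cognitive_diversity(f_j2, f_j3))
```

With this choice of d, J1 and J2 come out close to each other and both are far from J3, whose scores stay high until the last rank.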
8
Combining MSS for structure-based virtual screening:
(I) Combining 2 to 5 scoring systems (by rank or by score) with performance comparisons
• Combining different methods improves performance.
• The combination of B and D works best on thymidine kinase (TK).
Ref: Yang et al. Journal of Chemical Information and Modeling 45 (2005), pp. 1134-1146.
[Figure: The Performance of Thymidine Kinase (TK). Score (y-axis, 0.00 to 1.00) against rank (x-axis, 0 to 1000) for GEMDOCK-Binding, GEMDOCK-Pharma, GOLD-GoldScore, GOLD-Goldinter, and GOLD-ChemScore.]
[Figure: TK. Average GH score (y-axis, 0.00 to 0.70) of rank combination and score combination for all 31 combinations of the five scoring systems A to E, from the single systems through ABCDE.]
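The enumeration behind such a comparison can be sketched as follows. Everything in the data is hypothetical: three toy scoring systems instead of five, six items, an invented set of "active" items, and a simple hits-in-top-k count standing in for the average GH score used on the slide. Scores are averaged as given, which assumes the systems share a comparable scale.

```python
from itertools import combinations

# Hypothetical toy data: scores of 6 items under 3 scoring systems (higher is better).
scores = {
    "A": [0.9, 0.2, 0.6, 0.4, 0.8, 0.1],
    "B": [0.3, 0.7, 0.9, 0.1, 0.6, 0.2],
    "C": [0.5, 0.4, 0.8, 0.9, 0.1, 0.3],
}
actives = {0, 2}  # hypothetical indices of the truly active items
k = 2             # evaluate the top-k of each fused ranking


def to_ranks(values):
    """Rank 1 for the highest score (the toy data has no ties within a system)."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    ranks = [0] * len(values)
    for pos, i in enumerate(order, start=1):
        ranks[i] = pos
    return ranks


def hits_at_k(order):
    """Number of active items among the first k items of a ranking."""
    return len(set(order[:k]) & actives)


def evaluate(systems):
    """Return (hits by score combination, hits by rank combination)."""
    n = len(scores[systems[0]])
    rank_lists = [to_ranks(scores[s]) for s in systems]
    avg_score = [sum(scores[s][i] for s in systems) / len(systems) for i in range(n)]
    avg_rank = [sum(r[i] for r in rank_lists) / len(systems) for i in range(n)]
    by_score = sorted(range(n), key=lambda i: avg_score[i], reverse=True)
    by_rank = sorted(range(n), key=lambda i: avg_rank[i])
    return hits_at_k(by_score), hits_at_k(by_rank)


results = {}
for size in range(1, len(scores) + 1):
    for combo in combinations(sorted(scores), size):
        results["".join(combo)] = evaluate(combo)

for name, (sc_hits, rc_hits) in results.items():
    print(f"{name}: score combination {sc_hits}, rank combination {rc_hits}")
```

With five systems the same loop would visit the 31 combinations shown on the x-axis of the second figure.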
9
Combining MSS for structure-based virtual screening:
(II) Positive cases (o) vs. negative cases (x) for 80 2-combinations, in terms of performance ratio (x-coordinate) and cognitive diversity (y-coordinate)
10
It was shown in the information retrieval domain that under certain
conditions (one of these conditions is higher cognitive diversity), rank
combination can be better than score combination.
Ref: Hsu, D.F., Taksa, I. Information Retrieval 8(3), pp. 449–480, 2005.
11
Target Tracking with Three Features:
We use three features:
• Color – average normalized RGB color
• Position – location of the target region centroid
• Shape – area of the target region
Ref: Lyons, D.M., Hsu, D.F. Information Fusion 10(2): pp. 124-136, 2009.
12
Target Tracking
RUN2: score fusion
RUN3: score and rank fusion, using ground truth to select
RUN4: score and rank fusion, using rank-score function to select

        RUN2                     RUN3                     RUN4
Seq.    MSSD Avg.  MSSD Var.     MSSD Avg.  MSSD Var.     MSSD Avg.  MSSD Var.
1       1537.22    694.47        1536.65    695.49        1536.9     694.24
2       816.53     8732.13       723.13     3512.19       723.09     3511.41
3       108.89     61.61         108.34     60.58         108.89     61.61
4       23.14      2.39          23.04      2.30          23.14      2.39
5       334.13     120.11        332.89     119.39        334.138    120.11
6       96.40      119.22        66.9       12.91         67.28      13.38
7       577.78     201.29        548.6      127.78        577.78     201.29
8       538.35     605.84        500.9      57.91         534.3      602.85
9       143.04     339.73        140.18     297.07        142.33     294.94
10      260.24     86.65         252.17     84.99         258.64     85.94
11      520.13     2991.17       440.98     2544.69       470.27     2791.62
12      1188.81    745.01        1188.81    745.01        1188.81    745.01

• RUN4 is as good as or better than RUN2 in all cases (highlighted in gray on the original slide).
• RUN4 is, predictably, not always as good as RUN3 ('best case').
Note: Lower MSSD implies better tracking performance.
13
Cognitive Informatics: Combining Two Visual Perception
Systems
Ref: Batallones, A. et al. On the combination of two visual cognition systems using combinatorial fusion. Brain Informatics (2015), 2, pp. 21-32.
14
Cognitive Diversity (CD) provides information diversity (complementary to and in contrast with the statistical data correlation):
■ In similarity measurement between two scoring systems (or data distributions): CD vs. Pearson, footrule, Kendall tau, Spearman rho.
■ In goodness of fit between two models (or hypotheses): CD vs. Chi-square test, Kolmogorov-Smirnov test.
■ In cognitive computing between two hypotheses (or scoring systems), in order to decide when and how to fuse (or combine) multiple scoring systems: NLP, ML, DM, IR, ensemble, MADM & SC, RC, majority voting, weighted SC, weighted RC, POSet, max, min, ave., …
15
Cognitive systems that are capable of combining a group of diverse and good-performance scoring systems from a variety of sensors, sources, and software can serve as a resilient engine and effective telescope for the new scientific discovery paradigm (integration vs. reduction) in the era of data-driven human-interactive knowledge discovery.
D. F. Hsu; IBM CSIG seminar, Dec. 17, 2015
