FRAMING FEW-SHOT KNOWLEDGE GRAPH COMPLETION WITH LARGE LANGUAGE MODELS
Adrian M.P. Brașoveanu
Lyndon J.B. Nixon
Albert Weichselbraun
Arno Scharl
NLP4KGC@SEMANTICS 2023
LLMs 2020-2023
LARGE LANGUAGE MODELS
Generative AI
ChatGPT 3.5/4.0
Claude 2
Cohere Chat
Falcon
LLaMa2
Flan-T5
Core Innovation:
Ecosystems
Agents
LangChain
KGs
Tools
Problem Solving
Mixture of Experts (MoE)?
Image Copyright © Language Models are Few-Shot Learners (2020) by Tom B. Brown et al. NeurIPS 2020.
LLM Reasoning Strategies (1): CoT
Relation Extraction with CoT
Explanation is All You Need!
Step-by-step reasoning
Augmented Text leads to better results!
Image Copyright © Revisiting Relation Extraction in the era of Large Language Models by Wadhwa et al. ACL(1) 2023.
LLM Reasoning Strategies (2): ToT
CoT contains explanations
ToT extends CoT
Multiple paths towards an answer
CoT-SC: majority voting mechanism
ToT: more similar to the human selection process
ToT allows for parallel exploration of ideas as opposed to linear exploration (CoT).
Image Copyright © Tree of Thoughts: Deliberate Problem Solving with Large Language Models (2023) by Yao et al.
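The majority voting behind CoT-SC can be illustrated with a minimal Python sketch. It assumes the final answers of several independently sampled reasoning chains have already been extracted as strings; the function name `majority_vote` and the sample answers are hypothetical and do not come from the slides.

```python
from collections import Counter


def majority_vote(answers):
    """Return the most frequent final answer among sampled reasoning chains."""
    # Normalize whitespace and case so trivially different strings count together.
    normalized = [a.strip().lower() for a in answers if a and a.strip()]
    if not normalized:
        return None
    # most_common(1) yields [(answer, count)]; ties go to the answer seen first.
    return Counter(normalized).most_common(1)[0][0]


# Hypothetical final answers extracted from five sampled chains.
sampled = ["located in", "Located in ", "part of", "located in", "part of"]
print(majority_vote(sampled))
```

ToT differs in that intermediate thoughts are evaluated and pruned while the search is still running, so a single vote over final answers like this one would not capture it.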
Knowledge Graphs (KG)
Sustainability KG
Built with Wikidata.
Missing relations:
- country-specific
- region-specific
KG Completion (KGC)
Can we fill the missing relations using LLMs?
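To make the notion of "missing relations" concrete, here is a rough Python sketch that scans a set of (subject, relation, object) triples and reports which expected relations each subject lacks. The triples, the relation names `country` and `region`, and the helper `missing_relations` are placeholders chosen for illustration; they are not taken from the Sustainability KG or from Wikidata.

```python
from collections import defaultdict


def missing_relations(triples, expected_relations):
    """Map each subject to the expected relations it does not yet have."""
    present = defaultdict(set)
    for subject, relation, _obj in triples:
        present[subject].add(relation)
    expected = set(expected_relations)
    gaps = {}
    for subject, relations in present.items():
        absent = expected - relations
        if absent:
            gaps[subject] = sorted(absent)
    return gaps


# Placeholder triples; a real KG would supply these.
toy_triples = [
    ("topic_a", "instance_of", "class_1"),
    ("topic_a", "country", "country_x"),
    ("topic_b", "instance_of", "class_1"),
]
gaps = missing_relations(toy_triples, ["country", "region"])
print(gaps)
```

Each reported gap is a candidate question for an LLM, for example "which country does topic_b relate to?".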
Evaluating Large Language Models
Single interface
nat.dev/chat
Includes
ChatGPT 3.5/4 (with 32k context window)
Claude 1/2 (with 100k context window)
Cohere Chat
MPT30B
Falcon40B
LLaMa2
Functionality
Playground
Compare
Chat
Metrics
Evaluating Large Language Models
Relations: Only Relations
Explanations: CoT
Completions: Restricted CoT
Self-Scoring: Truthfulness Proxy
Evaluating Large Language Models
Tools
GPT-3.5
GPT-4.0
Claude2
MPT-30B
Few-Shot
Input: 12-14 annotated texts
Output: 50 annotated texts
We want all the texts annotated in a large batch if possible
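The few-shot setup described above (a handful of annotated texts in, one large batch of annotations out) could be assembled along the following lines. This is only a sketch: the prompt wording, the `Text:`/`Relation:` layout, and the function `build_few_shot_prompt` are assumptions made for illustration, not the prompts used in the study.

```python
def build_few_shot_prompt(examples, targets, instruction):
    """Assemble one prompt: instruction, annotated examples, then numbered targets."""
    lines = [instruction, ""]
    # Annotated demonstrations (the slides mention 12-14 of these).
    for text, relation in examples:
        lines.append(f"Text: {text}")
        lines.append(f"Relation: {relation}")
        lines.append("")
    # Unannotated texts to be labeled in a single batch.
    for i, text in enumerate(targets, start=1):
        lines.append(f"Text {i}: {text}")
    lines.append("")
    lines.append("Return one 'Relation i: ...' line per numbered text.")
    return "\n".join(lines)


# Placeholder inputs; real runs would pass annotated corpus texts.
demo_examples = [("example text one", "relation_x"), ("example text two", "relation_y")]
demo_targets = ["new text one", "new text two", "new text three"]
prompt = build_few_shot_prompt(
    demo_examples, demo_targets, "Annotate each text with a single relation."
)
print(prompt)
```

Numbering the target texts makes it easier to align the returned annotations with their inputs when a whole batch is requested at once.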
Evaluating Large Language Models
Taxonomy of Errors
These are only the most frequent errors!
And the Winner Is?
ChatGPT and Claude2 have similar performance
Conclusion?
Self-Scoring
Consecutive runs
Huge differences
And the Winner Is?
ChatGPT and Claude2 have similar performance
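One simple way to quantify how much self-scores differ between consecutive runs is to compute their spread. The Python sketch below assumes each run yields one numeric self-score for the same item; the 1-10 scale, the example values, and the helper `score_spread` are invented for illustration and are not results from the study.

```python
from statistics import mean, pstdev


def score_spread(scores_per_run):
    """Summarize how much a model's self-score varies across repeated runs."""
    return {
        "mean": mean(scores_per_run),
        "stdev": pstdev(scores_per_run),  # population standard deviation
        "range": max(scores_per_run) - min(scores_per_run),
    }


# Made-up self-scores from three consecutive runs on the same input.
spread = score_spread([8, 4, 6])
print(spread)
```

A large range relative to the scale suggests the self-score is too unstable to use as a truthfulness proxy without averaging over runs.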
Acknowledgments
PROJECTS
DWBI Vienna - Vienna Science and Technology Fund (WWTF) [10.47379/ICT20096]
SDG-HUB – FFG (GA No. 892212)
CONTACT
adrian.brasoveanu@modul.ac.at
THANK YOU!
