SlideShare ist ein Scribd-Unternehmen logo
NoCode, Data & AI
LLM Inside Bootcamp
Design Patterns: Retrieval
Augmentation with LLMs
The Next Frontier in AI: Retrieval Augmentation
Rahul Xavier Singh Anant Corporation
Nocode Data & AI
Retreival Augmented or
Data Augmented LLM
responses allow you to get
accurate answers from your
own data instead of
hallucinated answers.
Our Customers
NoCode, Data & AI
LLM Inside Bootcamp
with Cassandra
Full day bootcamp to familiarize product managers, software
professionals, and data engineers to creating next generation
experts, assistants, and platforms powered by Generative AI
with Large Language Models (LLM, OpenAI, GPT)
Rahul Xavier Singh Anant Corporation
Nocode Data & AI
kono.io/bootcamp
Agenda
● I: Strategy & Theory
● II: LLM Design Patterns
● III: NoCode/Code LLM Stacks
● IV: Build a Custom ChatBot
with LLM your Data
Today’s Agenda
1. Retrieval Augmentation
2. Design Patterns: Basic Database
3. Vector Database / Embeddings
4. Design Patterns: Vector Database
Retrieval Augmentation
● Basic Data Augmentation
● Vectorized Data Augmentation
● General Patterns & Architectures
What is data / retrieval augmentation?
● Information Retrieval : For data that we have not
trained in an LLM, we can get data from another source.
● Contextual Relevance : If the data we retrieve is
relevant to the user’s query, we only send what we need
to manage context length limits.
● Enhanced Capability : This allows us to get data from
different sources & systems to meet the needs of the
user.
● Dynamic Learning: Unlike training or fine tuning which
takes days & hours, we can have new data available
immediately.
Prompting Techniques: ReAct
1. Reasoning / Acting (ReAct) Continues to build on CoT reasoning, but
enhances it by acting as in getting other information that can help it.
2. It can act by asking more questions by itself, or potentially going out to
outside systems. This is the basis of systems like Langchain, LllamaIndex
ReAct: Synergizing Reasoning and Acting in Language Models
https://react-lm.github.io/
Using information retrieval with LLMs.
● Your code intercepts User’s Query
● Talks to an Information Retrieval System
○ Search Index
○ SQL Database
○ API …
● Constructs a prompt with the query, the “context” that
it got from the IR system
● Sends that new constructed prompt to the LLM
● Gets the answer, formats it, and sends it back to the
user.
Basic Information Retrieval
Augmentation
● You preprocess embeddings of your data into a vector
database with an LLM
● Your code intercepts User’s Query, embeds it with an
LLM
● Find similar documents from a vector database
● Constructs a prompt with the query, the “context” that it
got from the vector database
● Sends that new constructed prompt to the LLM
● Gets the answer, formats it, and sends it back to the
user.
Vectorized data augmentation
Vector Information Retrieval
Augmentation - Part 0
https://blog.christianperone.com/2013/09/machine-learnin
g-cosine-similarity-for-vector-space-models-part-iii/
https://milvus.io/blog/scalable-and-blazing-fast-similarity-
search-with-milvus-vector-database.md
● Vector databases
seem like the best
“memory” for
machine learning.
Vector Information Retrieval
Augmentation - Part 1
Vector Information Retrieval
Augmentation - Part 2
Vector Information Retrieval
Augmentation - Real World
Before LLM Engineering, Machine
Learning was Hard
https://planetcassandra.org/post/building-an-infinitely-smart-ai-powered-by-the-worlds-largest-scalable-datab
ase-apache-cassandra-part-1/
Now Making Intelligent Platforms is a lot
easier.
https://planetcassandra.org/post/building-an-infinitely-smart-ai-powered-by-the-worlds-largest-scalable-datab
ase-apache-cassandra-part-1/
LLM Frameworks
● LlamaIndex
● LangChain
● Semantic Kernel
20
Key Takeaways: Retrieval Augmentation
Prompt Engineering
Software Engineering
Use Open Frameworks
Data Engineering
- The core of LLM Frameworks is retrieval
augmentation.
- The first pillar of retrieval augmentation
is made up of prompt engineering to
know how to ask the question.
- The second pillar of retrieval
augmentation is made up of data
engineering, how to prepare the data.
- The last pillar of retrieval is basic
software engineering.
- Try it out on your own first, but quickly go
to a framework
Try it on your own ..
21
Thank you and Dream Big.
Hire us
- Design Workshops
- Innovation Sprints
- Service Catalog
Anant.us
- Read our Playbook
- Join our Mailing List
- Read up on Data Platforms
- Watch our Videos
- Download Examples

Weitere ähnliche Inhalte

Ähnlich wie NoCode, Data & AI LLM Inside Bootcamp: Episode 6 - Design Patterns: Retrieval Augmentation with LLMs

Danny Bickson - Python based predictive analytics with GraphLab Create
Danny Bickson - Python based predictive analytics with GraphLab Create Danny Bickson - Python based predictive analytics with GraphLab Create
Danny Bickson - Python based predictive analytics with GraphLab Create
PyData
 
So you want to do Data Science.... what now?
So you want to do Data Science.... what now?So you want to do Data Science.... what now?
So you want to do Data Science.... what now?
Raja Chandra Rangineni
 
Running Data Platforms Like Products
Running Data Platforms Like ProductsRunning Data Platforms Like Products
Running Data Platforms Like Products
VMware Tanzu
 
Digital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdfDigital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdf
ssuserd23711
 
hari_duche_updated
hari_duche_updatedhari_duche_updated
hari_duche_updated
Hari Duche
 

Ähnlich wie NoCode, Data & AI LLM Inside Bootcamp: Episode 6 - Design Patterns: Retrieval Augmentation with LLMs (20)

Symphony Driver Essay
Symphony Driver EssaySymphony Driver Essay
Symphony Driver Essay
 
Machine learning at scale - Webinar By zekeLabs
Machine learning at scale - Webinar By zekeLabsMachine learning at scale - Webinar By zekeLabs
Machine learning at scale - Webinar By zekeLabs
 
Danny Bickson - Python based predictive analytics with GraphLab Create
Danny Bickson - Python based predictive analytics with GraphLab Create Danny Bickson - Python based predictive analytics with GraphLab Create
Danny Bickson - Python based predictive analytics with GraphLab Create
 
So you want to do Data Science.... what now?
So you want to do Data Science.... what now?So you want to do Data Science.... what now?
So you want to do Data Science.... what now?
 
Using Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsUsing Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIs
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
 
Running Data Platforms Like Products
Running Data Platforms Like ProductsRunning Data Platforms Like Products
Running Data Platforms Like Products
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8
 
Data Management - Full Stack Deep Learning
Data Management - Full Stack Deep LearningData Management - Full Stack Deep Learning
Data Management - Full Stack Deep Learning
 
Simplifying AI and Machine Learning with Watson Studio
Simplifying AI and Machine Learning with Watson StudioSimplifying AI and Machine Learning with Watson Studio
Simplifying AI and Machine Learning with Watson Studio
 
Thinking Outside the Cube: How In-Memory Bolsters Analytics
Thinking Outside the Cube: How In-Memory Bolsters AnalyticsThinking Outside the Cube: How In-Memory Bolsters Analytics
Thinking Outside the Cube: How In-Memory Bolsters Analytics
 
IBM Bluemix OpenWhisk: Interconnect 2016, Las Vegas: CCD-1088: The Future of ...
IBM Bluemix OpenWhisk: Interconnect 2016, Las Vegas: CCD-1088: The Future of ...IBM Bluemix OpenWhisk: Interconnect 2016, Las Vegas: CCD-1088: The Future of ...
IBM Bluemix OpenWhisk: Interconnect 2016, Las Vegas: CCD-1088: The Future of ...
 
IBM Bluemix Openwhisk
IBM Bluemix OpenwhiskIBM Bluemix Openwhisk
IBM Bluemix Openwhisk
 
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
 
Digital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdfDigital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdf
 
Neo4j: What's Under the Hood & How Knowing This Can Help You
Neo4j: What's Under the Hood & How Knowing This Can Help You Neo4j: What's Under the Hood & How Knowing This Can Help You
Neo4j: What's Under the Hood & How Knowing This Can Help You
 
Online Fitness Gym Documentation
Online Fitness Gym Documentation Online Fitness Gym Documentation
Online Fitness Gym Documentation
 
Data Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps ApproachData Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps Approach
 
hari_duche_updated
hari_duche_updatedhari_duche_updated
hari_duche_updated
 
TechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptxTechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptx
 

Mehr von Anant Corporation

Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPTAutomate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Anant Corporation
 
Data Engineer's Lunch #86: Building Real-Time Applications at Scale: A Case S...
Data Engineer's Lunch #86: Building Real-Time Applications at Scale: A Case S...Data Engineer's Lunch #86: Building Real-Time Applications at Scale: A Case S...
Data Engineer's Lunch #86: Building Real-Time Applications at Scale: A Case S...
Anant Corporation
 

Mehr von Anant Corporation (20)

QLoRA Fine-Tuning on Cassandra Link Data Set (1/2) Cassandra Lunch 137
QLoRA Fine-Tuning on Cassandra Link Data Set (1/2) Cassandra Lunch 137QLoRA Fine-Tuning on Cassandra Link Data Set (1/2) Cassandra Lunch 137
QLoRA Fine-Tuning on Cassandra Link Data Set (1/2) Cassandra Lunch 137
 
Kono.IntelCraft.Weekly.AI.LLM.Landscape.2024.02.28.pdf
Kono.IntelCraft.Weekly.AI.LLM.Landscape.2024.02.28.pdfKono.IntelCraft.Weekly.AI.LLM.Landscape.2024.02.28.pdf
Kono.IntelCraft.Weekly.AI.LLM.Landscape.2024.02.28.pdf
 
Data Engineer's Lunch 96: Intro to Real Time Analytics Using Apache Pinot
Data Engineer's Lunch 96: Intro to Real Time Analytics Using Apache PinotData Engineer's Lunch 96: Intro to Real Time Analytics Using Apache Pinot
Data Engineer's Lunch 96: Intro to Real Time Analytics Using Apache Pinot
 
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPTAutomate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
 
YugabyteDB Developer Tools
YugabyteDB Developer ToolsYugabyteDB Developer Tools
YugabyteDB Developer Tools
 
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Episode 2: The LLM / GPT / AI Prompt / Data Engineer RoadmapEpisode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
 
Machine Learning Orchestration with Airflow
Machine Learning Orchestration with AirflowMachine Learning Orchestration with Airflow
Machine Learning Orchestration with Airflow
 
Cassandra Lunch 130: Recap of Cassandra Forward Talks
Cassandra Lunch 130: Recap of Cassandra Forward TalksCassandra Lunch 130: Recap of Cassandra Forward Talks
Cassandra Lunch 130: Recap of Cassandra Forward Talks
 
Data Engineer's Lunch 90: Migrating SQL Data with Arcion
Data Engineer's Lunch 90: Migrating SQL Data with ArcionData Engineer's Lunch 90: Migrating SQL Data with Arcion
Data Engineer's Lunch 90: Migrating SQL Data with Arcion
 
Data Engineer's Lunch 89: Machine Learning Orchestration with AirflowMachine ...
Data Engineer's Lunch 89: Machine Learning Orchestration with AirflowMachine ...Data Engineer's Lunch 89: Machine Learning Orchestration with AirflowMachine ...
Data Engineer's Lunch 89: Machine Learning Orchestration with AirflowMachine ...
 
Cassandra Lunch 129: What’s New: Apache Cassandra 4.1+ Features & Future
Cassandra Lunch 129: What’s New:  Apache Cassandra 4.1+ Features & FutureCassandra Lunch 129: What’s New:  Apache Cassandra 4.1+ Features & Future
Cassandra Lunch 129: What’s New: Apache Cassandra 4.1+ Features & Future
 
Data Engineer's Lunch #86: Building Real-Time Applications at Scale: A Case S...
Data Engineer's Lunch #86: Building Real-Time Applications at Scale: A Case S...Data Engineer's Lunch #86: Building Real-Time Applications at Scale: A Case S...
Data Engineer's Lunch #86: Building Real-Time Applications at Scale: A Case S...
 
CL 121
CL 121CL 121
CL 121
 
Data Engineer's Lunch #83: Strategies for Migration to Apache Iceberg
Data Engineer's Lunch #83: Strategies for Migration to Apache IcebergData Engineer's Lunch #83: Strategies for Migration to Apache Iceberg
Data Engineer's Lunch #83: Strategies for Migration to Apache Iceberg
 
Apache Cassandra Lunch 120: Apache Cassandra Monitoring Made Easy with AxonOps
Apache Cassandra Lunch 120: Apache Cassandra Monitoring Made Easy with AxonOpsApache Cassandra Lunch 120: Apache Cassandra Monitoring Made Easy with AxonOps
Apache Cassandra Lunch 120: Apache Cassandra Monitoring Made Easy with AxonOps
 
Apache Cassandra Lunch 119: Desktop GUI Tools for Apache Cassandra
Apache Cassandra Lunch 119: Desktop GUI Tools for Apache CassandraApache Cassandra Lunch 119: Desktop GUI Tools for Apache Cassandra
Apache Cassandra Lunch 119: Desktop GUI Tools for Apache Cassandra
 
Data Engineer's Lunch #82: Automating Apache Cassandra Operations with Apache...
Data Engineer's Lunch #82: Automating Apache Cassandra Operations with Apache...Data Engineer's Lunch #82: Automating Apache Cassandra Operations with Apache...
Data Engineer's Lunch #82: Automating Apache Cassandra Operations with Apache...
 
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Data Engineer's Lunch #60: Series - Developing Enterprise ConsciousnessData Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
 
Data Engineer's Lunch #81: Reverse ETL Tools for Modern Data Platforms
Data Engineer's Lunch #81: Reverse ETL Tools for Modern Data PlatformsData Engineer's Lunch #81: Reverse ETL Tools for Modern Data Platforms
Data Engineer's Lunch #81: Reverse ETL Tools for Modern Data Platforms
 
Data Engineer’s Lunch #67: Machine Learning - Feature Selection
Data Engineer’s Lunch #67: Machine Learning - Feature SelectionData Engineer’s Lunch #67: Machine Learning - Feature Selection
Data Engineer’s Lunch #67: Machine Learning - Feature Selection
 

Kürzlich hochgeladen

Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
Bhaskar Mitra
 

Kürzlich hochgeladen (20)

Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 

NoCode, Data & AI LLM Inside Bootcamp: Episode 6 - Design Patterns: Retrieval Augmentation with LLMs

  • 1. NoCode, Data & AI LLM Inside Bootcamp Design Patterns: Retrieval Augmentation with LLMs The Next Frontier in AI: Retrieval Augmentation Rahul Xavier Singh Anant Corporation Nocode Data & AI
  • 2. Retreival Augmented or Data Augmented LLM responses allow you to get accurate answers from your own data instead of hallucinated answers.
  • 4. NoCode, Data & AI LLM Inside Bootcamp with Cassandra Full day bootcamp to familiarize product managers, software professionals, and data engineers to creating next generation experts, assistants, and platforms powered by Generative AI with Large Language Models (LLM, OpenAI, GPT) Rahul Xavier Singh Anant Corporation Nocode Data & AI kono.io/bootcamp
  • 5. Agenda ● I: Strategy & Theory ● II: LLM Design Patterns ● III: NoCode/Code LLM Stacks ● IV: Build a Custom ChatBot with LLM your Data
  • 6. Today’s Agenda 1. Retrieval Augmentation 2. Design Patterns: Basic Database 3. Vector Database / Embeddings 4. Design Patterns: Vector Database
  • 7. Retrieval Augmentation ● Basic Data Augmentation ● Vectorized Data Augmentation ● General Patterns & Architectures
  • 8. What is data / retrieval augmentation? ● Information Retrieval : For data that we have not trained in an LLM, we can get data from another source. ● Contextual Relevance : If the data we retrieve is relevant to the user’s query, we only send what we need to manage context length limits. ● Enhanced Capability : This allows us to get data from different sources & systems to meet the needs of the user. ● Dynamic Learning: Unlike training or fine tuning which takes days & hours, we can have new data available immediately.
  • 9. Prompting Techniques: ReAct 1. Reasoning / Acting (ReAct) Continues to build on CoT reasoning, but enhances it by acting as in getting other information that can help it. 2. It can act by asking more questions by itself, or potentially going out to outside systems. This is the basis of systems like Langchain, LllamaIndex ReAct: Synergizing Reasoning and Acting in Language Models https://react-lm.github.io/
  • 10. Using information retrieval with LLMs. ● Your code intercepts User’s Query ● Talks to an Information Retrieval System ○ Search Index ○ SQL Database ○ API … ● Constructs a prompt with the query, the “context” that it got from the IR system ● Sends that new constructed prompt to the LLM ● Gets the answer, formats it, and sends it back to the user.
  • 12. ● You preprocess embeddings of your data into a vector database with an LLM ● Your code intercepts User’s Query, embeds it with an LLM ● Find similar documents from a vector database ● Constructs a prompt with the query, the “context” that it got from the vector database ● Sends that new constructed prompt to the LLM ● Gets the answer, formats it, and sends it back to the user. Vectorized data augmentation
  • 13. Vector Information Retrieval Augmentation - Part 0 https://blog.christianperone.com/2013/09/machine-learnin g-cosine-similarity-for-vector-space-models-part-iii/ https://milvus.io/blog/scalable-and-blazing-fast-similarity- search-with-milvus-vector-database.md ● Vector databases seem like the best “memory” for machine learning.
  • 17. Before LLM Engineering, Machine Learning was Hard https://planetcassandra.org/post/building-an-infinitely-smart-ai-powered-by-the-worlds-largest-scalable-datab ase-apache-cassandra-part-1/
  • 18. Now Making Intelligent Platforms is a lot easier. https://planetcassandra.org/post/building-an-infinitely-smart-ai-powered-by-the-worlds-largest-scalable-datab ase-apache-cassandra-part-1/
  • 19. LLM Frameworks ● LlamaIndex ● LangChain ● Semantic Kernel
  • 20. 20 Key Takeaways: Retrieval Augmentation Prompt Engineering Software Engineering Use Open Frameworks Data Engineering - The core of LLM Frameworks is retrieval augmentation. - The first pillar of retrieval augmentation is made up of prompt engineering to know how to ask the question. - The second pillar of retrieval augmentation is made up of data engineering, how to prepare the data. - The last pillar of retrieval is basic software engineering. - Try it out on your own first, but quickly go to a framework Try it on your own ..
  • 21. 21 Thank you and Dream Big. Hire us - Design Workshops - Innovation Sprints - Service Catalog Anant.us - Read our Playbook - Join our Mailing List - Read up on Data Platforms - Watch our Videos - Download Examples