SlideShare ist ein Scribd-Unternehmen logo
1 von 22
Downloaden Sie, um offline zu lesen
Smart Personal Shopping
Assistant (SPA)
Arvind Rapaka
Sairam Bantupalli
Ravindra Nath
SpotDy Inc
www.spotdy.com
Why we need?
A Personal Shopping Assistant is an occupation
where people help customers by giving advice and
making suggestions. They are employed by
departmental Stores.
But you have a Mobile/Web Ecommerce business.
How can you enable your mobile/web application turn
into a smart personal shopper for your customers?
Enter - Smart Personal Shopping Assistant.
Customer
Store
Assistant
Online Store
Why we need ?
I need skinny pants that
girls like. My size is 32
inch waist and 34 length.
Here you go. Let me
know If I should filter by
price, size or brand
I like it. My price range is
40-50 dollars.
I need skinny pants that
girls like. 32 inch waist,
34 length.
I like this pant. Let’s buy
it.
Ok, I placed the order.
You should receive your
order by tomorrow. Best
of luck.
Why we need ?
Can you place an order
of red skinny pants that I
ordered last year
Do you want the same
size?
Yes
I have placed the order.
You should receive your
pants by tomorrow.
Process Overview
ASR Image Q&A
Knowledge Graph/ Image DB
● Speech Recognition
● Image Matching
● Q & A Dialogue
ASR- Acoustic Speech Recognition
Q&A - Question and Answers Dialogue
SPA - System Call Flow
Q&A Dialogue and IR
ASR
Image Analysis
Engine
Pre-computed
KD
SpotDy BigAITM
Platform
Image
Text
Voice
IR - Information Retrieval
KD - Knowledge Graph DB
Dialogue/Action
Dialogue/Action
Components
Knowledge Database (KD)
● Build Product Knowledge Database
○ Classification (LDA, Existing Taxonomy)
○ NLP Analysis (CRF, Bayesian etc ..)
○ Image Analysis/ Text Attribution (SURF)
○ Ontologies
Image Matching
● Image Analysis
○ Extract Feature (SURF Feature Extraction)
■ Find keypoints
○ Grouping Descriptors (SURF Feature Descriptor)
■ Keypoints are grouped in descriptors
○ Match images in the precomputed descriptor database.
○ Post Processing
ASR
● Speech Recognition
○ Extract Feature vectors
○ Speech Decoder
■ Scoring (DNN)
■ Most Likely Text from Acoustic Model (HMM/Viterbi Algorithm)
○ Pass to Q & A system
Question and Answering
● Query Analysis
○ Query Processing (Stemming, lemma, Gazetteer ..)
○ Understand User Intent (HMM )
■ Navigational/Specificity
■ Initiate dialogue if necessary
○ Query POS Labelling / Entity Extraction (CRF)
○ Query Rewrite/ Retrieval/Post Processing
Q&A Personalization
● Q & A results should be personalized and
aggregated based on:
○ Past user history
○ User Geo/Demo
○ Occasions such as Christmas, Thanksgiving etc ..
SpotDy BigAITM
Query Results
Algorithms
SURF (Speeded up Robust Features)
SURF is a feature detection process to examine an
image to extract features, that are unique to the objects
in the image. Based on SIFT but faster.
In our case, it help in retrieving similar products based
on images.
Process Involves :
○ Build Scale Space
○ LoG Approximation
○ Key Point Extraction
○ Generate Features
LoG Approximation
● The Laplacian is a 2-D isotropic measure of the 2nd spatial derivative of an
image.
● The Laplacian of an image highlights regions of rapid intensity change to
detect edges.
● Uses Gaussian smoothing filter in order to reduce its sensitivity to noise due
to second derivation
GMM/DNN-HMM
HMM is a generative probabilistic model that provides
a framework for modelling time-varying spectral vector
sequences. In our case, we use for speech recognition.
● GMM/DNN produce posterior probabilities for HMM States
● predicts likelihood of observation sequence being generated by
state sequence using Viterbi Algo
● Sub word HMMs concatenate to create larger word-based HMM
Observations (Feature vectors)
GMM/DNN
HMM States
(Senones)
Posterior Probabilities
NLP
Knowledge Database (KD) is the key for the query
processing and information retrieval
● NLP is extensively used to process unstructured data in building
KD.
Algorithms:
● Conditional Random Fields/Maxent for POS Tagging, Entity
Extraction, concept tagging etc.
● LDA for topic Analysis and Classification
Q&A Dialogue and IR
Indexed
KD
Product
Catalog
Product
metainfo
NLP Engine
Query Processing
Query
Indexed
KD
Annotators/Filters
Results
● User Query pass goes through
various annotators. Some of the
few annotators include :
○ Gazetteer, Lemmatization,
Stemming, POS Tagging,
Entity Extraction
● Query Rewrite
● Search - Similarity (IR). Basic
Algorithms include
○ Vector Space Modelling
○ BM25F
● Result Generation
Scalability
SPA - HA Architecture
Significant computing resources are required while
scaling to millions of requests in real time.
BigAITM
BigAITM
is purpose built for the scalability
of applications such as SPA.
● Building KD (Knowledge Database)
● Image Repository Store
● Query Processing
● Scalable Machine Learning Models
Q&A

Weitere ähnliche Inhalte

Andere mochten auch

Personal digital assistant
Personal digital assistantPersonal digital assistant
Personal digital assistantMarietaaaa
 
Natural Language Processing in AI
Natural Language Processing in AINatural Language Processing in AI
Natural Language Processing in AISaurav Shrestha
 
Ai powered personal assistants
Ai powered personal assistantsAi powered personal assistants
Ai powered personal assistantsSavannah Boettcher
 
The new era of bots, multi-channel messaging solutions, dynamic routing and a...
The new era of bots, multi-channel messaging solutions, dynamic routing and a...The new era of bots, multi-channel messaging solutions, dynamic routing and a...
The new era of bots, multi-channel messaging solutions, dynamic routing and a...Nicola Junior Vitto
 
Powerpoint Artificial Heart
Powerpoint Artificial HeartPowerpoint Artificial Heart
Powerpoint Artificial Heartbfarber
 
Unlocking Business Value Using Data
Unlocking Business Value Using DataUnlocking Business Value Using Data
Unlocking Business Value Using DataSplunk
 
The lifecycle of a chatbot
The lifecycle of a chatbotThe lifecycle of a chatbot
The lifecycle of a chatbotSohan Maheshwar
 
Virtual personal assistant
Virtual personal assistantVirtual personal assistant
Virtual personal assistantShubham Bhalekar
 
Social Media in the Life of the Digital Library by Lorri Mon
Social Media in the Life of the Digital Library by Lorri MonSocial Media in the Life of the Digital Library by Lorri Mon
Social Media in the Life of the Digital Library by Lorri Monipl2: Information You Can Trust
 
RememberItNow! A Great Employee Benefit
RememberItNow! A Great Employee BenefitRememberItNow! A Great Employee Benefit
RememberItNow! A Great Employee BenefitRememberItNow
 
CES 2016 – GfK smart home presentation
CES 2016 – GfK smart home presentationCES 2016 – GfK smart home presentation
CES 2016 – GfK smart home presentationGfK
 
How to use google calendar
How to use google calendar How to use google calendar
How to use google calendar Ailyn Arcangeles
 
Smart Home technologies
Smart Home technologiesSmart Home technologies
Smart Home technologiesloggcity
 
Chatbot Artificial Intelligence
Chatbot Artificial IntelligenceChatbot Artificial Intelligence
Chatbot Artificial IntelligenceMd. Mahedi Mahfuj
 

Andere mochten auch (16)

Personal digital assistant
Personal digital assistantPersonal digital assistant
Personal digital assistant
 
Natural Language Processing in AI
Natural Language Processing in AINatural Language Processing in AI
Natural Language Processing in AI
 
Ai powered personal assistants
Ai powered personal assistantsAi powered personal assistants
Ai powered personal assistants
 
The new era of bots, multi-channel messaging solutions, dynamic routing and a...
The new era of bots, multi-channel messaging solutions, dynamic routing and a...The new era of bots, multi-channel messaging solutions, dynamic routing and a...
The new era of bots, multi-channel messaging solutions, dynamic routing and a...
 
Powerpoint Artificial Heart
Powerpoint Artificial HeartPowerpoint Artificial Heart
Powerpoint Artificial Heart
 
smart assistant
smart assistantsmart assistant
smart assistant
 
Unlocking Business Value Using Data
Unlocking Business Value Using DataUnlocking Business Value Using Data
Unlocking Business Value Using Data
 
NLP
NLPNLP
NLP
 
The lifecycle of a chatbot
The lifecycle of a chatbotThe lifecycle of a chatbot
The lifecycle of a chatbot
 
Virtual personal assistant
Virtual personal assistantVirtual personal assistant
Virtual personal assistant
 
Social Media in the Life of the Digital Library by Lorri Mon
Social Media in the Life of the Digital Library by Lorri MonSocial Media in the Life of the Digital Library by Lorri Mon
Social Media in the Life of the Digital Library by Lorri Mon
 
RememberItNow! A Great Employee Benefit
RememberItNow! A Great Employee BenefitRememberItNow! A Great Employee Benefit
RememberItNow! A Great Employee Benefit
 
CES 2016 – GfK smart home presentation
CES 2016 – GfK smart home presentationCES 2016 – GfK smart home presentation
CES 2016 – GfK smart home presentation
 
How to use google calendar
How to use google calendar How to use google calendar
How to use google calendar
 
Smart Home technologies
Smart Home technologiesSmart Home technologies
Smart Home technologies
 
Chatbot Artificial Intelligence
Chatbot Artificial IntelligenceChatbot Artificial Intelligence
Chatbot Artificial Intelligence
 

Ähnlich wie Personal Shopping Assistant - A Big Data Problem

Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial Intelligenceananth
 
A step towards machine learning at accionlabs
A step towards machine learning at accionlabsA step towards machine learning at accionlabs
A step towards machine learning at accionlabsChetan Khatri
 
Enterprise PHP Architecture through Design Patterns and Modularization (Midwe...
Enterprise PHP Architecture through Design Patterns and Modularization (Midwe...Enterprise PHP Architecture through Design Patterns and Modularization (Midwe...
Enterprise PHP Architecture through Design Patterns and Modularization (Midwe...Aaron Saray
 
Deep learning applications in e-commerce search: Dynamic talks Chicago 3/14/2019
Deep learning applications in e-commerce search: Dynamic talks Chicago 3/14/2019Deep learning applications in e-commerce search: Dynamic talks Chicago 3/14/2019
Deep learning applications in e-commerce search: Dynamic talks Chicago 3/14/2019Grid Dynamics
 
Being a Data Science Product Manager
Being a Data Science Product ManagerBeing a Data Science Product Manager
Being a Data Science Product ManagerRam Narayan Subudhi
 
Machine learning and TensorFlow
Machine learning and TensorFlowMachine learning and TensorFlow
Machine learning and TensorFlowJose Papo, MSc
 
Big & Personal: the data and the models behind Netflix recommendations by Xa...
 Big & Personal: the data and the models behind Netflix recommendations by Xa... Big & Personal: the data and the models behind Netflix recommendations by Xa...
Big & Personal: the data and the models behind Netflix recommendations by Xa...BigMine
 
2016 XUG Conference Big Data: Big Deal for Personalized Communications or Meh?
2016 XUG Conference   Big Data: Big Deal for Personalized Communications or Meh?2016 XUG Conference   Big Data: Big Deal for Personalized Communications or Meh?
2016 XUG Conference Big Data: Big Deal for Personalized Communications or Meh?Jeffrey Stewart
 
Geek Sync | A Photographer and a Data Modeler Walk into a Bar…
Geek Sync | A Photographer and a Data Modeler Walk into a Bar…Geek Sync | A Photographer and a Data Modeler Walk into a Bar…
Geek Sync | A Photographer and a Data Modeler Walk into a Bar…IDERA Software
 
Introduction to ML and solutions challenge (GDSC)
Introduction to ML and solutions challenge (GDSC)Introduction to ML and solutions challenge (GDSC)
Introduction to ML and solutions challenge (GDSC)AbrarSharif2
 
GDSC Machine Learning Session Presentation
GDSC Machine Learning Session PresentationGDSC Machine Learning Session Presentation
GDSC Machine Learning Session Presentationgdsclavasa
 
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleQcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleXavier Amatriain
 
Generative AI by Salesforce Admin Group Dehradun
Generative AI by Salesforce Admin Group DehradunGenerative AI by Salesforce Admin Group Dehradun
Generative AI by Salesforce Admin Group DehradunkailashChandra95
 
Live predictions with schemaless data at scale. MLMU Kosice, Exponea
Live predictions with schemaless data at scale. MLMU Kosice, ExponeaLive predictions with schemaless data at scale. MLMU Kosice, Exponea
Live predictions with schemaless data at scale. MLMU Kosice, ExponeaData Science Club
 
Artificial Intelligence (ML - DL)
Artificial Intelligence (ML - DL)Artificial Intelligence (ML - DL)
Artificial Intelligence (ML - DL)ShehryarSH1
 
infoShare AI Roadshow 2018 - Adam Karwan (Groupon) - Jak wykorzystać uczenie ...
infoShare AI Roadshow 2018 - Adam Karwan (Groupon) - Jak wykorzystać uczenie ...infoShare AI Roadshow 2018 - Adam Karwan (Groupon) - Jak wykorzystać uczenie ...
infoShare AI Roadshow 2018 - Adam Karwan (Groupon) - Jak wykorzystać uczenie ...Infoshare
 
Domain specific nlp pipelines
Domain specific nlp pipelinesDomain specific nlp pipelines
Domain specific nlp pipelinesRajesh Muppalla
 

Ähnlich wie Personal Shopping Assistant - A Big Data Problem (20)

Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial Intelligence
 
A step towards machine learning at accionlabs
A step towards machine learning at accionlabsA step towards machine learning at accionlabs
A step towards machine learning at accionlabs
 
Search and fpga
Search and fpgaSearch and fpga
Search and fpga
 
Enterprise PHP Architecture through Design Patterns and Modularization (Midwe...
Enterprise PHP Architecture through Design Patterns and Modularization (Midwe...Enterprise PHP Architecture through Design Patterns and Modularization (Midwe...
Enterprise PHP Architecture through Design Patterns and Modularization (Midwe...
 
Deep learning applications in e-commerce search: Dynamic talks Chicago 3/14/2019
Deep learning applications in e-commerce search: Dynamic talks Chicago 3/14/2019Deep learning applications in e-commerce search: Dynamic talks Chicago 3/14/2019
Deep learning applications in e-commerce search: Dynamic talks Chicago 3/14/2019
 
Being a Data Science Product Manager
Being a Data Science Product ManagerBeing a Data Science Product Manager
Being a Data Science Product Manager
 
Machine learning and TensorFlow
Machine learning and TensorFlowMachine learning and TensorFlow
Machine learning and TensorFlow
 
Big & Personal: the data and the models behind Netflix recommendations by Xa...
 Big & Personal: the data and the models behind Netflix recommendations by Xa... Big & Personal: the data and the models behind Netflix recommendations by Xa...
Big & Personal: the data and the models behind Netflix recommendations by Xa...
 
2016 XUG Conference Big Data: Big Deal for Personalized Communications or Meh?
2016 XUG Conference   Big Data: Big Deal for Personalized Communications or Meh?2016 XUG Conference   Big Data: Big Deal for Personalized Communications or Meh?
2016 XUG Conference Big Data: Big Deal for Personalized Communications or Meh?
 
Geek Sync | A Photographer and a Data Modeler Walk into a Bar…
Geek Sync | A Photographer and a Data Modeler Walk into a Bar…Geek Sync | A Photographer and a Data Modeler Walk into a Bar…
Geek Sync | A Photographer and a Data Modeler Walk into a Bar…
 
Introduction to ML and solutions challenge (GDSC)
Introduction to ML and solutions challenge (GDSC)Introduction to ML and solutions challenge (GDSC)
Introduction to ML and solutions challenge (GDSC)
 
GDSC Machine Learning Session Presentation
GDSC Machine Learning Session PresentationGDSC Machine Learning Session Presentation
GDSC Machine Learning Session Presentation
 
GDSC BPIT ML Campaign.pptx
GDSC BPIT ML Campaign.pptxGDSC BPIT ML Campaign.pptx
GDSC BPIT ML Campaign.pptx
 
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleQcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
 
Role of Data Science in eCommerce
Role of Data Science in eCommerceRole of Data Science in eCommerce
Role of Data Science in eCommerce
 
Generative AI by Salesforce Admin Group Dehradun
Generative AI by Salesforce Admin Group DehradunGenerative AI by Salesforce Admin Group Dehradun
Generative AI by Salesforce Admin Group Dehradun
 
Live predictions with schemaless data at scale. MLMU Kosice, Exponea
Live predictions with schemaless data at scale. MLMU Kosice, ExponeaLive predictions with schemaless data at scale. MLMU Kosice, Exponea
Live predictions with schemaless data at scale. MLMU Kosice, Exponea
 
Artificial Intelligence (ML - DL)
Artificial Intelligence (ML - DL)Artificial Intelligence (ML - DL)
Artificial Intelligence (ML - DL)
 
infoShare AI Roadshow 2018 - Adam Karwan (Groupon) - Jak wykorzystać uczenie ...
infoShare AI Roadshow 2018 - Adam Karwan (Groupon) - Jak wykorzystać uczenie ...infoShare AI Roadshow 2018 - Adam Karwan (Groupon) - Jak wykorzystać uczenie ...
infoShare AI Roadshow 2018 - Adam Karwan (Groupon) - Jak wykorzystać uczenie ...
 
Domain specific nlp pipelines
Domain specific nlp pipelinesDomain specific nlp pipelines
Domain specific nlp pipelines
 

Kürzlich hochgeladen

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 

Kürzlich hochgeladen (20)

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

Personal Shopping Assistant - A Big Data Problem

  • 1. Smart Personal Shopping Assistant (SPA) Arvind Rapaka Sairam Bantupalli Ravindra Nath SpotDy Inc www.spotdy.com
  • 2. Why we need? A Personal Shopping Assistant is an occupation where people help customers by giving advice and making suggestions. They are employed by departmental Stores. But you have a Mobile/Web Ecommerce business. How can you enable your mobile/web application turn into a smart personal shopper for your customers? Enter - Smart Personal Shopping Assistant. Customer Store Assistant Online Store
  • 3. Why we need ? I need skinny pants that girls like. My size is 32 inch waist and 34 length. Here you go. Let me know If I should filter by price, size or brand I like it. My price range is 40-50 dollars. I need skinny pants that girls like. 32 inch waist, 34 length. I like this pant. Let’s buy it. Ok, I placed the order. You should receive your order by tomorrow. Best of luck.
  • 4. Why we need ? Can you place an order of red skinny pants that I ordered last year Do you want the same size? Yes I have placed the order. You should receive your pants by tomorrow.
  • 5. Process Overview ASR Image Q&A Knowledge Graph/ Image DB ● Speech Recognition ● Image Matching ● Q & A Dialogue ASR- Acoustic Speech Recognition Q&A - Question and Answers Dialogue
  • 6. SPA - System Call Flow Q&A Dialogue and IR ASR Image Analysis Engine Pre-computed KD SpotDy BigAITM Platform Image Text Voice IR - Information Retrieval KD - Knowledge Graph DB Dialogue/Action Dialogue/Action
  • 8. Knowledge Database (KD) ● Build Product Knowledge Database ○ Classification (LDA, Existing Taxonomy) ○ NLP Analysis (CRF, Bayesian etc ..) ○ Image Analysis/ Text Attribution (SURF) ○ Ontologies
  • 9. Image Matching ● Image Analysis ○ Extract Feature (SURF Feature Extraction) ■ Find keypoints ○ Grouping Descriptors (SURF Feature Descriptor) ■ Keypoints are grouped in descriptors ○ Match images in the precomputed descriptor database. ○ Post Processing
  • 10. ASR ● Speech Recognition ○ Extract Feature vectors ○ Speech Decoder ■ Scoring (DNN) ■ Most Likely Text from Acoustic Model (HMM/Viterbi Algorithm) ○ Pass to Q & A system
  • 11. Question and Answering ● Query Analysis ○ Query Processing (Stemming, lemma, Gazetteer ..) ○ Understand User Intent (HMM ) ■ Navigational/Specificity ■ Initiate dialogue if necessary ○ Query POS Labelling / Entity Extraction (CRF) ○ Query Rewrite/ Retrieval/Post Processing
  • 12. Q&A Personalization ● Q & A results should be personalized and aggregated based on: ○ Past user history ○ User Geo/Demo ○ Occasions such as Christmas, Thanksgiving etc .. SpotDy BigAITM Query Results
  • 14. SURF (Speeded up Robust Features) SURF is a feature detection process to examine an image to extract features, that are unique to the objects in the image. Based on SIFT but faster. In our case, it help in retrieving similar products based on images. Process Involves : ○ Build Scale Space ○ LoG Approximation ○ Key Point Extraction ○ Generate Features
  • 15. LoG Approximation ● The Laplacian is a 2-D isotropic measure of the 2nd spatial derivative of an image. ● The Laplacian of an image highlights regions of rapid intensity change to detect edges. ● Uses Gaussian smoothing filter in order to reduce its sensitivity to noise due to second derivation
  • 16. GMM/DNN-HMM HMM is a generative probabilistic model that provides a framework for modelling time-varying spectral vector sequences. In our case, we use for speech recognition. ● GMM/DNN produce posterior probabilities for HMM States ● predicts likelihood of observation sequence being generated by state sequence using Viterbi Algo ● Sub word HMMs concatenate to create larger word-based HMM Observations (Feature vectors) GMM/DNN HMM States (Senones) Posterior Probabilities
  • 17. NLP Knowledge Database (KD) is the key for the query processing and information retrieval ● NLP is extensively used to process unstructured data in building KD. Algorithms: ● Conditional Random Fields/Maxent for POS Tagging, Entity Extraction, concept tagging etc. ● LDA for topic Analysis and Classification Q&A Dialogue and IR Indexed KD Product Catalog Product metainfo NLP Engine
  • 18. Query Processing Query Indexed KD Annotators/Filters Results ● User Query pass goes through various annotators. Some of the few annotators include : ○ Gazetteer, Lemmatization, Stemming, POS Tagging, Entity Extraction ● Query Rewrite ● Search - Similarity (IR). Basic Algorithms include ○ Vector Space Modelling ○ BM25F ● Result Generation
  • 20. SPA - HA Architecture Significant computing resources are required while scaling to millions of requests in real time.
  • 21. BigAITM BigAITM is purpose built for the scalability of applications such as SPA. ● Building KD (Knowledge Database) ● Image Repository Store ● Query Processing ● Scalable Machine Learning Models
  • 22. Q&A