SlideShare ist ein Scribd-Unternehmen logo
1 von 79
Downloaden Sie, um offline zu lesen
Helixa
Audience Projection of Target Consumers over
Multiple Domains: a NER and Bayesian approach
Gianmario Spacagna
Chief Scientist @ Helixa
O’Reilly AI Conference
London, 16th October 2019
About Me
7+ years experience in Data Science and Machine Learning
Currently leading a team of ML Scientists and ML Engineers
Background in Telematics and Software Engineering of Distributed Systems
Ongoing MBA Student
Co-author of Python Deep Learning
Contributor of the Professional Data Science Manifesto
Blogger of Data Science Vademecum
Founder of the Data Science Milan community (1.4k members)
Stockholm, London, Milan
Gianmario Spacagna
Chief Scientist, Helixa
gspacagna@helixa.ai
DEMOGRAPHICS
HHI < 40K
Female
18 - 24
INFLUENCERS
ODESZA Cardi B
Shane DawsonJames Charles
Helixa is Market
Research platform
that uses AI to
integrate disparate
data sources into an
enriched view of the
consumers who
matter to your
business.
INTERESTS
Listen to Podcasts Kylie Cosmetics
Fan
Starbucks
Chipotle
PSYCHOGRAPHICS
Fast Food
Fans
Fashion
Enthusiasts
Entertainment
Junkies
In the next 40 minutes...
OUR GOAL:
Discuss some of the current challenges of traditional market
research and propose a novel solution based on Named Entity
Recognition (NER) and Bayesian Inference.
Challenges in Market Research
Applied Social Science
What is Market Research?
Gain Insights for Strategic Decisions
Information about
individuals and organizations Statistical Inference
Why Market Research matters?
Brands Perceptions
Consumers Preferences
and Behaviors
Buyer Personas
Market Segmentation
Identify OpportunitiesMarket Trends
Approaches to Market Research
Opinions and individual experiences
In-depth interviews
Smaller sample
Qualitative Quantitative
Numbers and Data
Statistics
Larger sample
Quantitative Market Research is conducted with Surveys
Define
Analyze
DistributeCollect
Design
Limitations of Surveys
Expensive
Invasive
Response Bias
Predefined questions
Narrow coverage
Market Research using “Implicit Consumers Feedback”
Define
Analyze
DistributeCollect
Design
vs.
e.g. Social Listening
Twitter Interactions
Inferring Interests from Twitter Interactions
Advantages of Implicit Consumer Feedback Approaches
Flexible costs
Wide view
Opportunities for Big Data and AIMass coverage
Spontaneous
What about other information?
Twitter Interactions
Amazon Purchases
?
Beer Consumption Brand
?
The Universe of Consumers Datasets
Social Media
Financial and
Properties
Behaviors
First Party
(CSM)
Consumer
Research
Surveys
SCATTERED PARTIAL SKEWED
M A L E F E M A L E
18-30
31-43
44-56
57-70
Individual Consumers Datasets are Far From Being Exhaustive
ALL IN ONE COMPLETE REPRESENTATIVE
M A L E F E M A L E
18-30
31-43
44-56
57-70
The Holy Grail of Market Research
What is the baseline
algorithm for
“completing”
datasets?
Look-alike Fusion
What is look-alike fusion?
Left:
Social Network Panel
Right:
Consumptions Survey Panel
Assignment Optimization Problem
● Hungarian method
● Simplex
● Auction algorithm
Well-known solutions:
Datasets Fusion
X X
X X X
X X
X X X
X
Left
User
Right
User
left-only entities right-only entities
Target
Audience
=
Look-alike Fusions Requires a Main Panel Centrality
Look-alike Fusions Don’t Scale Well
Differences in feature
space
Craftsmanship required
at each change of data
Universal objective
function to optimize
Is there a more
scalable way to
“fuse” datasets?
The Audience Projection
Audience Projection defined as “User Binary Classification”
Source:
Social Network Panel
Destination:
Consumptions Survey Panel
70M
Social accounts
200M
U.S. consumers
1.6M / 26M /
TRUE
FALSE
TRUE
FALSE
Target
Audience
=
PROJECTION
Ben & Jerry’s: bought in
last 6 months?
Affinity: 1.80x
Venmo: paid in last 30 days?
Affinity: 1.6x
Angry Orchard: drunk in
last 6 months?
Affinity: 1.50x
Solution = Named Entity Recognition (NER) + Bayesian Model
Social
Pages
Consumption
Questions
NER NER
BAYESIAN MODEL
ENTITY LINKING (NEL)
Destination:
Consumptions Survey Panel
Source:
Social Network Panel
Projected Users
Probabilities
Target
Audience
Entities Represent an Universal Feature Space
Social
Pages
Consumption
Questions
Listed
Products
NER NER NER
The Coca-Cola Company is a total beverage
company, offering over 500 brands in more
than 200 countries and territories.
Named Entity Recognition(NER) in each Domain
Social
Pages
Consumption
Questions
Listed
Products
Adidas Originals Men's Relaxed Strapback Cap
Coca-Cola KWC-4 6-Can Personal Mini 12V DC Car and 110V
AC Cooler, Red
NLP Libraries with NER capability
Polyglot
Deep
Pavlov
Why for Production?
Fast Accurate
Industry-grade maturity
example of NER usage
Same Entity May Exist with Different Spellings
Interacted with
Coca-Cola Company on
Social Networks
“Have you consumed
Coca-Cola last week?”
Linking and Normalizing Entities via
en.wikipedia.org/wiki/Coca-Cola
en.wikipedia.org/wiki/The_Coca-Cola_Company
Entity
Relationship
Normalized Entities means a Common Feature Space
Stacked Heterogeneous Feature Space
X X ? ?
X X ? ?
? ? X X X X
? ? X X X
? ? X X X
Source
Users
Destination
Users
source-only entities common entities destination-only entities
Latent
interests
Target
Audience
=
Common Entities translate Source to Destination
Source:
Social Network Panel
Destination:
Consumptions Survey Panel
Target
Audience
=
Common Entities
?Bayesian
Model
Source Target Size
1.6M / 70M = 2.3%
Share of
Interests
“Share of interests” encode the DNA of the Target Audience
Global
share of interests:
100%
Common Entities
Target audience
share of interests:
50%
17%
50%
Target Audience
slice
Bayesian Model
Posterior
Probability of user belonging to
projected target given the
Share of Interests on common entities
𝐏( / ) =∈
𝐏( / )∙𝐏( )∈ ∈
𝐏( )
Evidence
Prior
Source Target Size=2.3%Likelihood
Evidence Decomposition
𝐏( )
Evidence
𝐏( / )∙𝐏( )∈ ∈
𝐏( / )∙𝐏( )∉ ∉
Marginal Positive Likelihood
Binomial distribution
𝐏( / ) ≈∈p=17%
Joint Likelihood under Naive Assumption
𝐏( , , / ) =50%17% 50%
∈
𝐏( / )∙17%
∈
𝐏( / )∙50%
∈
𝐏( / )50%
∈
Destination
variables
TeenNick Robot
Chicken
Bob’s
Burgers
Ben &
Jerry’s
Venmo Angry
Orchard
Nintendo
DSi XL
Video
Games
Audio or
Video Chat
Affinity 8.9x 7.27x 2.36x 1.80x 1.62x 1.55.x 1.47x 1.45x 1.23x
Predicted Probabilities provides Insights on the Projected Users
PROJECTIONTarget
Audience
=
Projected Users
Probabilities
Insights on Destination Variables
𝐏( / )∈
Audience Projection In a Nutshell
Social Panel Consumptions Survey PanelCommon Entities
Bayesian Model
Target
Audience
=
Affinity: 1.80x
Affinity: 1.55x
Affinity: 1.62x
Cool! How do you
know this is
accurate?
Evaluation Techniques
Binary Classifier Evaluation
Bayesian Model
Projected Users Probabilities
Ground Truth
Evaluation
techniques
?
Validate via Common Entities
X
X
X X X
X X
X
Source
Users
Destination
Users
common entities
Target
Audience
OR=
Projected
Audience
OR=
Exact Query Replica
Ground
Truth
Validate via Self Reconstruction Within the Same Domain
X X X
X X X
X X X X X X
X X X X
X X X
Source
Users
Destination
Users
source-only entities common entities destination-only entities
Target
Audience
=
Ground
Truth
Validate via Double-step Reconstruction
PROJECTION PROJECTION
Predicted
probabilities
Ground
Truth
Repeat Test Cases Stratifying by Category
Demographics Skewness
PROJECTION
Golden Benchmarks Comparison on Aggregated Insights
Opportunities
Many Linked Views of the Same Global Population
Audience
Projection
Multiple Perspectives Reinforce Reliability
Social Panel
Target
Audience
=
Interacted with Game
Informer social page
Affinity: 2.17x
Have you read any Game
Informer issue?
Affinity: 1.73x
Game Informer Single Issue
Magazine purchased online
Affinity: 2.51x
Generalize Audience Projection as a Domain Adaptation Problem
Final Remarks
Many Datasets
but
only Partial Views
Look-alike
fusions don’t
scale well
Audience Projection
adapts to any
“entity domain”
Bayesian Model
Accuracy and
Biases can be
quantified
Strategists now
have a complete
view of their
Target Audience
Gianmario Spacagna
Chief Scientist at Helixa.ai
gspacagna@helixa.ai
@gm_spacagna
Appendix A:
The spaCy NER Model
Natural Language Processing (NLP) Pipeline
"Mark Watney visited Mars"
The spaCy NER Model Overview
EMBED
ENCODE
ATTEND
PREDICT
Embedding Words
Features
token lower prefix suffix shape
Apple apple app ple Wwwww
U.K. uk uk uk W.W.
Fahrenheit 451 fahrenheit 451 fah 451 Wwwwwwwwww ddd
Each word (token) is represented by concatenating
the embeddings of all of the 4 features in order to
generalize the context for unknown words.
Efficiently Embedding Words
Hash Embedding reduces the dimensionality and
allows to deal with large vocabularies
Encoding Sequences of Words
Residual Convolutional Neural Networks allows to
encode context-independent word vectors into a
context-sensitive sentence matrix.
Raw tri-gram chunk Enriched tri-gram matrix
Mark
Watney
visited
“Mark Watney visited Mars”
Crafting the Attention Vector
The attention vector of the trigram includes
information on the encountered entities.
“Mark Watney visited Mars”
Attention vector
Tri-gram matrix
Enriched
tri-gram vector
Predicting the Recognized Entities
Actions:
SHIFT
OUT
REDUCE (Entity Tagging)
Stack Buffer Segment
“Mark Watney visited Mars”
Actions:
1.SHIFT
2.SHIFT
3.REDUCE (PER)
4.OUT
5.SHIFT
6.REDUCE (LOC)
Mark
Watney
Mars
Mark
Watney
visited
Mars
Enriched
tri-gram vector
Update
attention
Attention vector
Tri-gam matrix
Official Explanation of spaCy NER Model
https://www.youtube.com/watch?v=sqDHBH9IjRU
Appendix B:
The Bayesian Model
Projecting the Share of Interests on Common Entities
Target
Audience
Projection
50%
17%
50%
Share of Interests:
SIZE: 60M
SIZE: 200M
SIZE: ?
SIZE: 40M
Global Audience
(average american)
=
Target
Audience evidence
prior
Evidence Statistics on Share of Interests
N = 180M users in U.S. population
sampling rate = 1 : 10k
n = 18k users in sample panel
p = 17% of market penetration
x = 3k expected projected users
SIZE: 200M
SIZE: 40M
statistics:
evidence
𝐏( / ) =
Binomial Positive Likelihood
n = 17999
x = 2999
log(p)=-5.56323
Probability of selecting 3000 / 18000 McDonald’s panel
users given that the user IS part of the target∈
n = 18000
x = 3000
log(p)=-5.54342
is smaller than
p=17%
𝐏( / ) =
Binomial Negative Likelihood
n = 17999
x = 2999
log(p)=-5.53942
Probability of selecting 3000 / 18000 McDonald’s panel
users given that the user IS NOT part of the target∉
n = 18000
x = 3000
log(p)=-5.54342
p=17%
is greater than

Weitere ähnliche Inhalte

Ähnlich wie Audience projection of target consumers over multiple domains a ner and bayesian approach, Gianmario Spacagna, Alberto Pirovano

Eswc2013 audience short
Eswc2013 audience shortEswc2013 audience short
Eswc2013 audience short
Claudia Wagner
 
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Sri Ambati
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
Marcel Kurovski
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
inovex GmbH
 
AI-SDV 2021: Francisco Webber - Efficiency is the New Precision
AI-SDV 2021: Francisco Webber - Efficiency is the New PrecisionAI-SDV 2021: Francisco Webber - Efficiency is the New Precision
AI-SDV 2021: Francisco Webber - Efficiency is the New Precision
Dr. Haxel Consult
 
Natural Language Processing & Semantic Models in an Imperfect World
Natural Language Processing & Semantic Modelsin an Imperfect WorldNatural Language Processing & Semantic Modelsin an Imperfect World
Natural Language Processing & Semantic Models in an Imperfect World
Vital.AI
 

Ähnlich wie Audience projection of target consumers over multiple domains a ner and bayesian approach, Gianmario Spacagna, Alberto Pirovano (20)

Eswc2013 audience short
Eswc2013 audience shortEswc2013 audience short
Eswc2013 audience short
 
Driverless AI Hands-on Focused on Machine Learning Interpretability - H2O.ai
Driverless AI Hands-on Focused on Machine Learning Interpretability - H2O.aiDriverless AI Hands-on Focused on Machine Learning Interpretability - H2O.ai
Driverless AI Hands-on Focused on Machine Learning Interpretability - H2O.ai
 
Using Formal Concept Analysis to Construct and Visualise Hierarchies of Socio...
Using Formal Concept Analysis to Construct and Visualise Hierarchies of Socio...Using Formal Concept Analysis to Construct and Visualise Hierarchies of Socio...
Using Formal Concept Analysis to Construct and Visualise Hierarchies of Socio...
 
2014 01-ticosa
2014 01-ticosa2014 01-ticosa
2014 01-ticosa
 
Building better knowledge graphs through social computing
Building better knowledge graphs through social computingBuilding better knowledge graphs through social computing
Building better knowledge graphs through social computing
 
Mythrealities
MythrealitiesMythrealities
Mythrealities
 
Twente ir-course 20-10-2010
Twente ir-course 20-10-2010Twente ir-course 20-10-2010
Twente ir-course 20-10-2010
 
Six Easy Pieces of Quantitatively Analyzing Open Source
Six Easy Pieces of Quantitatively Analyzing Open SourceSix Easy Pieces of Quantitatively Analyzing Open Source
Six Easy Pieces of Quantitatively Analyzing Open Source
 
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
On building more human query answering systems
On building more human query answering systemsOn building more human query answering systems
On building more human query answering systems
 
Software Analytics: Towards Software Mining that Matters (2014)
Software Analytics:Towards Software Mining that Matters (2014)Software Analytics:Towards Software Mining that Matters (2014)
Software Analytics: Towards Software Mining that Matters (2014)
 
AI-SDV 2021: Francisco Webber - Efficiency is the New Precision
AI-SDV 2021: Francisco Webber - Efficiency is the New PrecisionAI-SDV 2021: Francisco Webber - Efficiency is the New Precision
AI-SDV 2021: Francisco Webber - Efficiency is the New Precision
 
Towards Statistical Queries over Distributed Private User Data
Towards Statistical Queries over Distributed Private User Data Towards Statistical Queries over Distributed Private User Data
Towards Statistical Queries over Distributed Private User Data
 
Natural Language Processing & Semantic Models in an Imperfect World
Natural Language Processing & Semantic Modelsin an Imperfect WorldNatural Language Processing & Semantic Modelsin an Imperfect World
Natural Language Processing & Semantic Models in an Imperfect World
 
Software Visualization Como 2016
Software Visualization Como 2016Software Visualization Como 2016
Software Visualization Como 2016
 
Challenges in business analytics
Challenges in business analyticsChallenges in business analytics
Challenges in business analytics
 
Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석
 
SLA Summer 2008
SLA Summer 2008SLA Summer 2008
SLA Summer 2008
 

Mehr von Data Science Milan

MLOps with a Feature Store: Filling the Gap in ML Infrastructure
MLOps with a Feature Store: Filling the Gap in ML InfrastructureMLOps with a Feature Store: Filling the Gap in ML Infrastructure
MLOps with a Feature Store: Filling the Gap in ML Infrastructure
Data Science Milan
 
Time Series Classification with Deep Learning | Marco Del Pra
Time Series Classification with Deep Learning | Marco Del PraTime Series Classification with Deep Learning | Marco Del Pra
Time Series Classification with Deep Learning | Marco Del Pra
Data Science Milan
 
Continual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco
Continual/Lifelong Learning with Deep Architectures, Vincenzo LomonacoContinual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco
Continual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco
Data Science Milan
 
3D Point Cloud analysis using Deep Learning
3D Point Cloud analysis using Deep Learning3D Point Cloud analysis using Deep Learning
3D Point Cloud analysis using Deep Learning
Data Science Milan
 
Deep time-to-failure: predicting failures, churns and customer lifetime with ...
Deep time-to-failure: predicting failures, churns and customer lifetime with ...Deep time-to-failure: predicting failures, churns and customer lifetime with ...
Deep time-to-failure: predicting failures, churns and customer lifetime with ...
Data Science Milan
 
Pricing Optimization: Close-out, Online and Renewal strategies, Data Reply
Pricing Optimization: Close-out, Online and Renewal strategies, Data ReplyPricing Optimization: Close-out, Online and Renewal strategies, Data Reply
Pricing Optimization: Close-out, Online and Renewal strategies, Data Reply
Data Science Milan
 
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig..."How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
Data Science Milan
 

Mehr von Data Science Milan (20)

ML & Graph algorithms to prevent financial crime in digital payments
ML & Graph  algorithms to prevent  financial crime in  digital paymentsML & Graph  algorithms to prevent  financial crime in  digital payments
ML & Graph algorithms to prevent financial crime in digital payments
 
How to use the Economic Complexity Index to guide innovation plans
How to use the Economic Complexity Index to guide innovation plansHow to use the Economic Complexity Index to guide innovation plans
How to use the Economic Complexity Index to guide innovation plans
 
Robustness Metrics for ML Models based on Deep Learning Methods
Robustness Metrics for ML Models based on Deep Learning MethodsRobustness Metrics for ML Models based on Deep Learning Methods
Robustness Metrics for ML Models based on Deep Learning Methods
 
"You don't need a bigger boat": serverless MLOps for reasonable companies
"You don't need a bigger boat": serverless MLOps for reasonable companies"You don't need a bigger boat": serverless MLOps for reasonable companies
"You don't need a bigger boat": serverless MLOps for reasonable companies
 
Question generation using Natural Language Processing by QuestGen.AI
Question generation using Natural Language Processing by QuestGen.AIQuestion generation using Natural Language Processing by QuestGen.AI
Question generation using Natural Language Processing by QuestGen.AI
 
Speed up data preparation for ML pipelines on AWS
Speed up data preparation for ML pipelines on AWSSpeed up data preparation for ML pipelines on AWS
Speed up data preparation for ML pipelines on AWS
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at Helixa
 
MLOps with a Feature Store: Filling the Gap in ML Infrastructure
MLOps with a Feature Store: Filling the Gap in ML InfrastructureMLOps with a Feature Store: Filling the Gap in ML Infrastructure
MLOps with a Feature Store: Filling the Gap in ML Infrastructure
 
Reinforcement Learning Overview | Marco Del Pra
Reinforcement Learning Overview | Marco Del PraReinforcement Learning Overview | Marco Del Pra
Reinforcement Learning Overview | Marco Del Pra
 
Time Series Classification with Deep Learning | Marco Del Pra
Time Series Classification with Deep Learning | Marco Del PraTime Series Classification with Deep Learning | Marco Del Pra
Time Series Classification with Deep Learning | Marco Del Pra
 
Ludwig: A code-free deep learning toolbox | Piero Molino, Uber AI
Ludwig: A code-free deep learning toolbox | Piero Molino, Uber AILudwig: A code-free deep learning toolbox | Piero Molino, Uber AI
Ludwig: A code-free deep learning toolbox | Piero Molino, Uber AI
 
Weak supervised learning - Kristina Khvatova
Weak supervised learning - Kristina KhvatovaWeak supervised learning - Kristina Khvatova
Weak supervised learning - Kristina Khvatova
 
GANs beyond nice pictures: real value of data generation, Alex Honchar
GANs beyond nice pictures: real value of data generation, Alex HoncharGANs beyond nice pictures: real value of data generation, Alex Honchar
GANs beyond nice pictures: real value of data generation, Alex Honchar
 
Continual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco
Continual/Lifelong Learning with Deep Architectures, Vincenzo LomonacoContinual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco
Continual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco
 
3D Point Cloud analysis using Deep Learning
3D Point Cloud analysis using Deep Learning3D Point Cloud analysis using Deep Learning
3D Point Cloud analysis using Deep Learning
 
Deep time-to-failure: predicting failures, churns and customer lifetime with ...
Deep time-to-failure: predicting failures, churns and customer lifetime with ...Deep time-to-failure: predicting failures, churns and customer lifetime with ...
Deep time-to-failure: predicting failures, churns and customer lifetime with ...
 
50 Shades of Text - Leveraging Natural Language Processing (NLP), Alessandro ...
50 Shades of Text - Leveraging Natural Language Processing (NLP), Alessandro ...50 Shades of Text - Leveraging Natural Language Processing (NLP), Alessandro ...
50 Shades of Text - Leveraging Natural Language Processing (NLP), Alessandro ...
 
Pricing Optimization: Close-out, Online and Renewal strategies, Data Reply
Pricing Optimization: Close-out, Online and Renewal strategies, Data ReplyPricing Optimization: Close-out, Online and Renewal strategies, Data Reply
Pricing Optimization: Close-out, Online and Renewal strategies, Data Reply
 
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig..."How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
 
A view of graph data usage by Cerved
A view of graph data usage by CervedA view of graph data usage by Cerved
A view of graph data usage by Cerved
 

Kürzlich hochgeladen

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Kürzlich hochgeladen (20)

Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Audience projection of target consumers over multiple domains a ner and bayesian approach, Gianmario Spacagna, Alberto Pirovano

  • 1. Helixa Audience Projection of Target Consumers over Multiple Domains: a NER and Bayesian approach Gianmario Spacagna Chief Scientist @ Helixa O’Reilly AI Conference London, 16th October 2019
  • 2. About Me 7+ years experience in Data Science and Machine Learning Currently leading a team of ML Scientists and ML Engineers Background in Telematics and Software Engineering of Distributed Systems Ongoing MBA Student Co-author of Python Deep Learning Contributor of the Professional Data Science Manifesto Blogger of Data Science Vademecum Founder of the Data Science Milan community (1.4k members) Stockholm, London, Milan Gianmario Spacagna Chief Scientist, Helixa gspacagna@helixa.ai
  • 3. DEMOGRAPHICS HHI < 40K Female 18 - 24 INFLUENCERS ODESZA Cardi B Shane DawsonJames Charles Helixa is Market Research platform that uses AI to integrate disparate data sources into an enriched view of the consumers who matter to your business. INTERESTS Listen to Podcasts Kylie Cosmetics Fan Starbucks Chipotle PSYCHOGRAPHICS Fast Food Fans Fashion Enthusiasts Entertainment Junkies
  • 4. In the next 40 minutes... OUR GOAL: Discuss some of the current challenges of traditional market research and propose a novel solution based on Named Entity Recognition (NER) and Bayesian Inference.
  • 6. Applied Social Science What is Market Research? Gain Insights for Strategic Decisions Information about individuals and organizations Statistical Inference
  • 7. Why Market Research matters? Brands Perceptions Consumers Preferences and Behaviors Buyer Personas Market Segmentation Identify OpportunitiesMarket Trends
  • 8. Approaches to Market Research Opinions and individual experiences In-depth interviews Smaller sample Qualitative Quantitative Numbers and Data Statistics Larger sample
  • 9. Quantitative Market Research is conducted with Surveys Define Analyze DistributeCollect Design
  • 10. Limitations of Surveys Expensive Invasive Response Bias Predefined questions Narrow coverage
  • 11. Market Research using “Implicit Consumers Feedback” Define Analyze DistributeCollect Design vs. e.g. Social Listening
  • 12. Twitter Interactions Inferring Interests from Twitter Interactions
  • 13. Advantages of Implicit Consumer Feedback Approaches Flexible costs Wide view Opportunities for Big Data and AIMass coverage Spontaneous
  • 14. What about other information? Twitter Interactions Amazon Purchases ? Beer Consumption Brand ?
  • 15. The Universe of Consumers Datasets Social Media Financial and Properties Behaviors First Party (CSM) Consumer Research Surveys
  • 16. SCATTERED PARTIAL SKEWED M A L E F E M A L E 18-30 31-43 44-56 57-70 Individual Consumers Datasets are Far From Being Exhaustive
  • 17. ALL IN ONE COMPLETE REPRESENTATIVE M A L E F E M A L E 18-30 31-43 44-56 57-70 The Holy Grail of Market Research
  • 18. What is the baseline algorithm for “completing” datasets?
  • 20. What is look-alike fusion? Left: Social Network Panel Right: Consumptions Survey Panel
  • 21. Assignment Optimization Problem ● Hungarian method ● Simplex ● Auction algorithm Well-known solutions:
  • 22. Datasets Fusion X X X X X X X X X X X Left User Right User left-only entities right-only entities Target Audience =
  • 23. Look-alike Fusions Requires a Main Panel Centrality
  • 24. Look-alike Fusions Don’t Scale Well Differences in feature space Craftsmanship required at each change of data Universal objective function to optimize
  • 25. Is there a more scalable way to “fuse” datasets?
  • 27. Audience Projection defined as “User Binary Classification” Source: Social Network Panel Destination: Consumptions Survey Panel 70M Social accounts 200M U.S. consumers 1.6M / 26M / TRUE FALSE TRUE FALSE Target Audience = PROJECTION Ben & Jerry’s: bought in last 6 months? Affinity: 1.80x Venmo: paid in last 30 days? Affinity: 1.6x Angry Orchard: drunk in last 6 months? Affinity: 1.50x
  • 28. Solution = Named Entity Recognition (NER) + Bayesian Model Social Pages Consumption Questions NER NER BAYESIAN MODEL ENTITY LINKING (NEL) Destination: Consumptions Survey Panel Source: Social Network Panel Projected Users Probabilities Target Audience
  • 29. Entities Represent an Universal Feature Space Social Pages Consumption Questions Listed Products NER NER NER
  • 30. The Coca-Cola Company is a total beverage company, offering over 500 brands in more than 200 countries and territories. Named Entity Recognition(NER) in each Domain Social Pages Consumption Questions Listed Products Adidas Originals Men's Relaxed Strapback Cap Coca-Cola KWC-4 6-Can Personal Mini 12V DC Car and 110V AC Cooler, Red
  • 31. NLP Libraries with NER capability Polyglot Deep Pavlov
  • 32. Why for Production? Fast Accurate Industry-grade maturity
  • 33. example of NER usage
  • 34. Same Entity May Exist with Different Spellings Interacted with Coca-Cola Company on Social Networks “Have you consumed Coca-Cola last week?”
  • 35. Linking and Normalizing Entities via en.wikipedia.org/wiki/Coca-Cola en.wikipedia.org/wiki/The_Coca-Cola_Company Entity Relationship
  • 36. Normalized Entities means a Common Feature Space
  • 37. Stacked Heterogeneous Feature Space X X ? ? X X ? ? ? ? X X X X ? ? X X X ? ? X X X Source Users Destination Users source-only entities common entities destination-only entities Latent interests Target Audience =
  • 38. Common Entities translate Source to Destination Source: Social Network Panel Destination: Consumptions Survey Panel Target Audience = Common Entities ?Bayesian Model Source Target Size 1.6M / 70M = 2.3% Share of Interests
  • 39. “Share of interests” encode the DNA of the Target Audience Global share of interests: 100% Common Entities Target audience share of interests: 50% 17% 50% Target Audience slice
  • 40. Bayesian Model Posterior Probability of user belonging to projected target given the Share of Interests on common entities 𝐏( / ) =∈ 𝐏( / )∙𝐏( )∈ ∈ 𝐏( ) Evidence Prior Source Target Size=2.3%Likelihood
  • 41. Evidence Decomposition 𝐏( ) Evidence 𝐏( / )∙𝐏( )∈ ∈ 𝐏( / )∙𝐏( )∉ ∉
  • 42. Marginal Positive Likelihood Binomial distribution 𝐏( / ) ≈∈p=17%
  • 43. Joint Likelihood under Naive Assumption 𝐏( , , / ) =50%17% 50% ∈ 𝐏( / )∙17% ∈ 𝐏( / )∙50% ∈ 𝐏( / )50% ∈
  • 44. Destination variables TeenNick Robot Chicken Bob’s Burgers Ben & Jerry’s Venmo Angry Orchard Nintendo DSi XL Video Games Audio or Video Chat Affinity 8.9x 7.27x 2.36x 1.80x 1.62x 1.55.x 1.47x 1.45x 1.23x Predicted Probabilities provides Insights on the Projected Users PROJECTIONTarget Audience = Projected Users Probabilities Insights on Destination Variables 𝐏( / )∈
  • 45. Audience Projection In a Nutshell Social Panel Consumptions Survey PanelCommon Entities Bayesian Model Target Audience = Affinity: 1.80x Affinity: 1.55x Affinity: 1.62x
  • 46. Cool! How do you know this is accurate?
  • 48. Binary Classifier Evaluation Bayesian Model Projected Users Probabilities Ground Truth Evaluation techniques ?
  • 49. Validate via Common Entities X X X X X X X X Source Users Destination Users common entities Target Audience OR= Projected Audience OR= Exact Query Replica Ground Truth
  • 50. Validate via Self Reconstruction Within the Same Domain X X X X X X X X X X X X X X X X X X X Source Users Destination Users source-only entities common entities destination-only entities Target Audience = Ground Truth
  • 51. Validate via Double-step Reconstruction PROJECTION PROJECTION Predicted probabilities Ground Truth
  • 52. Repeat Test Cases Stratifying by Category
  • 54. Golden Benchmarks Comparison on Aggregated Insights
  • 56. Many Linked Views of the Same Global Population Audience Projection
  • 57. Multiple Perspectives Reinforce Reliability Social Panel Target Audience = Interacted with Game Informer social page Affinity: 2.17x Have you read any Game Informer issue? Affinity: 1.73x Game Informer Single Issue Magazine purchased online Affinity: 2.51x
  • 58. Generalize Audience Projection as a Domain Adaptation Problem
  • 62. Audience Projection adapts to any “entity domain” Bayesian Model
  • 63. Accuracy and Biases can be quantified
  • 64. Strategists now have a complete view of their Target Audience
  • 65. Gianmario Spacagna Chief Scientist at Helixa.ai gspacagna@helixa.ai @gm_spacagna
  • 67. Natural Language Processing (NLP) Pipeline "Mark Watney visited Mars"
  • 68. The spaCy NER Model Overview EMBED ENCODE ATTEND PREDICT
  • 69. Embedding Words Features token lower prefix suffix shape Apple apple app ple Wwwww U.K. uk uk uk W.W. Fahrenheit 451 fahrenheit 451 fah 451 Wwwwwwwwww ddd Each word (token) is represented by concatenating the embeddings of all of the 4 features in order to generalize the context for unknown words.
  • 70. Efficiently Embedding Words Hash Embedding reduces the dimensionality and allows to deal with large vocabularies
  • 71. Encoding Sequences of Words Residual Convolutional Neural Networks allows to encode context-independent word vectors into a context-sensitive sentence matrix. Raw tri-gram chunk Enriched tri-gram matrix Mark Watney visited “Mark Watney visited Mars”
  • 72. Crafting the Attention Vector The attention vector of the trigram includes information on the encountered entities. “Mark Watney visited Mars” Attention vector Tri-gram matrix Enriched tri-gram vector
  • 73. Predicting the Recognized Entities Actions: SHIFT OUT REDUCE (Entity Tagging) Stack Buffer Segment “Mark Watney visited Mars” Actions: 1.SHIFT 2.SHIFT 3.REDUCE (PER) 4.OUT 5.SHIFT 6.REDUCE (LOC) Mark Watney Mars Mark Watney visited Mars Enriched tri-gram vector Update attention Attention vector Tri-gam matrix
  • 74. Official Explanation of spaCy NER Model https://www.youtube.com/watch?v=sqDHBH9IjRU
  • 76. Projecting the Share of Interests on Common Entities Target Audience Projection 50% 17% 50% Share of Interests: SIZE: 60M SIZE: 200M SIZE: ? SIZE: 40M Global Audience (average american) = Target Audience evidence prior
  • 77. Evidence Statistics on Share of Interests N = 180M users in U.S. population sampling rate = 1 : 10k n = 18k users in sample panel p = 17% of market penetration x = 3k expected projected users SIZE: 200M SIZE: 40M statistics: evidence
  • 78. 𝐏( / ) = Binomial Positive Likelihood n = 17999 x = 2999 log(p)=-5.56323 Probability of selecting 3000 / 18000 McDonald’s panel users given that the user IS part of the target∈ n = 18000 x = 3000 log(p)=-5.54342 is smaller than p=17%
  • 79. 𝐏( / ) = Binomial Negative Likelihood n = 17999 x = 2999 log(p)=-5.53942 Probability of selecting 3000 / 18000 McDonald’s panel users given that the user IS NOT part of the target∉ n = 18000 x = 3000 log(p)=-5.54342 p=17% is greater than