SlideShare ist ein Scribd-Unternehmen logo
1 von 20
1
EPAM BI Competency Center
1Near Real-time Marketing Support System
Implementation Details
by <Kiryl Sultanau> & <Yauheni Yushyn> & <Dzmitry Maskayeu>
2
Preamble
3
Bidding Support
Improve Ad campaigns
NRT Data Visualization:
● Near Real Time visualization of bids match with clicks, leads etc.
● Detect best Ad type/place/size/position... for different users/devices/regions...
● Quick reaction and better estimation for just started Ad Campaigns.
Improve Ad campaigns:
● Improve keyword campaigns with more relevant keywords and better
specialized target group, region etc.
● Create new short time campaigns for special events or occasions
● Collect specific users and information about them
4
Prerequisites
External bidding info (impressions, clicks)
Ad Campaign info
Publisher log streams (impressions, click, search...
Other Dictionaries
5
Architectural Overview
6
Pipeline
Source
data
ETL
layer
Presentation
layer
7
Stream Processing
server log [cookies, user_agent, city_id, log_type_id …]
Ad Exchange
- Google
DoubleClick AdX
- TANX Alibaba
- Baidu
- Google Mobile
...
JOIN
City
US city names
Log Type
- bid-impression
- bid-click
- site-open
- site-search
- site-impression
- site-click
Site Pages
Owner URL &
google tag
User Tag
External URL &
user search
keyword
State
US state names
Keywords
User Keywords as
union of google
tags and user
search keywords
Spark Cache
Kafka RDD
DataFrame
Apply schema
8
joined server logDataFrame
Parse User Agent String
Browser
OS
Group
Manufacturer
Rendering engine
Version: major, minor
Name
Name
Platform
Device
Manufacturer
DataFrame
Stream Processing
9
joined server log + user agentDataFrame
JOIN
Cassandra table UNPIVOTDataFrame
id bid_click_kw site_open_kw site_click_kw site_lead_kwsite_search_kw
joined server log + user agent + previous user behaviorDataFrame
Stream Processing
10
joined server log + user agent + previous user behaviorDataFrame
joined server log + user agent + previous user behavior + target group marker
Stream Processing
11
Saving data
Users Dimension Analytics Service API
12
Saving data
13
Visualisation
Discover
Visualize
Dashboard
14
Tags Analyser Tool
● real time data
● slices by any collected metric
(time, geo-location, action type,
make, model, user behavior …)
● apply filters on the fly easy as a
cake
● combine and manage filters
● share dashboards
● add new visualisations on the fly
● serve all this staff from UI
15
NRT Data Visualization
16
Question: How to recognize users that will potentially bring profit
to provider?
Input data: Logs of searches and clicks on site, logs from
partner sites. The data will be merged and split on parts: 60%
training, 20% test, 20% validation.
Features: The variables for model training that we’ll use as
defining the output are: region, city, user actions and searches
on site.
Algorithms: Deep Learning algorithm from H2O package.
Evaluation: The model will be evaluated based on number of
predicted clicks + N * number of predicted conversions.
Model Usage: After being trained the Model will receive data on
user and his actions on site and will provide probability that this
user will click on ad.
Lead Prediction Using Machine Learning
17
NRT Bidding
region: LA, CA
sex: male
age: 31
stream:
google.com > Edmunds.com > search: SUV
region: CA
tags: top, SUV, 2015
price: 90$ CPM
limit: 200$ day
region: CA, NY
tags: SUV, crossover
price: 70$ CPM
limit: 300$ day
18
Crawl Social Networks (event, places, post, feeds...)
19
Crawl Social Networks (attenders, followers, likers...)
20
Confidential 20

Weitere ähnliche Inhalte

Andere mochten auch

MATLAB 2014-15 TITLES FOR FINAL YEAR
MATLAB 2014-15 TITLES FOR FINAL YEARMATLAB 2014-15 TITLES FOR FINAL YEAR
MATLAB 2014-15 TITLES FOR FINAL YEARBlueChip Technology
 
Baltijas ceļa pasākumi ārvalstīs augustā un septembrī
Baltijas ceļa pasākumi ārvalstīs augustā un septembrīBaltijas ceļa pasākumi ārvalstīs augustā un septembrī
Baltijas ceļa pasākumi ārvalstīs augustā un septembrīValdības māja
 
Advanta Healthcare Partners Capabilities
Advanta Healthcare Partners Capabilities Advanta Healthcare Partners Capabilities
Advanta Healthcare Partners Capabilities Advanta
 
もしも私がカドカワの川上量生CEOならどのようにして会社を成長させるか? -ニコニコ動画を持つ企業の経営戦略-
もしも私がカドカワの川上量生CEOならどのようにして会社を成長させるか? -ニコニコ動画を持つ企業の経営戦略-もしも私がカドカワの川上量生CEOならどのようにして会社を成長させるか? -ニコニコ動画を持つ企業の経営戦略-
もしも私がカドカワの川上量生CEOならどのようにして会社を成長させるか? -ニコニコ動画を持つ企業の経営戦略-Takashi Okada
 
The Mathematical Epidemiology of Human Babesiosis in the North-Eastern United...
The Mathematical Epidemiology of Human Babesiosis in the North-Eastern United...The Mathematical Epidemiology of Human Babesiosis in the North-Eastern United...
The Mathematical Epidemiology of Human Babesiosis in the North-Eastern United...QUT_SEF
 
Kts c3-he to hop
Kts c3-he to hopKts c3-he to hop
Kts c3-he to hopWang Ruan
 
39.2015.tt.bnnptnt
39.2015.tt.bnnptnt39.2015.tt.bnnptnt
39.2015.tt.bnnptntMèo Hoang
 
国内IT系エンジニアは不足しているのか?
国内IT系エンジニアは不足しているのか?国内IT系エンジニアは不足しているのか?
国内IT系エンジニアは不足しているのか?Takashi Okada
 

Andere mochten auch (13)

MATLAB 2014-15 TITLES FOR FINAL YEAR
MATLAB 2014-15 TITLES FOR FINAL YEARMATLAB 2014-15 TITLES FOR FINAL YEAR
MATLAB 2014-15 TITLES FOR FINAL YEAR
 
Baltijas ceļa pasākumi ārvalstīs augustā un septembrī
Baltijas ceļa pasākumi ārvalstīs augustā un septembrīBaltijas ceļa pasākumi ārvalstīs augustā un septembrī
Baltijas ceļa pasākumi ārvalstīs augustā un septembrī
 
Sayurianimeshop
SayurianimeshopSayurianimeshop
Sayurianimeshop
 
Advanta Healthcare Partners Capabilities
Advanta Healthcare Partners Capabilities Advanta Healthcare Partners Capabilities
Advanta Healthcare Partners Capabilities
 
もしも私がカドカワの川上量生CEOならどのようにして会社を成長させるか? -ニコニコ動画を持つ企業の経営戦略-
もしも私がカドカワの川上量生CEOならどのようにして会社を成長させるか? -ニコニコ動画を持つ企業の経営戦略-もしも私がカドカワの川上量生CEOならどのようにして会社を成長させるか? -ニコニコ動画を持つ企業の経営戦略-
もしも私がカドカワの川上量生CEOならどのようにして会社を成長させるか? -ニコニコ動画を持つ企業の経営戦略-
 
The Mathematical Epidemiology of Human Babesiosis in the North-Eastern United...
The Mathematical Epidemiology of Human Babesiosis in the North-Eastern United...The Mathematical Epidemiology of Human Babesiosis in the North-Eastern United...
The Mathematical Epidemiology of Human Babesiosis in the North-Eastern United...
 
Kts c3-he to hop
Kts c3-he to hopKts c3-he to hop
Kts c3-he to hop
 
39.2015.tt.bnnptnt
39.2015.tt.bnnptnt39.2015.tt.bnnptnt
39.2015.tt.bnnptnt
 
BionicMe
BionicMeBionicMe
BionicMe
 
Meadowbrook Estates
Meadowbrook EstatesMeadowbrook Estates
Meadowbrook Estates
 
LA VOTE ORY
LA VOTE ORYLA VOTE ORY
LA VOTE ORY
 
Michelle Anne 2017
Michelle Anne 2017Michelle Anne 2017
Michelle Anne 2017
 
国内IT系エンジニアは不足しているのか?
国内IT系エンジニアは不足しているのか?国内IT系エンジニアは不足しているのか?
国内IT系エンジニアは不足しているのか?
 

Ähnlich wie EPAM BI Competency Center Near Real-time Marketing Support System

Nicholas Gorski: Real-time revenue science at Twitter
Nicholas Gorski: Real-time revenue science at TwitterNicholas Gorski: Real-time revenue science at Twitter
Nicholas Gorski: Real-time revenue science at TwitterDavid Garrison
 
Data Science Salon: Enabling self-service predictive analytics at Bidtellect
Data Science Salon: Enabling self-service predictive analytics at BidtellectData Science Salon: Enabling self-service predictive analytics at Bidtellect
Data Science Salon: Enabling self-service predictive analytics at BidtellectFormulatedby
 
Vpon - 廣告效果導向為基礎的行動廣告系統
Vpon - 廣告效果導向為基礎的行動廣告系統Vpon - 廣告效果導向為基礎的行動廣告系統
Vpon - 廣告效果導向為基礎的行動廣告系統Vpon
 
Smart Canvas @ Large Scale Recommender Systems Workshop 2015
Smart Canvas  @ Large Scale Recommender Systems Workshop 2015Smart Canvas  @ Large Scale Recommender Systems Workshop 2015
Smart Canvas @ Large Scale Recommender Systems Workshop 2015Gilmar Souza
 
Kritter introduction - advertiser
Kritter   introduction - advertiserKritter   introduction - advertiser
Kritter introduction - advertiserKrittercorporate
 
Rd Online Deck 3.0
Rd Online Deck 3.0Rd Online Deck 3.0
Rd Online Deck 3.0Zestadz
 
Mobile marketing from analysis to launching a project
Mobile marketing from analysis to launching a projectMobile marketing from analysis to launching a project
Mobile marketing from analysis to launching a projectHeads&Hands
 
Борис Омельницкий
Борис ОмельницкийБорис Омельницкий
Борис ОмельницкийIABBelarus
 
Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...
Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...
Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...Octopus Events
 
Growth Hacking İstanbul 2018 - Ata Gür - Web Push Notifications
Growth Hacking İstanbul 2018 - Ata Gür - Web Push NotificationsGrowth Hacking İstanbul 2018 - Ata Gür - Web Push Notifications
Growth Hacking İstanbul 2018 - Ata Gür - Web Push NotificationsGrowth Hacking İstanbul
 
How to Use Push Notifications on Websites as a Growth Channel
How to Use Push Notifications on Websites as a Growth ChannelHow to Use Push Notifications on Websites as a Growth Channel
How to Use Push Notifications on Websites as a Growth ChannelAta Gur
 
Google Analytics - A Brief Intro
Google Analytics - A Brief IntroGoogle Analytics - A Brief Intro
Google Analytics - A Brief IntroKashyap Shah
 
Deep.bi - Real-time, Deep Data Analytics Platform For Ecommerce
Deep.bi - Real-time, Deep Data Analytics Platform For EcommerceDeep.bi - Real-time, Deep Data Analytics Platform For Ecommerce
Deep.bi - Real-time, Deep Data Analytics Platform For EcommerceDeep.BI
 
AppMetrica product_presentation_Dec_2017
AppMetrica product_presentation_Dec_2017AppMetrica product_presentation_Dec_2017
AppMetrica product_presentation_Dec_2017Alexander Sibrikov
 

Ähnlich wie EPAM BI Competency Center Near Real-time Marketing Support System (20)

Nicholas Gorski: Real-time revenue science at Twitter
Nicholas Gorski: Real-time revenue science at TwitterNicholas Gorski: Real-time revenue science at Twitter
Nicholas Gorski: Real-time revenue science at Twitter
 
Data Science Salon: Enabling self-service predictive analytics at Bidtellect
Data Science Salon: Enabling self-service predictive analytics at BidtellectData Science Salon: Enabling self-service predictive analytics at Bidtellect
Data Science Salon: Enabling self-service predictive analytics at Bidtellect
 
Google Analytics 4 : The Next Generation of Digital Analytics - Benjamin Kepn...
Google Analytics 4 : The Next Generation of Digital Analytics - Benjamin Kepn...Google Analytics 4 : The Next Generation of Digital Analytics - Benjamin Kepn...
Google Analytics 4 : The Next Generation of Digital Analytics - Benjamin Kepn...
 
Vpon - 廣告效果導向為基礎的行動廣告系統
Vpon - 廣告效果導向為基礎的行動廣告系統Vpon - 廣告效果導向為基礎的行動廣告系統
Vpon - 廣告效果導向為基礎的行動廣告系統
 
Cam specialist-unit-web-analytics-and-social-media
Cam specialist-unit-web-analytics-and-social-mediaCam specialist-unit-web-analytics-and-social-media
Cam specialist-unit-web-analytics-and-social-media
 
Smart Canvas @ Large Scale Recommender Systems Workshop 2015
Smart Canvas  @ Large Scale Recommender Systems Workshop 2015Smart Canvas  @ Large Scale Recommender Systems Workshop 2015
Smart Canvas @ Large Scale Recommender Systems Workshop 2015
 
lsrs15_ciandt
lsrs15_ciandtlsrs15_ciandt
lsrs15_ciandt
 
Kritter introduction - advertiser
Kritter   introduction - advertiserKritter   introduction - advertiser
Kritter introduction - advertiser
 
Thoughts on Post Click Analysis
Thoughts on Post Click AnalysisThoughts on Post Click Analysis
Thoughts on Post Click Analysis
 
Rd Online Deck 3.0
Rd Online Deck 3.0Rd Online Deck 3.0
Rd Online Deck 3.0
 
Mobile marketing from analysis to launching a project
Mobile marketing from analysis to launching a projectMobile marketing from analysis to launching a project
Mobile marketing from analysis to launching a project
 
Mar-Tech Oversight
Mar-Tech OversightMar-Tech Oversight
Mar-Tech Oversight
 
Google Analytics tutorial by Jay Murphy
Google Analytics tutorial by Jay Murphy Google Analytics tutorial by Jay Murphy
Google Analytics tutorial by Jay Murphy
 
Борис Омельницкий
Борис ОмельницкийБорис Омельницкий
Борис Омельницкий
 
Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...
Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...
Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...
 
Growth Hacking İstanbul 2018 - Ata Gür - Web Push Notifications
Growth Hacking İstanbul 2018 - Ata Gür - Web Push NotificationsGrowth Hacking İstanbul 2018 - Ata Gür - Web Push Notifications
Growth Hacking İstanbul 2018 - Ata Gür - Web Push Notifications
 
How to Use Push Notifications on Websites as a Growth Channel
How to Use Push Notifications on Websites as a Growth ChannelHow to Use Push Notifications on Websites as a Growth Channel
How to Use Push Notifications on Websites as a Growth Channel
 
Google Analytics - A Brief Intro
Google Analytics - A Brief IntroGoogle Analytics - A Brief Intro
Google Analytics - A Brief Intro
 
Deep.bi - Real-time, Deep Data Analytics Platform For Ecommerce
Deep.bi - Real-time, Deep Data Analytics Platform For EcommerceDeep.bi - Real-time, Deep Data Analytics Platform For Ecommerce
Deep.bi - Real-time, Deep Data Analytics Platform For Ecommerce
 
AppMetrica product_presentation_Dec_2017
AppMetrica product_presentation_Dec_2017AppMetrica product_presentation_Dec_2017
AppMetrica product_presentation_Dec_2017
 

Mehr von Dmitry Tolpeko

Big Data Analytics for BI, BA and QA
Big Data Analytics for BI, BA and QABig Data Analytics for BI, BA and QA
Big Data Analytics for BI, BA and QADmitry Tolpeko
 
Qubole - Big data in cloud
Qubole - Big data in cloudQubole - Big data in cloud
Qubole - Big data in cloudDmitry Tolpeko
 
Big Data Technology - Solit 2015 Conference
Big Data Technology - Solit 2015 ConferenceBig Data Technology - Solit 2015 Conference
Big Data Technology - Solit 2015 ConferenceDmitry Tolpeko
 
Apache Yarn - Hadoop Cluster Management
Apache Yarn -  Hadoop Cluster ManagementApache Yarn -  Hadoop Cluster Management
Apache Yarn - Hadoop Cluster ManagementDmitry Tolpeko
 
Bi 2.0 hadoop everywhere
Bi 2.0   hadoop everywhereBi 2.0   hadoop everywhere
Bi 2.0 hadoop everywhereDmitry Tolpeko
 
Apache Kafka - Messaging System Overview
Apache Kafka - Messaging System OverviewApache Kafka - Messaging System Overview
Apache Kafka - Messaging System OverviewDmitry Tolpeko
 

Mehr von Dmitry Tolpeko (6)

Big Data Analytics for BI, BA and QA
Big Data Analytics for BI, BA and QABig Data Analytics for BI, BA and QA
Big Data Analytics for BI, BA and QA
 
Qubole - Big data in cloud
Qubole - Big data in cloudQubole - Big data in cloud
Qubole - Big data in cloud
 
Big Data Technology - Solit 2015 Conference
Big Data Technology - Solit 2015 ConferenceBig Data Technology - Solit 2015 Conference
Big Data Technology - Solit 2015 Conference
 
Apache Yarn - Hadoop Cluster Management
Apache Yarn -  Hadoop Cluster ManagementApache Yarn -  Hadoop Cluster Management
Apache Yarn - Hadoop Cluster Management
 
Bi 2.0 hadoop everywhere
Bi 2.0   hadoop everywhereBi 2.0   hadoop everywhere
Bi 2.0 hadoop everywhere
 
Apache Kafka - Messaging System Overview
Apache Kafka - Messaging System OverviewApache Kafka - Messaging System Overview
Apache Kafka - Messaging System Overview
 

Kürzlich hochgeladen

Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Milind Agarwal
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 

Kürzlich hochgeladen (20)

Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 

EPAM BI Competency Center Near Real-time Marketing Support System

  • 1. 1 EPAM BI Competency Center 1Near Real-time Marketing Support System Implementation Details by <Kiryl Sultanau> & <Yauheni Yushyn> & <Dzmitry Maskayeu>
  • 3. 3 Bidding Support Improve Ad campaigns NRT Data Visualization: ● Near Real Time visualization of bids match with clicks, leads etc. ● Detect best Ad type/place/size/position... for different users/devices/regions... ● Quick reaction and better estimation for just started Ad Campaigns. Improve Ad campaigns: ● Improve keyword campaigns with more relevant keywords and better specialized target group, region etc. ● Create new short time campaigns for special events or occasions ● Collect specific users and information about them
  • 4. 4 Prerequisites External bidding info (impressions, clicks) Ad Campaign info Publisher log streams (impressions, click, search... Other Dictionaries
  • 7. 7 Stream Processing server log [cookies, user_agent, city_id, log_type_id …] Ad Exchange - Google DoubleClick AdX - TANX Alibaba - Baidu - Google Mobile ... JOIN City US city names Log Type - bid-impression - bid-click - site-open - site-search - site-impression - site-click Site Pages Owner URL & google tag User Tag External URL & user search keyword State US state names Keywords User Keywords as union of google tags and user search keywords Spark Cache Kafka RDD DataFrame Apply schema
  • 8. 8 joined server logDataFrame Parse User Agent String Browser OS Group Manufacturer Rendering engine Version: major, minor Name Name Platform Device Manufacturer DataFrame Stream Processing
  • 9. 9 joined server log + user agentDataFrame JOIN Cassandra table UNPIVOTDataFrame id bid_click_kw site_open_kw site_click_kw site_lead_kwsite_search_kw joined server log + user agent + previous user behaviorDataFrame Stream Processing
  • 10. 10 joined server log + user agent + previous user behaviorDataFrame joined server log + user agent + previous user behavior + target group marker Stream Processing
  • 11. 11 Saving data Users Dimension Analytics Service API
  • 14. 14 Tags Analyser Tool ● real time data ● slices by any collected metric (time, geo-location, action type, make, model, user behavior …) ● apply filters on the fly easy as a cake ● combine and manage filters ● share dashboards ● add new visualisations on the fly ● serve all this staff from UI
  • 16. 16 Question: How to recognize users that will potentially bring profit to provider? Input data: Logs of searches and clicks on site, logs from partner sites. The data will be merged and split on parts: 60% training, 20% test, 20% validation. Features: The variables for model training that we’ll use as defining the output are: region, city, user actions and searches on site. Algorithms: Deep Learning algorithm from H2O package. Evaluation: The model will be evaluated based on number of predicted clicks + N * number of predicted conversions. Model Usage: After being trained the Model will receive data on user and his actions on site and will provide probability that this user will click on ad. Lead Prediction Using Machine Learning
  • 17. 17 NRT Bidding region: LA, CA sex: male age: 31 stream: google.com > Edmunds.com > search: SUV region: CA tags: top, SUV, 2015 price: 90$ CPM limit: 200$ day region: CA, NY tags: SUV, crossover price: 70$ CPM limit: 300$ day
  • 18. 18 Crawl Social Networks (event, places, post, feeds...)
  • 19. 19 Crawl Social Networks (attenders, followers, likers...)