SlideShare ist ein Scribd-Unternehmen logo
1 von 51
RTB Bid Landscape
survival models
powered by bitmaps
Łukasz Mączewski Senior Data Scientist
Przemysław Piotrowski Senior Data Engineer
AI&Analytics, Adform
14.05.2019
1
Agenda
• Real Time Bidding – RTB
• Bidding Strategy
• Bid Landscape
• Bitmap population selection
• Survival analysis
• Validation
• Product demo
2
Real-Time Bidding (RTB)
RTB allows for fully automated ad
inventory selling / buying in a real-time
3
Advertiser 1
Advertiser 2
Advertiser 3
Advertiser n
AudienceInventory
Auction
Demand Side Platform is to
estimate optimal
bid price according to:
• Targeting (audience)
• Business requirements (KPIs)
• Ad inventory (context, ad slot price)
• Campaign budget and it's spend plan
First price vs. second price auction
First price auction:
- Pay exact amount that was bidded
- Bid price is close to a real impression value
- Seller revenue maximization
4
Second price auction:
- Pay 0.00001$ more than second-highest bid
price
- Bid as high as possible to maximize winning
probability
In the past the second price auction was preferred, but currently
the first price auction becomes more and more popular
RTB ecosystem
5
diverse
audience
web page
Publisher 1 and its
inventory SSP 1
Supply Side
Platform
Publisher n and its
inventory
.
.
. Ad Exchange
DMP
Data
Management
Platform
DSP 1
Demand Side Platform
SSP n DSP n
.
.
.
advertisers
demographics, browsing, interest segments data
cookie profile data
feedback data
ad-slot / banner data
campaign data
Bidding strategy
optimisation
Time to bid is ~100 ms
Bidding strategy optimisation
6
Bidding Strategy
Optimization
Candidate impression valuation with
respect to KPIs and budget
constraints
DSP algorithms
Evaluation of bid request business potential
Bid Landscape
Estimation of winning probability at bid price
Budget throttle
Control budget spend
Optimal bid price
Numbers
7
Adform provides an integrated Software as a Service
platform for the buying, managing and serving of
digital advertising.
Presence in EU, US, APAC
3 millions bid request per second
billions of engagements daily
5 millions scoring of algorithms per second
200 thousands of cookie profiles in DMP
Algorithms
8
Discovering knowledge from massive RTB data
9
Cookie profile/
Targeting:
• device type
• os type
• browser type
• screen size
• country
• regions
• city
• clicks made
• visited log points
• visited domains
• urls
• IAB categories
• ...
Feedback:
• acquisition/conversion
• click
• order status
• revenue
• viewability
• ...
Adslot/ banners/
creatives:
• inventory name
• domain
• URL
• banner position
• banner size
• banner type
• floor price
• video player properties
• ...
Data Management
Platform:
• demography
• interests and preferences
• profession
• ...
Campaign:
• budget
• pricing type
• deals
• targeting
• brand safety
• ...
Discovering knowledge from massive RTB data
10
Huge class
imbalance
Mostly
categorical
data
High
cardinality
Sparsity
High risk of
overfit
Data summary
Reduction of dimensionality
Preselection of features RegularisationData sampling
Cross validationEnsembling
RTB models
11
CPC
Cost per Click
CPL
Cost per Lead
netCPL
Confirmed lead
CPV
Cost per
Viewability
ROAS
Return on Ad
Spend
VCR
Video Completion
Rate
Adform machine learning pipeline
12
Train/Test data
sets preparation
(dedicated ETLs)
Data
preprocessing
and feature
engineering
Feature selection
Building model
and
hyperparameters
optimization
Offline backtests Online A/B tests Deployment Live monitoring
After successful model deployment it is
periodically retrained
RTB Algorithms Pipeline
13
PricingAPI
Train
Data
JAR
JAR
OFFLINE JAR
JAR
ONLINE JAR
ONLINE SPEC ONLINE MONITORING
Backtests
• Offline procedure to check performance of
two competing models in time
• Sequential analysis of historical data:
- Data divided into training and test sets
• Time series analysis of critical performance
KPIs:
- AUC
- Sparsity
- Target rates
• If the backtests succeed the algorithm is
submitted to online A/B testing procedure
14
AUC
SPARSITY
Target rate
Time
A/B test online monitoring
15
Split traffic to
hash(cookie_id)%1024
buckets
For each experiment monitor
• Cost per target
• Target rate
• Target count
• Number of impressions
• ... many more
Bid Landscape
17
User Stories
As a Trafficker I want to know volume of impressions my campaign would win in RTB given
the selected inventory, targeting and budget settings (CPM).
As a DSP I want to enable Advertisers winning impressions for lowest possible price.
18
Bid Shading
Demand Side Platform protects Advertiser from overpaying while
AdTech ecosystem migrating from second-price to first-price RTB auctions.
Forecasting & Bid Landscape
Forecasting
• Total number of matching Bid Requests for
given Campaign Targeting
• Bid Opportunities
19
Bid Landscape
• Predicting probability of winning
auction with a given bid price and
Campaign Settings
• Win rate evolution with a bid price
Impressions
CPM
Bid Landscape
• Building Bid Landscape model using
historical data regarding won and failed
auctions:
For lost auctions win price is unknown
• Right Censored Data
Survival Analysis
The exact moment of failure for majority of
objects is unknown
20
All Bids
Won Bids
Relative Rate
Bid Price Buckets
Population selection for
Bid Landscape modelling
21
The Data
22
Input
Bids and Impressions with features
Impression is a Bid that won the auction
100000 2000 5000 100 4000 3000 ... 4500
Output
cardinality of selected (Campaign Settings)
Bids bucketed by bid price
Impressions bucketed by bid price
Impressions bucketed by win price
100 buckets in our case
How?
Population Selection – select relevant population from high-dimensional
space for survival analysis.
source:biretail.com
Bitmaps (bitset, bitvector)
23
waw
1
0
0
0
1
0
0
1
…
0
iOS
0
0
1
0
1
1
0
1
…
0
11:00
1
1
0
0
1
0
0
1
…
0
Bid
0
1
2
3
4
5
6
7
…
100mln
Matching
bids
0
0
0
0
1
0
0
1
…
0
BidPriceBuckets
9900
790
2000
….
100
Counts
Nested queries on millions of features
24
waw
1
0
0
0
1
0
0
1
…
0
iOS
0
0
1
0
1
1
0
1
…
0
11:00
1
1
0
0
1
0
0
1
…
0
Bid
0
1
2
3
4
5
6
7
…
100mln
Matching
bids
0
0
0
0
1
0
0
1
…
0
BidPriceBuckets
9900
790
2000
….
100
Counts
3 mln other features
AND, OR, NOT, XOR
25
Buckets' cardinality
10000 3000 ... 4500
2000 100 ... 0
1000 800 ... 400
Survival
Analysis
Roaring Bitmap
26
Roaring Bitmap – state of the art compressed bitmap algorithm.
Hybrid approach between sparse and dense bitmaps representation.
roaringbitmap.org/about/
„Use Roaring for bitmap compression whenever possible.
Do not use other bitmap compression methods” [1]
[1] Wang, Jianguo, et al.
"An experimental study of bitmap compression vs. inverted list compression." 2017.
Bitmap Execution Database
27
• Roaring Bitmap – state of the art compressed bitmap algorithm
• Whole 7 days of Internet bid opportunities – 12GB (sample 1/1000) Naive 50TB
• Bids&Impressions – 5GB (sample 1/100)
• Majority of responses under 1 second
Nested set operations language
28
And(
Or("inventory|1", "inventory|2", "inventory|3"),
AndNot(
Universe,
Or("category|moto", "category|fishing")
),
Or(
"hour_of_week|41",
"hour_of_week|42",
"hour_of_week|43",
"hour_of_week|44"
),
Or("tld|com", "website|example.com"),
And(
"country|poland",
AndNot(
Universe,
"city|warsaw"
)
),
...
)
Survival analysis
29
Principles of survival analysis
• The basic assumption in survival analysis is that the risk of failure changes with time
• Two major approaches for survival probability modelling can be distinguished:
- Parametric – can be extrapolated for new experiences (Weibull)
- Non-parametric – very flexible but can't be extended for new experience (Kaplan-Maier)
30
FailureRate
Data censoring in survival analysis
31
Weibull based survival probability estimation
• Weibull distribution is derived from hazard function:
where:
- S(t) - survival function (cdf)
- F(t) = 1 – S(t) - failure function (cdf)
- f(t) = dF(t)/dt - pdf
• Weibull distribution is defined by 2 parameters:
- Shape parameter p - failure mode type
- Scale parameter l - position
32
Time
Survive .
0 t ∆t
Fail .
Weibull based survival probability estimation
• One can distinguish 3 classes of failure types:
- Infant mortality p < 1
- Random failures p = 1
- Ware out p > 1
33
Weibull examples
Parametric approach easy to extrapolate
age
Hazard
Kaplan-Maier based survival probability estimation
• In Kaplan-Meier method we are to predict chance of surviving time ∆t after surviving time t:
34
No parameters to be estimated, very flexible and easy in implementation
S(t,Δt) = S(t) • S(Δt|t)
Time
Survive .
0 t ∆t
Survive.
Kaplan-Meier curve calculation
35
S(t,Δt) = S(t) • S(Δt|t)
Time (weeks)
KM Curve for Group 1 (treatment)
Elements of survival analysis in
Bid Landscape modelling
36
RTB auctions modelling with survival methods
From survival models to Bid Landscape:
- time -> cost per 1k bought impressions (CPM)
- died -> won bid (left censored, known win price)
- alive -> lost bid (right censored, unknown exact win price)
37
our win price (second price auction)
competitor outbid price (unknown)
our bid price
Bid Landscape vs. Survival Analysis
Survival analysis
• Risk of failure evolves with time
• Incomplete knowledge regarding failure
time, right censoring:
- Majority of object survive
• Multiple competing failure modes
• Very high disproportion between positive
(failures) and negative (survived) class
representatives
• Usually very limited data
• Wrong predictions may have catastrophic
consequences – quality
monitoring required
38
Bid Landscape analysis
• Winning probability evolves with price
• Incomplete knowledge regarding win price, right
censoring:
- Only small fraction of bids are won
• Different inventory sources
• Very high disproportion between positive (won)
and negative (lost) class representatives
• Usually huge data sets – subsampling
• Wrong predictions can lead to not efficient
campaign money spend – quality monitoring
required
Survival model for Bid Landscape modelling
39
• Let's apply survival techniques to auction data:
- Instead of time a bid price is to be used
• Legend:
• number of lost auctions
• number of won impressions
- win price CDF
- Kaplan-Maier estimate
- Weibull estimate
Adform Bid Landscape exploits
Kaplan-Maier estimate exclusively
Bid Price (CPM)
Multiplicity
WinRate
Bid Landscape validation in
Adform
40
Bid Landscape validation – qualitative
41
Win rate evolution with bid price Win rate evolution over day
The bid landscape predictions are in line with the intuition:
• The win rate increases with bid price
• The win rate changes periodically over a day
Bid Landscape validation – quantitative (real RTB data)
42
Real auction data versus Bid Landscape predictions
Bid Landscape as Adform
product
43
Interactive tool used in UI
44
Hyperlocal precision
45
Floor Price detection
46
Bid Multipliers optimisation
47
Polish accountants'
portal
Jobs for professionals
Video oriented website
Impressions
Source of traffic:
• Campaign targeting Warsaw people to be
run on three different domains
• Global CPM makes no sense, needs to be
adjusted
Bid Landscape dynamic CPM algorithm
48
Given the Campaign Settings including hourly budget maximize
bought relevant impressions by using lowest CPM.
First tests are showing multiplied Viewable Impressions,
Clicks comparing to same Campaign using static price.
Summary and conclusions
• The Adform Bid Landscape is composed of two base elements ensuring speed and quality:
- Bitmaps – fast and efficient population selection
- Kaplan-Maier win rates – simple and precise estimate
• The Adform Bid Landscape is in production for more than one year
- It is mostly used by Adform customers optimising the CPM campaigns
- Currently we develop new RTB algorithm that is based on BL predictions
- In the short time perspective we are to extend usage of BL in other RTB models
49
5
RTB Bid Landscape in Adform

Weitere ähnliche Inhalte

Was ist angesagt?

AWS 9월 웨비나 | Amazon Aurora Deep Dive
AWS 9월 웨비나 | Amazon Aurora Deep DiveAWS 9월 웨비나 | Amazon Aurora Deep Dive
AWS 9월 웨비나 | Amazon Aurora Deep DiveAmazon Web Services Korea
 
Counterfactual evaluation of machine learning models
Counterfactual evaluation of machine learning modelsCounterfactual evaluation of machine learning models
Counterfactual evaluation of machine learning modelsMichael Manapat
 
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...StampedeCon
 
DynamoDB의 안과밖 - 정민영 (비트패킹 컴퍼니)
DynamoDB의 안과밖 - 정민영 (비트패킹 컴퍼니)DynamoDB의 안과밖 - 정민영 (비트패킹 컴퍼니)
DynamoDB의 안과밖 - 정민영 (비트패킹 컴퍼니)AWSKRUG - AWS한국사용자모임
 
Overcoming Today's Data Challenges with MongoDB
Overcoming Today's Data Challenges with MongoDBOvercoming Today's Data Challenges with MongoDB
Overcoming Today's Data Challenges with MongoDBMongoDB
 
MongoDB WiredTiger Internals
MongoDB WiredTiger InternalsMongoDB WiredTiger Internals
MongoDB WiredTiger InternalsNorberto Leite
 
백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스
백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스
백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스NAVER D2
 
인공지능추천시스템 airs개발기_모델링과시스템
인공지능추천시스템 airs개발기_모델링과시스템인공지능추천시스템 airs개발기_모델링과시스템
인공지능추천시스템 airs개발기_모델링과시스템NAVER D2
 
Unified Batch & Stream Processing with Apache Samza
Unified Batch & Stream Processing with Apache SamzaUnified Batch & Stream Processing with Apache Samza
Unified Batch & Stream Processing with Apache SamzaDataWorks Summit
 
Multi cluster, multitenant and hierarchical kafka messaging service slideshare
Multi cluster, multitenant and hierarchical kafka messaging service   slideshareMulti cluster, multitenant and hierarchical kafka messaging service   slideshare
Multi cluster, multitenant and hierarchical kafka messaging service slideshareAllen (Xiaozhong) Wang
 
EMR 플랫폼 기반의 Spark 워크로드 실행 최적화 방안 - 정세웅, AWS 솔루션즈 아키텍트:: AWS Summit Online Ko...
EMR 플랫폼 기반의 Spark 워크로드 실행 최적화 방안 - 정세웅, AWS 솔루션즈 아키텍트::  AWS Summit Online Ko...EMR 플랫폼 기반의 Spark 워크로드 실행 최적화 방안 - 정세웅, AWS 솔루션즈 아키텍트::  AWS Summit Online Ko...
EMR 플랫폼 기반의 Spark 워크로드 실행 최적화 방안 - 정세웅, AWS 솔루션즈 아키텍트:: AWS Summit Online Ko...Amazon Web Services Korea
 
[MLOps KR 행사] MLOps 춘추 전국 시대 정리(210605)
[MLOps KR 행사] MLOps 춘추 전국 시대 정리(210605)[MLOps KR 행사] MLOps 춘추 전국 시대 정리(210605)
[MLOps KR 행사] MLOps 춘추 전국 시대 정리(210605)Seongyun Byeon
 
A Technical Introduction to WiredTiger
A Technical Introduction to WiredTigerA Technical Introduction to WiredTiger
A Technical Introduction to WiredTigerMongoDB
 
MongoDB Sharding
MongoDB ShardingMongoDB Sharding
MongoDB ShardingRob Walters
 
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...Amazon Web Services
 
클라우드 기반 AWS 데이터베이스 선택 옵션 - AWS Summit Seoul 2017
클라우드 기반 AWS 데이터베이스 선택 옵션 - AWS Summit Seoul 2017 클라우드 기반 AWS 데이터베이스 선택 옵션 - AWS Summit Seoul 2017
클라우드 기반 AWS 데이터베이스 선택 옵션 - AWS Summit Seoul 2017 Amazon Web Services Korea
 
Spark Summit EU talk by Herman van Hovell
Spark Summit EU talk by Herman van HovellSpark Summit EU talk by Herman van Hovell
Spark Summit EU talk by Herman van HovellSpark Summit
 

Was ist angesagt? (20)

AWS 9월 웨비나 | Amazon Aurora Deep Dive
AWS 9월 웨비나 | Amazon Aurora Deep DiveAWS 9월 웨비나 | Amazon Aurora Deep Dive
AWS 9월 웨비나 | Amazon Aurora Deep Dive
 
Counterfactual evaluation of machine learning models
Counterfactual evaluation of machine learning modelsCounterfactual evaluation of machine learning models
Counterfactual evaluation of machine learning models
 
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
 
DynamoDB의 안과밖 - 정민영 (비트패킹 컴퍼니)
DynamoDB의 안과밖 - 정민영 (비트패킹 컴퍼니)DynamoDB의 안과밖 - 정민영 (비트패킹 컴퍼니)
DynamoDB의 안과밖 - 정민영 (비트패킹 컴퍼니)
 
Overcoming Today's Data Challenges with MongoDB
Overcoming Today's Data Challenges with MongoDBOvercoming Today's Data Challenges with MongoDB
Overcoming Today's Data Challenges with MongoDB
 
MongoDB WiredTiger Internals
MongoDB WiredTiger InternalsMongoDB WiredTiger Internals
MongoDB WiredTiger Internals
 
백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스
백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스
백억개의 로그를 모아 검색하고 분석하고 학습도 시켜보자 : 로기스
 
인공지능추천시스템 airs개발기_모델링과시스템
인공지능추천시스템 airs개발기_모델링과시스템인공지능추천시스템 airs개발기_모델링과시스템
인공지능추천시스템 airs개발기_모델링과시스템
 
Unified Batch & Stream Processing with Apache Samza
Unified Batch & Stream Processing with Apache SamzaUnified Batch & Stream Processing with Apache Samza
Unified Batch & Stream Processing with Apache Samza
 
Multi cluster, multitenant and hierarchical kafka messaging service slideshare
Multi cluster, multitenant and hierarchical kafka messaging service   slideshareMulti cluster, multitenant and hierarchical kafka messaging service   slideshare
Multi cluster, multitenant and hierarchical kafka messaging service slideshare
 
EMR 플랫폼 기반의 Spark 워크로드 실행 최적화 방안 - 정세웅, AWS 솔루션즈 아키텍트:: AWS Summit Online Ko...
EMR 플랫폼 기반의 Spark 워크로드 실행 최적화 방안 - 정세웅, AWS 솔루션즈 아키텍트::  AWS Summit Online Ko...EMR 플랫폼 기반의 Spark 워크로드 실행 최적화 방안 - 정세웅, AWS 솔루션즈 아키텍트::  AWS Summit Online Ko...
EMR 플랫폼 기반의 Spark 워크로드 실행 최적화 방안 - 정세웅, AWS 솔루션즈 아키텍트:: AWS Summit Online Ko...
 
[MLOps KR 행사] MLOps 춘추 전국 시대 정리(210605)
[MLOps KR 행사] MLOps 춘추 전국 시대 정리(210605)[MLOps KR 행사] MLOps 춘추 전국 시대 정리(210605)
[MLOps KR 행사] MLOps 춘추 전국 시대 정리(210605)
 
A Technical Introduction to WiredTiger
A Technical Introduction to WiredTigerA Technical Introduction to WiredTiger
A Technical Introduction to WiredTiger
 
Amazon DynamoDB 키 디자인 패턴
Amazon DynamoDB 키 디자인 패턴Amazon DynamoDB 키 디자인 패턴
Amazon DynamoDB 키 디자인 패턴
 
Mongo db 최범균
Mongo db 최범균Mongo db 최범균
Mongo db 최범균
 
MongoDB Sharding
MongoDB ShardingMongoDB Sharding
MongoDB Sharding
 
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
 
클라우드 기반 AWS 데이터베이스 선택 옵션 - AWS Summit Seoul 2017
클라우드 기반 AWS 데이터베이스 선택 옵션 - AWS Summit Seoul 2017 클라우드 기반 AWS 데이터베이스 선택 옵션 - AWS Summit Seoul 2017
클라우드 기반 AWS 데이터베이스 선택 옵션 - AWS Summit Seoul 2017
 
Spark Summit EU talk by Herman van Hovell
Spark Summit EU talk by Herman van HovellSpark Summit EU talk by Herman van Hovell
Spark Summit EU talk by Herman van Hovell
 
Deep Dive on Amazon Redshift
Deep Dive on Amazon RedshiftDeep Dive on Amazon Redshift
Deep Dive on Amazon Redshift
 

Ähnlich wie RTB Bid Landscape in Adform

RTBMA ECIR 2016 tutorial
RTBMA ECIR 2016 tutorialRTBMA ECIR 2016 tutorial
RTBMA ECIR 2016 tutorialShuai Yuan
 
Machine learning for profit: Computational advertising landscape
Machine learning for profit: Computational advertising landscapeMachine learning for profit: Computational advertising landscape
Machine learning for profit: Computational advertising landscapeSharat Chikkerur
 
Fast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick GorskiFast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick GorskiHakka Labs
 
Computational Advertising in Yelp Local Ads
Computational Advertising in Yelp Local AdsComputational Advertising in Yelp Local Ads
Computational Advertising in Yelp Local Adssoupsranjan
 
Reel bid insightd-eproject
Reel bid   insightd-eprojectReel bid   insightd-eproject
Reel bid insightd-eprojectMudit Uppal
 
Weinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display Advertising
Weinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display AdvertisingWeinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display Advertising
Weinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display AdvertisingJun Wang
 
CIKM 2013 Tutorial: Real-time Bidding: A New Frontier of Computational Advert...
CIKM 2013 Tutorial: Real-time Bidding: A New Frontier of Computational Advert...CIKM 2013 Tutorial: Real-time Bidding: A New Frontier of Computational Advert...
CIKM 2013 Tutorial: Real-time Bidding: A New Frontier of Computational Advert...Shuai Yuan
 
A History of Programmatic Media
A History of Programmatic MediaA History of Programmatic Media
A History of Programmatic MediaThe Media Kitchen
 
TripleLift: Preparing for a New Programmatic Ad-Tech World
TripleLift: Preparing for a New Programmatic Ad-Tech WorldTripleLift: Preparing for a New Programmatic Ad-Tech World
TripleLift: Preparing for a New Programmatic Ad-Tech WorldVoltDB
 
Ad science bid simulator (public ver)
Ad science bid simulator (public ver)Ad science bid simulator (public ver)
Ad science bid simulator (public ver)Marsan Ma
 
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...MLconf
 
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo StepanchukBig Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo StepanchukGlobalLogic Ukraine
 
Machine Learning for Performance Advertising
Machine Learning for Performance AdvertisingMachine Learning for Performance Advertising
Machine Learning for Performance AdvertisingEustache Diemert
 
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...Databricks
 
Programmatic advertising - overview
Programmatic advertising - overviewProgrammatic advertising - overview
Programmatic advertising - overviewJim Murray Jones
 
Int'l Conference on Predictive APIs: RTB Optimizer presentation
Int'l Conference on Predictive APIs: RTB Optimizer presentationInt'l Conference on Predictive APIs: RTB Optimizer presentation
Int'l Conference on Predictive APIs: RTB Optimizer presentationDatacratic
 
Understanding Natural Language Instructions for Fetching Daily Objects Using ...
Understanding Natural Language Instructions for Fetching Daily Objects Using ...Understanding Natural Language Instructions for Fetching Daily Objects Using ...
Understanding Natural Language Instructions for Fetching Daily Objects Using ...Aly Magassouba
 
Nicolas Kruchten @ Datacratic
Nicolas Kruchten @ DatacraticNicolas Kruchten @ Datacratic
Nicolas Kruchten @ DatacraticPAPIs.io
 

Ähnlich wie RTB Bid Landscape in Adform (20)

RTBMA ECIR 2016 tutorial
RTBMA ECIR 2016 tutorialRTBMA ECIR 2016 tutorial
RTBMA ECIR 2016 tutorial
 
Machine learning for profit: Computational advertising landscape
Machine learning for profit: Computational advertising landscapeMachine learning for profit: Computational advertising landscape
Machine learning for profit: Computational advertising landscape
 
Fast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick GorskiFast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick Gorski
 
Computational Advertising in Yelp Local Ads
Computational Advertising in Yelp Local AdsComputational Advertising in Yelp Local Ads
Computational Advertising in Yelp Local Ads
 
Reel bid insightd-eproject
Reel bid   insightd-eprojectReel bid   insightd-eproject
Reel bid insightd-eproject
 
Weinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display Advertising
Weinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display AdvertisingWeinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display Advertising
Weinan Zhang's KDD15 Talk: Statistical Arbitrage Mining for Display Advertising
 
CIKM 2013 Tutorial: Real-time Bidding: A New Frontier of Computational Advert...
CIKM 2013 Tutorial: Real-time Bidding: A New Frontier of Computational Advert...CIKM 2013 Tutorial: Real-time Bidding: A New Frontier of Computational Advert...
CIKM 2013 Tutorial: Real-time Bidding: A New Frontier of Computational Advert...
 
A History of Programmatic Media
A History of Programmatic MediaA History of Programmatic Media
A History of Programmatic Media
 
Machine Learning for Computational Advertising
Machine Learning for Computational AdvertisingMachine Learning for Computational Advertising
Machine Learning for Computational Advertising
 
TripleLift: Preparing for a New Programmatic Ad-Tech World
TripleLift: Preparing for a New Programmatic Ad-Tech WorldTripleLift: Preparing for a New Programmatic Ad-Tech World
TripleLift: Preparing for a New Programmatic Ad-Tech World
 
Wsdm2015
Wsdm2015Wsdm2015
Wsdm2015
 
Ad science bid simulator (public ver)
Ad science bid simulator (public ver)Ad science bid simulator (public ver)
Ad science bid simulator (public ver)
 
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Predi...
 
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo StepanchukBig Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
Big Data in Advertising Industry — Oleksandr Fedirko, Danylo Stepanchuk
 
Machine Learning for Performance Advertising
Machine Learning for Performance AdvertisingMachine Learning for Performance Advertising
Machine Learning for Performance Advertising
 
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
Yelp Ad Targeting at Scale with Apache Spark with Inaz Alaei-Novin and Joe Ma...
 
Programmatic advertising - overview
Programmatic advertising - overviewProgrammatic advertising - overview
Programmatic advertising - overview
 
Int'l Conference on Predictive APIs: RTB Optimizer presentation
Int'l Conference on Predictive APIs: RTB Optimizer presentationInt'l Conference on Predictive APIs: RTB Optimizer presentation
Int'l Conference on Predictive APIs: RTB Optimizer presentation
 
Understanding Natural Language Instructions for Fetching Daily Objects Using ...
Understanding Natural Language Instructions for Fetching Daily Objects Using ...Understanding Natural Language Instructions for Fetching Daily Objects Using ...
Understanding Natural Language Instructions for Fetching Daily Objects Using ...
 
Nicolas Kruchten @ Datacratic
Nicolas Kruchten @ DatacraticNicolas Kruchten @ Datacratic
Nicolas Kruchten @ Datacratic
 

Kürzlich hochgeladen

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 

Kürzlich hochgeladen (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 

RTB Bid Landscape in Adform

  • 1. RTB Bid Landscape survival models powered by bitmaps Łukasz Mączewski Senior Data Scientist Przemysław Piotrowski Senior Data Engineer AI&Analytics, Adform 14.05.2019 1
  • 2. Agenda • Real Time Bidding – RTB • Bidding Strategy • Bid Landscape • Bitmap population selection • Survival analysis • Validation • Product demo 2
  • 3. Real-Time Bidding (RTB) RTB allows for fully automated ad inventory selling / buying in a real-time 3 Advertiser 1 Advertiser 2 Advertiser 3 Advertiser n AudienceInventory Auction Demand Side Platform is to estimate optimal bid price according to: • Targeting (audience) • Business requirements (KPIs) • Ad inventory (context, ad slot price) • Campaign budget and it's spend plan
  • 4. First price vs. second price auction First price auction: - Pay exact amount that was bidded - Bid price is close to a real impression value - Seller revenue maximization 4 Second price auction: - Pay 0.00001$ more than second-highest bid price - Bid as high as possible to maximize winning probability In the past the second price auction was preferred, but currently the first price auction becomes more and more popular
  • 5. RTB ecosystem 5 diverse audience web page Publisher 1 and its inventory SSP 1 Supply Side Platform Publisher n and its inventory . . . Ad Exchange DMP Data Management Platform DSP 1 Demand Side Platform SSP n DSP n . . . advertisers demographics, browsing, interest segments data cookie profile data feedback data ad-slot / banner data campaign data Bidding strategy optimisation Time to bid is ~100 ms
  • 6. Bidding strategy optimisation 6 Bidding Strategy Optimization Candidate impression valuation with respect to KPIs and budget constraints DSP algorithms Evaluation of bid request business potential Bid Landscape Estimation of winning probability at bid price Budget throttle Control budget spend Optimal bid price
  • 7. Numbers 7 Adform provides an integrated Software as a Service platform for the buying, managing and serving of digital advertising. Presence in EU, US, APAC 3 millions bid request per second billions of engagements daily 5 millions scoring of algorithms per second 200 thousands of cookie profiles in DMP
  • 9. Discovering knowledge from massive RTB data 9 Cookie profile/ Targeting: • device type • os type • browser type • screen size • country • regions • city • clicks made • visited log points • visited domains • urls • IAB categories • ... Feedback: • acquisition/conversion • click • order status • revenue • viewability • ... Adslot/ banners/ creatives: • inventory name • domain • URL • banner position • banner size • banner type • floor price • video player properties • ... Data Management Platform: • demography • interests and preferences • profession • ... Campaign: • budget • pricing type • deals • targeting • brand safety • ...
  • 10. Discovering knowledge from massive RTB data 10 Huge class imbalance Mostly categorical data High cardinality Sparsity High risk of overfit Data summary Reduction of dimensionality Preselection of features RegularisationData sampling Cross validationEnsembling
  • 11. RTB models 11 CPC Cost per Click CPL Cost per Lead netCPL Confirmed lead CPV Cost per Viewability ROAS Return on Ad Spend VCR Video Completion Rate
  • 12. Adform machine learning pipeline 12 Train/Test data sets preparation (dedicated ETLs) Data preprocessing and feature engineering Feature selection Building model and hyperparameters optimization Offline backtests Online A/B tests Deployment Live monitoring After successful model deployment it is periodically retrained
  • 13. RTB Algorithms Pipeline 13 PricingAPI Train Data JAR JAR OFFLINE JAR JAR ONLINE JAR ONLINE SPEC ONLINE MONITORING
  • 14. Backtests • Offline procedure to check performance of two competing models in time • Sequential analysis of historical data: - Data divided into training and test sets • Time series analysis of critical performance KPIs: - AUC - Sparsity - Target rates • If the backtests succeed the algorithm is submitted to online A/B testing procedure 14 AUC SPARSITY Target rate Time
  • 15. A/B test online monitoring 15 Split traffic to hash(cookie_id)%1024 buckets For each experiment monitor • Cost per target • Target rate • Target count • Number of impressions • ... many more
  • 16.
  • 18. User Stories As a Trafficker I want to know volume of impressions my campaign would win in RTB given the selected inventory, targeting and budget settings (CPM). As a DSP I want to enable Advertisers winning impressions for lowest possible price. 18 Bid Shading Demand Side Platform protects Advertiser from overpaying while AdTech ecosystem migrating from second-price to first-price RTB auctions.
  • 19. Forecasting & Bid Landscape Forecasting • Total number of matching Bid Requests for given Campaign Targeting • Bid Opportunities 19 Bid Landscape • Predicting probability of winning auction with a given bid price and Campaign Settings • Win rate evolution with a bid price Impressions CPM
  • 20. Bid Landscape • Building Bid Landscape model using historical data regarding won and failed auctions: For lost auctions win price is unknown • Right Censored Data Survival Analysis The exact moment of failure for majority of objects is unknown 20 All Bids Won Bids Relative Rate Bid Price Buckets
  • 21. Population selection for Bid Landscape modelling 21
  • 22. The Data 22 Input Bids and Impressions with features Impression is a Bid that won the auction 100000 2000 5000 100 4000 3000 ... 4500 Output cardinality of selected (Campaign Settings) Bids bucketed by bid price Impressions bucketed by bid price Impressions bucketed by win price 100 buckets in our case How? Population Selection – select relevant population from high-dimensional space for survival analysis. source:biretail.com
  • 24. Nested queries on millions of features 24 waw 1 0 0 0 1 0 0 1 … 0 iOS 0 0 1 0 1 1 0 1 … 0 11:00 1 1 0 0 1 0 0 1 … 0 Bid 0 1 2 3 4 5 6 7 … 100mln Matching bids 0 0 0 0 1 0 0 1 … 0 BidPriceBuckets 9900 790 2000 …. 100 Counts 3 mln other features AND, OR, NOT, XOR
  • 25. 25 Buckets' cardinality 10000 3000 ... 4500 2000 100 ... 0 1000 800 ... 400 Survival Analysis
  • 26. Roaring Bitmap 26 Roaring Bitmap – state of the art compressed bitmap algorithm. Hybrid approach between sparse and dense bitmaps representation. roaringbitmap.org/about/ „Use Roaring for bitmap compression whenever possible. Do not use other bitmap compression methods” [1] [1] Wang, Jianguo, et al. "An experimental study of bitmap compression vs. inverted list compression." 2017.
  • 27. Bitmap Execution Database 27 • Roaring Bitmap – state of the art compressed bitmap algorithm • Whole 7 days of Internet bid opportunities – 12GB (sample 1/1000) Naive 50TB • Bids&Impressions – 5GB (sample 1/100) • Majority of responses under 1 second
  • 28. Nested set operations language 28 And( Or("inventory|1", "inventory|2", "inventory|3"), AndNot( Universe, Or("category|moto", "category|fishing") ), Or( "hour_of_week|41", "hour_of_week|42", "hour_of_week|43", "hour_of_week|44" ), Or("tld|com", "website|example.com"), And( "country|poland", AndNot( Universe, "city|warsaw" ) ), ... )
  • 30. Principles of survival analysis • The basic assumption in survival analysis is that the risk of failure changes with time • Two major approaches for survival probability modelling can be distinguished: - Parametric – can be extrapolated for new experiences (Weibull) - Non-parametric – very flexible but can't be extended for new experience (Kaplan-Maier) 30 FailureRate
  • 31. Data censoring in survival analysis 31
  • 32. Weibull based survival probability estimation • Weibull distribution is derived from hazard function: where: - S(t) - survival function (cdf) - F(t) = 1 – S(t) - failure function (cdf) - f(t) = dF(t)/dt - pdf • Weibull distribution is defined by 2 parameters: - Shape parameter p - failure mode type - Scale parameter l - position 32 Time Survive . 0 t ∆t Fail .
  • 33. Weibull based survival probability estimation • One can distinguish 3 classes of failure types: - Infant mortality p < 1 - Random failures p = 1 - Ware out p > 1 33 Weibull examples Parametric approach easy to extrapolate age Hazard
  • 34. Kaplan-Maier based survival probability estimation • In Kaplan-Meier method we are to predict chance of surviving time ∆t after surviving time t: 34 No parameters to be estimated, very flexible and easy in implementation S(t,Δt) = S(t) • S(Δt|t) Time Survive . 0 t ∆t Survive.
  • 35. Kaplan-Meier curve calculation 35 S(t,Δt) = S(t) • S(Δt|t) Time (weeks) KM Curve for Group 1 (treatment)
  • 36. Elements of survival analysis in Bid Landscape modelling 36
  • 37. RTB auctions modelling with survival methods From survival models to Bid Landscape: - time -> cost per 1k bought impressions (CPM) - died -> won bid (left censored, known win price) - alive -> lost bid (right censored, unknown exact win price) 37 our win price (second price auction) competitor outbid price (unknown) our bid price
  • 38. Bid Landscape vs. Survival Analysis Survival analysis • Risk of failure evolves with time • Incomplete knowledge regarding failure time, right censoring: - Majority of object survive • Multiple competing failure modes • Very high disproportion between positive (failures) and negative (survived) class representatives • Usually very limited data • Wrong predictions may have catastrophic consequences – quality monitoring required 38 Bid Landscape analysis • Winning probability evolves with price • Incomplete knowledge regarding win price, right censoring: - Only small fraction of bids are won • Different inventory sources • Very high disproportion between positive (won) and negative (lost) class representatives • Usually huge data sets – subsampling • Wrong predictions can lead to not efficient campaign money spend – quality monitoring required
  • 39. Survival model for Bid Landscape modelling 39 • Let's apply survival techniques to auction data: - Instead of time a bid price is to be used • Legend: • number of lost auctions • number of won impressions - win price CDF - Kaplan-Maier estimate - Weibull estimate Adform Bid Landscape exploits Kaplan-Maier estimate exclusively Bid Price (CPM) Multiplicity WinRate
  • 40. Bid Landscape validation in Adform 40
  • 41. Bid Landscape validation – qualitative 41 Win rate evolution with bid price Win rate evolution over day The bid landscape predictions are in line with the intuition: • The win rate increases with bid price • The win rate changes periodically over a day
  • 42. Bid Landscape validation – quantitative (real RTB data) 42 Real auction data versus Bid Landscape predictions
  • 43. Bid Landscape as Adform product 43
  • 47. Bid Multipliers optimisation 47 Polish accountants' portal Jobs for professionals Video oriented website Impressions Source of traffic: • Campaign targeting Warsaw people to be run on three different domains • Global CPM makes no sense, needs to be adjusted
  • 48. Bid Landscape dynamic CPM algorithm 48 Given the Campaign Settings including hourly budget maximize bought relevant impressions by using lowest CPM. First tests are showing multiplied Viewable Impressions, Clicks comparing to same Campaign using static price.
  • 49. Summary and conclusions • The Adform Bid Landscape is composed of two base elements ensuring speed and quality: - Bitmaps – fast and efficient population selection - Kaplan-Maier win rates – simple and precise estimate • The Adform Bid Landscape is in production for more than one year - It is mostly used by Adform customers optimising the CPM campaigns - Currently we develop new RTB algorithm that is based on BL predictions - In the short time perspective we are to extend usage of BL in other RTB models 49
  • 50. 5