Machine Learning RUM - Velocity 2016

•Als PPTX, PDF herunterladen•

1 gefällt mir•852 views

Patrick Meenan

Presentation from Velocity 2016 on using Machine Learning to determine the metrics that drive bounce and conversions

Technologie

Using machine learning
to determine drivers
of bounce and conversion
2016 Velocity Santa Clara

Pat Meenan
@patmeenan
Tammy Everts
@tameverts

Get the code
https://github.com/WPO-Foundation/beacon-ml

Random Forest
Lots of random decision trees

Vectorizing the data
• Everything needs to be numeric
• Strings converted to several inputs as
yes/no (1/0)
• i.e. Device Manufacturer
– “Apple” would be a discrete input
• Watch out for input explosion (UA String)

Balancing the data
• 3% Conversion Rate
• 97% Accurate by always guessing no
• Subsample the data for 50/50 mix

Validation Data
• Train on 80% of the data
• Validate on 20% to prevent overfitting

Smoothing the data
• ML works best on normally distributed data
scaler = StandardScaler()
x_train = scaler.fit_transform(x_train)
x_val = scaler.transform(x_val)

Input/Output Relationships
• SSL highly correlated with Conversions
• Long sessions highly correlated with not
bouncing
• Remove correlated features from training

Training Deep Learning
model = Sequential()
model.add(...)
model.compile(optimizer='adagrad',
loss='binary_crossentropy',
metrics=["accuracy"])
model.fit(x_train,
y_train,
nb_epoch=EPOCH_COUNT,
batch_size=32,
validation_data=(x_val, y_val),
verbose=2,
shuffle=True)

$Training Random Forest clf = RandomForestClassifier(n_estimators=FOREST_SIZE, criterion='gini', max_depth=None, min_samples_split=2, min_samples_leaf=1, min_weight_fraction_leaf=0.0, max_features='auto', max_leaf_nodes=None, bootstrap=True, oob_score=False, n_jobs=12, random_state=None, verbose=2, warm_start=False, class_weight=None) clf.fit(x_train, y_train)$

Feature Importances
clf.feature_importances_

Empfohlen

Using machine learning to determine drivers of bounce and conversionTammy Everts

Mining model for hotel recommendations (Kaggle Challenge)Arjun Varma

AdWorld Experience - Excel Forumals to supercharge your reprotingAnu Adegbola

BigML Education - EnsemblesBigML, Inc

Over fitting underfittingSivapriyaS12

Data Analysis With Spss - ReliabilityDr Ali Yusob Md Zain

Sample cronbach analysis using karaAiden Yeh

Machine learning systems for engineersCameron Joannidis

Empfohlen

Using machine learning to determine drivers of bounce and conversionTammy Everts

Mining model for hotel recommendations (Kaggle Challenge)Arjun Varma

AdWorld Experience - Excel Forumals to supercharge your reprotingAnu Adegbola

BigML Education - EnsemblesBigML, Inc

Over fitting underfittingSivapriyaS12

Data Analysis With Spss - ReliabilityDr Ali Yusob Md Zain

Sample cronbach analysis using karaAiden Yeh

Machine learning systems for engineersCameron Joannidis

TLS - 2016 Velocity TrainingPatrick Meenan

Scaling Front-End Performance - Velocity 2016Patrick Meenan

Service workers - Velocity 2016 TrainingPatrick Meenan

Service Workers for PerformancePatrick Meenan

Measuring the visual experience of website performancePatrick Meenan

Selecting and deploying automated optimization solutionsPatrick Meenan

Front-End Single Point of Failure - Velocity 2016 TrainingPatrick Meenan

WebPagetest Power Users - Velocity 2014Patrick Meenan

Velocity EU 2012 - Third party scripts and youPatrick Meenan

Fail WellJoshua Simmons

Service workersjungkees

Measuring performance - Velocity 2016 TrainingPatrick Meenan

Web Page Test - Beyond the BasicsAndy Davies

Opticon 2015-Pushing the Boundaries of OptimizelyOptimizely

Debugging Distributed Systems - Velocity Santa Clara 2016Donny Nadolny

Using machine learning to determine drivers of bounce and conversion (part 2)Tammy Everts

Velocity 2016 Speaking Session - Using Machine Learning to Determine Drivers ...SOASTA

Fraud Detection for Insurance ClaimsYit Wei (Jason) Chia

Machine Learning - Splitting DatasetsAndrew Ferlitsch

Introduction to Machine learning - DBA's to data scientists - Oct 2020 - OGBEmeaSandesh Rao

Introduction to Machine Learning - From DBA's to Data Scientists - OGBEMEASandesh Rao

Part 3 Machine LearnningMohamed Essam

Weitere ähnliche Inhalte

Andere mochten auch

TLS - 2016 Velocity TrainingPatrick Meenan

Scaling Front-End Performance - Velocity 2016Patrick Meenan

Service workers - Velocity 2016 TrainingPatrick Meenan

Service Workers for PerformancePatrick Meenan

Measuring the visual experience of website performancePatrick Meenan

Selecting and deploying automated optimization solutionsPatrick Meenan

Front-End Single Point of Failure - Velocity 2016 TrainingPatrick Meenan

WebPagetest Power Users - Velocity 2014Patrick Meenan

Velocity EU 2012 - Third party scripts and youPatrick Meenan

Fail WellJoshua Simmons

Service workersjungkees

Measuring performance - Velocity 2016 TrainingPatrick Meenan

Web Page Test - Beyond the BasicsAndy Davies

Opticon 2015-Pushing the Boundaries of OptimizelyOptimizely

Debugging Distributed Systems - Velocity Santa Clara 2016Donny Nadolny

Andere mochten auch (15)

TLS - 2016 Velocity Training

Scaling Front-End Performance - Velocity 2016

Service workers - Velocity 2016 Training

Service Workers for Performance

Measuring the visual experience of website performance

Selecting and deploying automated optimization solutions

Front-End Single Point of Failure - Velocity 2016 Training

WebPagetest Power Users - Velocity 2014

Velocity EU 2012 - Third party scripts and you

Fail Well

Service workers

Measuring performance - Velocity 2016 Training

Web Page Test - Beyond the Basics

Opticon 2015-Pushing the Boundaries of Optimizely

Debugging Distributed Systems - Velocity Santa Clara 2016

Ähnlich wie Machine Learning RUM - Velocity 2016

Using machine learning to determine drivers of bounce and conversion (part 2)Tammy Everts

Velocity 2016 Speaking Session - Using Machine Learning to Determine Drivers ...SOASTA

Fraud Detection for Insurance ClaimsYit Wei (Jason) Chia

Machine Learning - Splitting DatasetsAndrew Ferlitsch

Introduction to Machine learning - DBA's to data scientists - Oct 2020 - OGBEmeaSandesh Rao

Introduction to Machine Learning - From DBA's to Data Scientists - OGBEMEASandesh Rao

Part 3 Machine LearnningMohamed Essam

NEURAL Network Design TrainingESCOM

Echelon Asia Summit 2017 Startup Academy WorkshopGarrett Teoh Hor Keong

PresentationTomas Lukas Komar

Churn Modeling-For-Mobile-Telecommunications Salford Systems

Techniques for effective test data management in test automation.pptxKnoldus Inc.

The Power of Auto ML and How Does it WorkIvo Andreev

Mining Big Data Streams with APACHE SAMOAAlbert Bifet

Machine learning with scikitlearnPratap Dangeti

Machine learning and_nlpankit_ppt

Machine Learning in Autonomous Data WarehouseSandesh Rao

Automate your Machine LearningAjit Ananthram

in5490-classification (1).pptxMonicaTimber

Internship_presentationAditya Gautam

Ähnlich wie Machine Learning RUM - Velocity 2016 (20)

Using machine learning to determine drivers of bounce and conversion (part 2)

Velocity 2016 Speaking Session - Using Machine Learning to Determine Drivers ...

Fraud Detection for Insurance Claims

Machine Learning - Splitting Datasets

Introduction to Machine learning - DBA's to data scientists - Oct 2020 - OGBEmea

Introduction to Machine Learning - From DBA's to Data Scientists - OGBEMEA

Part 3 Machine Learnning

NEURAL Network Design Training

Echelon Asia Summit 2017 Startup Academy Workshop

Presentation

Churn Modeling-For-Mobile-Telecommunications

Techniques for effective test data management in test automation.pptx

The Power of Auto ML and How Does it Work

Mining Big Data Streams with APACHE SAMOA

Machine learning with scikitlearn

Machine learning and_nlp

Machine Learning in Autonomous Data Warehouse

Automate your Machine Learning

in5490-classification (1).pptx

Internship_presentation

Mehr von Patrick Meenan

Resource PrioritizationPatrick Meenan

HTTP/2 PrioritizationPatrick Meenan

Getting the most out of WebPageTestPatrick Meenan

Http2 in practicePatrick Meenan

Resource loading, prioritization, HTTP/2 - oh my!Patrick Meenan

How fast is it?Patrick Meenan

Velocity 2014 nyc WebPagetest private instancesPatrick Meenan

Mobile web performance - MoDev EastPatrick Meenan

Tracking Performance - Velocity NYC 2013Patrick Meenan

Image optimizationPatrick Meenan

Google I/O 2012 - Protecting your user experience while integrating 3rd party...Patrick Meenan

Velocity 2012 - Taming the Mobile BeastPatrick Meenan

Measuring web performancePatrick Meenan

Frontend SPOFPatrick Meenan

Web Performance OptimizationPatrick Meenan

Web performance testingPatrick Meenan

Making the web fasterPatrick Meenan

Hands on performance testing and analysis with web pagetestPatrick Meenan

Mehr von Patrick Meenan (18)

Resource Prioritization

HTTP/2 Prioritization

Getting the most out of WebPageTest

Http2 in practice

Resource loading, prioritization, HTTP/2 - oh my!

How fast is it?

Velocity 2014 nyc WebPagetest private instances

Mobile web performance - MoDev East

Tracking Performance - Velocity NYC 2013

Image optimization

Google I/O 2012 - Protecting your user experience while integrating 3rd party...

Velocity 2012 - Taming the Mobile Beast

Measuring web performance

Frontend SPOF

Web Performance Optimization

Web performance testing

Making the web faster

Hands on performance testing and analysis with web pagetest

Kürzlich hochgeladen

Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK

How to convert PDF to text with Nanonetsnaman860154

[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745

08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia

08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls

Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

Google AI Hackathon: LLM based Evaluator for RAGSujit Pal

IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge

A Domino Admins Adventures (Engage 2024)Gabriella Davis

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

Finology Group – Insurtech Innovation Award 2024The Digital Insurer

Kürzlich hochgeladen (20)

Unblocking The Main Thread Solving ANRs and Frozen Frames

How to convert PDF to text with Nanonets

[2024]Digital Global Overview Report 2024 Meltwater.pdf

08448380779 Call Girls In Civil Lines Women Seeking Men

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men

Maximizing Board Effectiveness 2024 Webinar.pptx

The 7 Things I Know About Cyber Security After 25 Years | April 2024

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Data Cloud, More than a CDP by Matt Robison

08448380779 Call Girls In Friends Colony Women Seeking Men

Google AI Hackathon: LLM based Evaluator for RAG

IAC 2024 - IA Fast Track to Search Focused AI Solutions

A Domino Admins Adventures (Engage 2024)

Boost PC performance: How more available memory can improve productivity

My Hashitalk Indonesia April 2024 Presentation

Breaking the Kubernetes Kill Chain: Host Path Mount

Handwritten Text Recognition for manuscripts and early printed texts

Finology Group – Insurtech Innovation Award 2024

Machine Learning RUM - Velocity 2016

1. Using machine learning to determine drivers of bounce and conversion 2016 Velocity Santa Clara

2. Pat Meenan @patmeenan Tammy Everts @tameverts

3. What we did

4. Get the code https://github.com/WPO-Foundation/beacon-ml

5. Deep Learning Weights

6. Random Forest Lots of random decision trees

7. Vectorizing the data • Everything needs to be numeric • Strings converted to several inputs as yes/no (1/0) • i.e. Device Manufacturer – “Apple” would be a discrete input • Watch out for input explosion (UA String)

8. Balancing the data • 3% Conversion Rate • 97% Accurate by always guessing no • Subsample the data for 50/50 mix

9. Validation Data • Train on 80% of the data • Validate on 20% to prevent overfitting

10. Smoothing the data • ML works best on normally distributed data scaler = StandardScaler() x_train = scaler.fit_transform(x_train) x_val = scaler.transform(x_val)

11. Input/Output Relationships • SSL highly correlated with Conversions • Long sessions highly correlated with not bouncing • Remove correlated features from training

12. Training Deep Learning model = Sequential() model.add(...) model.compile(optimizer='adagrad', loss='binary_crossentropy', metrics=["accuracy"]) model.fit(x_train, y_train, nb_epoch=EPOCH_COUNT, batch_size=32, validation_data=(x_val, y_val), verbose=2, shuffle=True)

13. Training Random Forest clf = RandomForestClassifier(n_estimators=FOREST_SIZE, criterion='gini', max_depth=None, min_samples_split=2, min_samples_leaf=1, min_weight_fraction_leaf=0.0, max_features='auto', max_leaf_nodes=None, bootstrap=True, oob_score=False, n_jobs=12, random_state=None, verbose=2, warm_start=False, class_weight=None) clf.fit(x_train, y_train)

14. Feature Importances clf.feature_importances_

15. What we learned

16. Takeaways

17.

18. Thanks!