SlideShare a Scribd company logo
1 of 22
Download to read offline
Google Analytics sampling
limitations
and how to overcome them
George	
  Papadongonas
Web	
  Analyst,	
  Amazee	
  Metrics
16/7/2013
How Google Analytics stores data
• All unfiltered data of a web property (up to 10 million hits per
month) are stored in the Google Analytics database
• Each standard report has an associated data table in the Google
Analytics database with unsampled data
• Reports for accounts with more than 200,000 visits per day are
processed daily
• Reports for accounts with less than 200,000 visits per day are
processed more often
2
Google Analytics
Report Sampling
• Sampling starts when the requested date range has more than
500,000 visits
• The sample size can be arranged by using the Google Analytics
sampling slider. The default setting is for 250,000 visits, maximum
setting is 500,000
• Visits are counted for the specific date range on a Web Property level,
not on a Profile level
• Standard reports without filters, advanced segments or secondary
dimensions use always unsampled data
• Sampling applies to custom reports
3
Organic Search traffic report
is unsampled
4
By adding a second dimension,
sampling is applied
5
Adjust the sampling size
6
Prefer the
higher precision setting
7
Google Analytics
Report Sampling
8
How sampling is calculated
• Web property has 24,580,303 visits
• Profile has 492,786 visits
• Default sampling is: 250,000x492,786/24,580,303 = 5,012 visits (1%)
• Maximum sampling is: 500,000x492,786/24,580,303 = 10,024 visits (2%)
9
Avoid the
faster processing setting
10
Prefer the
higher precision setting
11
Pageviews or events reports
can be unreliable
12
Google Analytics
Report Sampling
• Visits and Visitors reports are usually reliable, even with a small
sample
• E-commerce transactions, individual pageviews, adwords data,
revenue and goal conversions are less reliable
13
Solutions
14
1. Buy Google Analytics Premium
• “Only”$150.000 / year
• 1 Billion hits processed per month
• Unsampled reports
• Data processing every 4 hours
15
2. Create custom profiles
• Instead of creating reports with specific advanced segments,
create custom profiles using filters
• The default reports of all profiles are always unsampled, even if the
visitors are more than 500,000
16
3. Enable Data Sampling
• Sample your data , by adding a line in the Google Analytics
tracking code
code_gaq.push(['_setSampleRate', '80']); Sets
sampling rate at 80%
• Not a perfect solution, as the data are still sampled, but you have
control and can avoid tracking interruption (for more than 10
million hits per month)
17
4. Use smaller date range
• Break you report in smaller data ranges, each one having less than
500.000 visits
• This ensures that the data are unsampled
• Export the reports using the Google Analytics API
• Aggregate the data in Excel and create the master report
18
5. Use analyticscanvas.com
• Analytics Canvas offers query partitioning, using the Google
Analytics API.
• Reports are exported in smaller date ranges, so that they are
unsampled and they are then merged automatically with Analytics
Canvas.
19
6. Download Google Analytics
data locally
• It is possible to keep a local copy of Google Analytics data
• Add a line in the Google Analytics tracking code
_gaq.push(['_setLocalRemoteServerMode']);
• Add _utm.gif to your web server root
20
6. Download Google Analytics
data locally
86.138.209.96 www.mysite.com - [01/Oct/2007:03:34:02 +0100] "GET /__utm.gif?utmwv=1&utmt=var&utmn=
2108116629 HTTP/1.1" 200 35 "http://www.mysite.com/pageX.htm" "Mozilla/4.0 (compatible; MSIE 6.0;
Windows NT 5.1; SV1; .NET CLR 1.1.4322)" "__utma=1.117971038.1175394730.1175394730.1175394730.1;
__utmb=1; __utmc=1; __utmz=1.1175394730.1.1.utmcid=23|utmgclid=CP-Bssq-oIsCFQMrlAodeUThgA|
utmccn=(not+set)|utmcmd=(not+set)|utmctr=looking+for+site; __utmv=1.Section One"
21
• Data are recorded in the server log files
• Use http://analytics.angelfishstats.com/ to analyze them, as Urchin is
discontiniued
Thanks!
22

More Related Content

What's hot

MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?
MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?
MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?Michaela Linhart
 
Yali presentation for snowplow amsterdam meetup number 2
Yali presentation for snowplow amsterdam meetup number 2Yali presentation for snowplow amsterdam meetup number 2
Yali presentation for snowplow amsterdam meetup number 2yalisassoon
 
Data Driven Attribution in BigQuery with Shapley Values and Markov Chains
Data Driven Attribution in BigQuery with Shapley Values and Markov ChainsData Driven Attribution in BigQuery with Shapley Values and Markov Chains
Data Driven Attribution in BigQuery with Shapley Values and Markov ChainsChristopher Gutknecht
 
How to Avoid Sampling in Google Analytics
How to Avoid Sampling in Google AnalyticsHow to Avoid Sampling in Google Analytics
How to Avoid Sampling in Google AnalyticsYevheniia Chernenko
 
One Further - Spektrix and Google Analytics 4
One Further - Spektrix and Google Analytics 4One Further - Spektrix and Google Analytics 4
One Further - Spektrix and Google Analytics 4One Further
 
Data driven video advertising campaigns - JustWatch & Snowplow
Data driven video advertising campaigns - JustWatch & SnowplowData driven video advertising campaigns - JustWatch & Snowplow
Data driven video advertising campaigns - JustWatch & SnowplowGiuseppe Gaviani
 
Simply Business and Snowplow - Multichannel Attribution Analysis
Simply Business and Snowplow - Multichannel Attribution AnalysisSimply Business and Snowplow - Multichannel Attribution Analysis
Simply Business and Snowplow - Multichannel Attribution AnalysisStewart Duncan
 
What's new in Google Analytics 4
What's new in Google Analytics 4What's new in Google Analytics 4
What's new in Google Analytics 4Stephen Ellis
 
Jonathan Weber - All Things DATA 2017
Jonathan Weber - All Things DATA 2017Jonathan Weber - All Things DATA 2017
Jonathan Weber - All Things DATA 2017Shuki Mann
 
BCCCA Google Analytics for Improved Lead Generation
BCCCA Google Analytics for Improved Lead GenerationBCCCA Google Analytics for Improved Lead Generation
BCCCA Google Analytics for Improved Lead GenerationPhilippe Taza
 
Beyond the clouds with digital analytics
Beyond the clouds with digital analyticsBeyond the clouds with digital analytics
Beyond the clouds with digital analyticsMark Edmondson
 
The culture trip snowplow implementation
The culture trip snowplow implementationThe culture trip snowplow implementation
The culture trip snowplow implementationidan_by
 
Google Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress MeetupGoogle Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress MeetupRich Plakas
 
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use CasesTatvic Analytics
 
Google analytics version 4 in details
Google analytics version 4 in detailsGoogle analytics version 4 in details
Google analytics version 4 in detailsOmkar Nandi
 
Snowplow at Sigfig
Snowplow at SigfigSnowplow at Sigfig
Snowplow at Sigfigyalisassoon
 
How Incuda builds user journey models with Snowplow
How Incuda builds user journey models with SnowplowHow Incuda builds user journey models with Snowplow
How Incuda builds user journey models with SnowplowGiuseppe Gaviani
 
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016yalisassoon
 
Google Analytics For Retailers - Getting Your House in Order
Google Analytics For Retailers - Getting Your House in OrderGoogle Analytics For Retailers - Getting Your House in Order
Google Analytics For Retailers - Getting Your House in OrderRod Jacka
 

What's hot (20)

MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?
MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?
MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?
 
Yali presentation for snowplow amsterdam meetup number 2
Yali presentation for snowplow amsterdam meetup number 2Yali presentation for snowplow amsterdam meetup number 2
Yali presentation for snowplow amsterdam meetup number 2
 
Data Driven Attribution in BigQuery with Shapley Values and Markov Chains
Data Driven Attribution in BigQuery with Shapley Values and Markov ChainsData Driven Attribution in BigQuery with Shapley Values and Markov Chains
Data Driven Attribution in BigQuery with Shapley Values and Markov Chains
 
How to Avoid Sampling in Google Analytics
How to Avoid Sampling in Google AnalyticsHow to Avoid Sampling in Google Analytics
How to Avoid Sampling in Google Analytics
 
One Further - Spektrix and Google Analytics 4
One Further - Spektrix and Google Analytics 4One Further - Spektrix and Google Analytics 4
One Further - Spektrix and Google Analytics 4
 
Data driven video advertising campaigns - JustWatch & Snowplow
Data driven video advertising campaigns - JustWatch & SnowplowData driven video advertising campaigns - JustWatch & Snowplow
Data driven video advertising campaigns - JustWatch & Snowplow
 
Simply Business and Snowplow - Multichannel Attribution Analysis
Simply Business and Snowplow - Multichannel Attribution AnalysisSimply Business and Snowplow - Multichannel Attribution Analysis
Simply Business and Snowplow - Multichannel Attribution Analysis
 
What's new in Google Analytics 4
What's new in Google Analytics 4What's new in Google Analytics 4
What's new in Google Analytics 4
 
Jonathan Weber - All Things DATA 2017
Jonathan Weber - All Things DATA 2017Jonathan Weber - All Things DATA 2017
Jonathan Weber - All Things DATA 2017
 
BCCCA Google Analytics for Improved Lead Generation
BCCCA Google Analytics for Improved Lead GenerationBCCCA Google Analytics for Improved Lead Generation
BCCCA Google Analytics for Improved Lead Generation
 
Beyond the clouds with digital analytics
Beyond the clouds with digital analyticsBeyond the clouds with digital analytics
Beyond the clouds with digital analytics
 
The culture trip snowplow implementation
The culture trip snowplow implementationThe culture trip snowplow implementation
The culture trip snowplow implementation
 
Google Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress MeetupGoogle Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
 
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases
 
Google analytics version 4 in details
Google analytics version 4 in detailsGoogle analytics version 4 in details
Google analytics version 4 in details
 
Snowplow at Sigfig
Snowplow at SigfigSnowplow at Sigfig
Snowplow at Sigfig
 
How Incuda builds user journey models with Snowplow
How Incuda builds user journey models with SnowplowHow Incuda builds user journey models with Snowplow
How Incuda builds user journey models with Snowplow
 
Adobe part 1
Adobe part 1Adobe part 1
Adobe part 1
 
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
 
Google Analytics For Retailers - Getting Your House in Order
Google Analytics For Retailers - Getting Your House in OrderGoogle Analytics For Retailers - Getting Your House in Order
Google Analytics For Retailers - Getting Your House in Order
 

Similar to Google Analytics sampling limitations and how to overcome them

Usage Trend Reporting
Usage Trend Reporting Usage Trend Reporting
Usage Trend Reporting Gainsight
 
Google Analytics Training - full 2017
Google Analytics Training - full 2017Google Analytics Training - full 2017
Google Analytics Training - full 2017Nate Plaunt
 
10 Most Underused Features of Google Analytics 360 According to Experts
10 Most Underused Features of Google Analytics 360 According to Experts10 Most Underused Features of Google Analytics 360 According to Experts
10 Most Underused Features of Google Analytics 360 According to ExpertsTatvic Analytics
 
Geek Sync | Intro to Query Store
Geek Sync | Intro to Query StoreGeek Sync | Intro to Query Store
Geek Sync | Intro to Query StoreIDERA Software
 
Universal Analytics Out of Beta - BrightonSEO 2014
Universal Analytics Out of Beta - BrightonSEO 2014Universal Analytics Out of Beta - BrightonSEO 2014
Universal Analytics Out of Beta - BrightonSEO 2014darafitzgerald
 
Delivering fast, powerful and scalable analytics
Delivering fast, powerful and scalable analyticsDelivering fast, powerful and scalable analytics
Delivering fast, powerful and scalable analyticsMariaDB plc
 
Google Analytics for SEO Beginners
Google Analytics for SEO BeginnersGoogle Analytics for SEO Beginners
Google Analytics for SEO BeginnersAditya Todawal
 
Implementing google big query automation using google analytics data
Implementing google big query automation using google analytics dataImplementing google big query automation using google analytics data
Implementing google big query automation using google analytics dataCountants
 
Google Analytics - Webmaster Tools Pro Setup & Tips
Google Analytics -  Webmaster Tools Pro Setup & TipsGoogle Analytics -  Webmaster Tools Pro Setup & Tips
Google Analytics - Webmaster Tools Pro Setup & TipsRank Fuse Digital Marketing
 
Business analytics and data visualisation
Business analytics and data visualisationBusiness analytics and data visualisation
Business analytics and data visualisationShwetabh Jaiswal
 
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...Rittman Analytics
 
Tech4Africa Google Workshop 1
Tech4Africa Google Workshop 1Tech4Africa Google Workshop 1
Tech4Africa Google Workshop 1Sarah Blake
 
Comparing google analytics vs adobe analytics vs ibm
Comparing google analytics vs adobe analytics vs ibmComparing google analytics vs adobe analytics vs ibm
Comparing google analytics vs adobe analytics vs ibmCountants
 
Google Cloud Platform as a Backend Solution for your Product
Google Cloud Platform as a Backend Solution for your ProductGoogle Cloud Platform as a Backend Solution for your Product
Google Cloud Platform as a Backend Solution for your ProductSergey Smetanin
 
Google_Analytics_PPT.pptx
Google_Analytics_PPT.pptxGoogle_Analytics_PPT.pptx
Google_Analytics_PPT.pptxitsmahek2001
 
Bdf16 big-data-warehouse-case-study-data kitchen
Bdf16 big-data-warehouse-case-study-data kitchenBdf16 big-data-warehouse-case-study-data kitchen
Bdf16 big-data-warehouse-case-study-data kitchenChristopher Bergh
 
web analysis document | web analysis document | web analysis document | web a...
web analysis document | web analysis document | web analysis document | web a...web analysis document | web analysis document | web analysis document | web a...
web analysis document | web analysis document | web analysis document | web a...nazen2
 
GAUC 2020 - presentatie Hans en Reinier
GAUC 2020 - presentatie Hans en ReinierGAUC 2020 - presentatie Hans en Reinier
GAUC 2020 - presentatie Hans en ReinierOnline Dialogue
 

Similar to Google Analytics sampling limitations and how to overcome them (20)

Usage Trend Reporting
Usage Trend Reporting Usage Trend Reporting
Usage Trend Reporting
 
Google Analytics Training - full 2017
Google Analytics Training - full 2017Google Analytics Training - full 2017
Google Analytics Training - full 2017
 
10 Most Underused Features of Google Analytics 360 According to Experts
10 Most Underused Features of Google Analytics 360 According to Experts10 Most Underused Features of Google Analytics 360 According to Experts
10 Most Underused Features of Google Analytics 360 According to Experts
 
Geek Sync | Intro to Query Store
Geek Sync | Intro to Query StoreGeek Sync | Intro to Query Store
Geek Sync | Intro to Query Store
 
Universal Analytics Out of Beta - BrightonSEO 2014
Universal Analytics Out of Beta - BrightonSEO 2014Universal Analytics Out of Beta - BrightonSEO 2014
Universal Analytics Out of Beta - BrightonSEO 2014
 
Delivering fast, powerful and scalable analytics
Delivering fast, powerful and scalable analyticsDelivering fast, powerful and scalable analytics
Delivering fast, powerful and scalable analytics
 
Google Analytics for SEO Beginners
Google Analytics for SEO BeginnersGoogle Analytics for SEO Beginners
Google Analytics for SEO Beginners
 
Implementing google big query automation using google analytics data
Implementing google big query automation using google analytics dataImplementing google big query automation using google analytics data
Implementing google big query automation using google analytics data
 
Google Analytics - Webmaster Tools Pro Setup & Tips
Google Analytics -  Webmaster Tools Pro Setup & TipsGoogle Analytics -  Webmaster Tools Pro Setup & Tips
Google Analytics - Webmaster Tools Pro Setup & Tips
 
Business analytics and data visualisation
Business analytics and data visualisationBusiness analytics and data visualisation
Business analytics and data visualisation
 
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
 
Tech4Africa Google Workshop 1
Tech4Africa Google Workshop 1Tech4Africa Google Workshop 1
Tech4Africa Google Workshop 1
 
Comparing google analytics vs adobe analytics vs ibm
Comparing google analytics vs adobe analytics vs ibmComparing google analytics vs adobe analytics vs ibm
Comparing google analytics vs adobe analytics vs ibm
 
Google Cloud Platform as a Backend Solution for your Product
Google Cloud Platform as a Backend Solution for your ProductGoogle Cloud Platform as a Backend Solution for your Product
Google Cloud Platform as a Backend Solution for your Product
 
WGP Universal Analytics Breakfast Seminar
WGP Universal Analytics Breakfast SeminarWGP Universal Analytics Breakfast Seminar
WGP Universal Analytics Breakfast Seminar
 
Google_Analytics_PPT.pptx
Google_Analytics_PPT.pptxGoogle_Analytics_PPT.pptx
Google_Analytics_PPT.pptx
 
Bdf16 big-data-warehouse-case-study-data kitchen
Bdf16 big-data-warehouse-case-study-data kitchenBdf16 big-data-warehouse-case-study-data kitchen
Bdf16 big-data-warehouse-case-study-data kitchen
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 
web analysis document | web analysis document | web analysis document | web a...
web analysis document | web analysis document | web analysis document | web a...web analysis document | web analysis document | web analysis document | web a...
web analysis document | web analysis document | web analysis document | web a...
 
GAUC 2020 - presentatie Hans en Reinier
GAUC 2020 - presentatie Hans en ReinierGAUC 2020 - presentatie Hans en Reinier
GAUC 2020 - presentatie Hans en Reinier
 

Recently uploaded

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 

Recently uploaded (20)

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 

Google Analytics sampling limitations and how to overcome them

  • 1. Google Analytics sampling limitations and how to overcome them George  Papadongonas Web  Analyst,  Amazee  Metrics 16/7/2013
  • 2. How Google Analytics stores data • All unfiltered data of a web property (up to 10 million hits per month) are stored in the Google Analytics database • Each standard report has an associated data table in the Google Analytics database with unsampled data • Reports for accounts with more than 200,000 visits per day are processed daily • Reports for accounts with less than 200,000 visits per day are processed more often 2
  • 3. Google Analytics Report Sampling • Sampling starts when the requested date range has more than 500,000 visits • The sample size can be arranged by using the Google Analytics sampling slider. The default setting is for 250,000 visits, maximum setting is 500,000 • Visits are counted for the specific date range on a Web Property level, not on a Profile level • Standard reports without filters, advanced segments or secondary dimensions use always unsampled data • Sampling applies to custom reports 3
  • 4. Organic Search traffic report is unsampled 4
  • 5. By adding a second dimension, sampling is applied 5
  • 9. How sampling is calculated • Web property has 24,580,303 visits • Profile has 492,786 visits • Default sampling is: 250,000x492,786/24,580,303 = 5,012 visits (1%) • Maximum sampling is: 500,000x492,786/24,580,303 = 10,024 visits (2%) 9
  • 12. Pageviews or events reports can be unreliable 12
  • 13. Google Analytics Report Sampling • Visits and Visitors reports are usually reliable, even with a small sample • E-commerce transactions, individual pageviews, adwords data, revenue and goal conversions are less reliable 13
  • 15. 1. Buy Google Analytics Premium • “Only”$150.000 / year • 1 Billion hits processed per month • Unsampled reports • Data processing every 4 hours 15
  • 16. 2. Create custom profiles • Instead of creating reports with specific advanced segments, create custom profiles using filters • The default reports of all profiles are always unsampled, even if the visitors are more than 500,000 16
  • 17. 3. Enable Data Sampling • Sample your data , by adding a line in the Google Analytics tracking code code_gaq.push(['_setSampleRate', '80']); Sets sampling rate at 80% • Not a perfect solution, as the data are still sampled, but you have control and can avoid tracking interruption (for more than 10 million hits per month) 17
  • 18. 4. Use smaller date range • Break you report in smaller data ranges, each one having less than 500.000 visits • This ensures that the data are unsampled • Export the reports using the Google Analytics API • Aggregate the data in Excel and create the master report 18
  • 19. 5. Use analyticscanvas.com • Analytics Canvas offers query partitioning, using the Google Analytics API. • Reports are exported in smaller date ranges, so that they are unsampled and they are then merged automatically with Analytics Canvas. 19
  • 20. 6. Download Google Analytics data locally • It is possible to keep a local copy of Google Analytics data • Add a line in the Google Analytics tracking code _gaq.push(['_setLocalRemoteServerMode']); • Add _utm.gif to your web server root 20
  • 21. 6. Download Google Analytics data locally 86.138.209.96 www.mysite.com - [01/Oct/2007:03:34:02 +0100] "GET /__utm.gif?utmwv=1&utmt=var&utmn= 2108116629 HTTP/1.1" 200 35 "http://www.mysite.com/pageX.htm" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)" "__utma=1.117971038.1175394730.1175394730.1175394730.1; __utmb=1; __utmc=1; __utmz=1.1175394730.1.1.utmcid=23|utmgclid=CP-Bssq-oIsCFQMrlAodeUThgA| utmccn=(not+set)|utmcmd=(not+set)|utmctr=looking+for+site; __utmv=1.Section One" 21 • Data are recorded in the server log files • Use http://analytics.angelfishstats.com/ to analyze them, as Urchin is discontiniued