SlideShare a Scribd company logo
1 of 43
Download to read offline
javier ramirez
@supercoco9
Get more from
Analytics with
Google BigQuery
datawaki
about me
19 years working on software: banking, e-commerce,
government, CMS, start-ups...
founder of
https://datawaki.com
https://teowaki.com
Google Developer Expert on
the Cloud Platform
mail: hello@datawaki.com twitter: @supercoco9
datawaki
BigQuery
is awes..
I use Google
Analytics
javier ramirez @supercoco9 https://datawaki.com
Isn't Google Analytics
good enough?
javier ramirez @supercoco9 https://datawaki.com
Google Analytics is great but...
It lets you access aggregated data and reports, not
individual sessions/visits data.
Even premium accounts get sampled reports when
there are too many data (and not all the reports can
be unsampled).
javier ramirez @supercoco9 https://datawaki.com
Google Analytics is great but...
If you need to manage many different segments, and
if you want to combine segments, it can get tricky.
Moreover, you can only segment or create reports
using the pre-defined filters, which might or not be
enough for you*.
*even if segments have experienced a huge
improvement with Universal Analytics
javier ramirez @supercoco9 https://datawaki.com
Google Analytics is great but...
It's not easy to cross data in Analytics with data from
other sources (CRM, invoicing system...)
Now you can use Import Data from Universal
Analytics, but there are many constraints to what
you can do
javier ramirez @supercoco9 https://datawaki.com
Google Analytics is great but...
Good for knowing what's happening in your
application, but difficult for:
* business intelligence/big data (data mining,
find patterns...)
* machine learning (classify information,
predict future trends...)
javier ramirez @supercoco9 https://datawaki.com
big data -analysed and organised
into information- has big value
BIG
DATA
HIPSTER
avier ramirez @supercoco9 https://datawaki.com
data that exceeds the
processing capacity of
conventional database
systems. The data is too big,
moves too fast, or doesn’t fit
the structures of your
database architectures.
Ed Dumbill
program chair for the O’Reilly Strata Conference
javier ramirez @supercoco9 https://datawaki.com
bigdata is cool but...
expensive cluster
hard to set up and monitor
not interactive enough
What if I could...
..query billions of rows in seconds..
..using a SQL-like web interface..
..on a fully managed cloud..
..paying only when I use it?
javier ramirez @supercoco9 https://datawaki.com
Designed to run analytics
over huge volumes of raw
data, and to integrate
with other data sources
javier ramirez @supercoco9 https://datawaki.com
Google BigQuery
Google BigQuery
Data analysis as a service
https://cloud.google.com/products/bigquery/
javier ramirez @supercoco9 https://datawaki.com
one
more
thing
Google Analytics Premium
users get free daily
exports from GA to
BigQuery.
javier ramirez @supercoco9 https://datawaki.com
Google BigQuery + GA Premium
All your raw data.
Unsampled.
Use it however you want.
BOOM!
javier ramirez @supercoco9 https://datawaki.com
Google BigQuery + GA Premium
o'reilly
khan academy
US Cellular Case Study
5th
largest US telecommunications company
over 10.6 million customers
They didn't know how many offline (in-store +
telesales) sales were originated by online media.
After combining GA data with other internal data,
they can more accurately attribute sales to
digital channel (website, social, search and display)
It helps them optimize their campaign and forecast
sales
data
schema
javier ramirez @supercoco9
it's just SQL
javier ramirez @supercoco9 https://datawaki.com
SELECT trafficSource.source, SUM( totals.transactions ) AS total_transactions
FROM playground.ga_sessions_20140621
GROUP BY trafficSource.source
ORDER BY total_transactions;
basic queries (metric/dimension)
SELECT device.isMobile, SUM ( totals.pageviews ) AS total_pageviews
FROM playground.ga_sessions_20140621
GROUP BY device.isMobile
ORDER BY total_pageviews;
SELECT
IF(DOMAIN(trafficSource.source) is null,
trafficSource.source,
DOMAIN(trafficSource.source))
AS normalized_source,
SUM ( totals.transactions ) AS total_transactions
FROM playground.ga_sessions_20140621
GROUP BY normalized_source
ORDER BY total_transactions;
basic queries
with a twist
SELECT ( SUM(total_transactionrevenue_per_user) / SUM(total_visits_per_user) )
AS avg_revenue_by_user_per_visit
FROM (
SELECT SUM(totals.visits) AS total_visits_per_user,
SUM( totals.transactionRevenue ) AS total_transactionrevenue_per_user,
visitorId
FROM playground.ga_sessions_20140621
WHERE totals.visits>0
AND totals.transactions>=1
AND totals.transactionRevenue IS NOT NULL
GROUP BY visitorId ) ;
Average amount spent per visit
2 segments, combined
SELECT hits.item.productName AS other_purchased_products,
COUNT(hits.item.productName) AS quantity
FROM playground.ga_sessions_20140621
WHERE fullVisitorId IN (
SELECT fullVisitorId
FROM playground.ga_sessions_20140621
WHERE hits.item.productName CONTAINS 'Light Helmet'
AND totals.transactions>=1
GROUP BY fullVisitorId )
AND hits.item.productName IS NOT NULL
AND hits.item.productName !='Light Helmet'
GROUP BY other_purchased_products
ORDER BY quantity DESC;
Users who bought product A,
also bought product B
SELECT prod_name, count(*) as transactions
FROM
(
SELECT fullVisitorId, min(date) AS date, visitId,
hits.item.productName as prod_name
FROM (
SELECT fullVisitorId, date, visitId,
totals.transactions,
hits.item.productName FROM
(TABLE_DATE_RANGE([dataset.ga_sessions_],
TIMESTAMP('2014-06-01'),
TIMESTAMP('2014-06-14')))
)
WHERE fullVisitorId IN
(
SELECT fullVisitorId
FROM (TABLE_DATE_RANGE([dataset.ga_sessions_],
TIMESTAMP('2014-06-01'),
TIMESTAMP('2014-06-14')))
GROUP BY fullVisitorId
HAVING SUM(totals.transactions) > 1
)
AND hits.item.productName IS NOT NULL
GROUP BY fullVisitorId, visitId, prod_name ORDER BY
fullVisitorId DESC
)
GROUP BY prod_name ORDER BY transactions DESC;
* example query from the lunametrics blog. Check them out for more awesomeness
Products that
are purchased
and lead to
other products
being purchased
SELECT fullvisitorID, visitID, visitNumber, hits.page.pagePath
FROM playground.ga_sessions_20140621
where hits.type='PAGE'
order by fullvisitorID, visitID, hits.hitnumber asc
Identify user path/user actions
individual users data is awesome
Cross CRM data with individual users actions to see
how your response to incidents affect your users.
Use the “frequently bought together” query and find
users who didn't buy the related products. Send an
e-mail campaign with an offer for those products.
integrating with external
data sources
* Connectors/REST API
* Export into GCS
* Import into BigQuery
javier ramirez @supercoco9 https://datawaki.com
Working with external data
BigQuery pricing
$20 per stored TB
$5 per processed TB
*the 1st
TB every month is free of charge
** GA premium get $500 free credit monthly
javier ramirez @supercoco9 https://datawaki.com
for GA premium users
BigQuery is effectively
for free
*unless you upload huge external data or make
huge queries
javier ramirez @supercoco9 https://datawaki.com
What if I don't have
a GA Premium
Account?
just send your own data
javier ramirez @supercoco9 https://datawaki.com
define a data structure that fits your needs
(or replicate the one GA provides)
use a JS snippet to send data to your server, then
to BigQuery
or use
javier ramirez @supercoco9 https://datawaki.com
Just add an extra snippet to your GA
datawaki
javier ramirez @supercoco9 https://datawaki.com
send data from any other source (CRM, back-end,
sensors, mobile apps, log system, external tools...)
datawaki
Postgre
SQL
Log
NGINX
Log
NGINX
Logstash
Redis
BigQuery
Neo4j
Ruby
Worker
Ruby
Worker
Rails
App
Alert
system
datawaki in a nutshell
Report
system
user
interaction
data
input
javier ramirez @supercoco9 https://datawaki.com
* Get full access to your data
* Receive reports by e-mail
* Get individual or group alerts
* if there is a purchase over $1000
* if a user has visited a product page over
20 times in one week and didn't buy
* if a product is seen over 200 times one hour
* every time a product reaches 5000 views
datawaki
Want to know more?
https://cloud.google.com/products/bigquery/
Need help?
https://teowaki.com/services
Thanks!
Gracias
Javier Ramírez
@supercoco9
datawaki

More Related Content

Similar to Get more from Analytics with Google BigQuery - Javier Ramirez - Datawaki- BBVACI

Big query the first step - (MOSG)
Big query the first step - (MOSG)Big query the first step - (MOSG)
Big query the first step - (MOSG)Soshi Nemoto
 
Supercharge your data analytics with BigQuery
Supercharge your data analytics with BigQuerySupercharge your data analytics with BigQuery
Supercharge your data analytics with BigQueryMárton Kodok
 
Google Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress MeetupGoogle Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress MeetupRich Plakas
 
Top 10 Tips for Google Tag Manager
Top 10 Tips for Google Tag ManagerTop 10 Tips for Google Tag Manager
Top 10 Tips for Google Tag ManagerAnna Lewis
 
Why Big Query is so Powerful - Trusted Conf
Why Big Query is so Powerful - Trusted ConfWhy Big Query is so Powerful - Trusted Conf
Why Big Query is so Powerful - Trusted ConfIn Marketing We Trust
 
Structured Data & Schema.org - SMX Milan 2014
Structured Data & Schema.org - SMX Milan 2014Structured Data & Schema.org - SMX Milan 2014
Structured Data & Schema.org - SMX Milan 2014Bastian Grimm
 
Usando metodologías ágiles en UX
Usando metodologías ágiles en UX Usando metodologías ágiles en UX
Usando metodologías ágiles en UX Paradigma Digital
 
SEO Tactics for Startups - Making Your Funnel Work program - Startupbootcamp
SEO Tactics for Startups - Making Your Funnel Work program - StartupbootcampSEO Tactics for Startups - Making Your Funnel Work program - Startupbootcamp
SEO Tactics for Startups - Making Your Funnel Work program - StartupbootcampJoost Hoogstrate
 
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...Rittman Analytics
 
Google Analytics for Beginners - Training
Google Analytics for Beginners - TrainingGoogle Analytics for Beginners - Training
Google Analytics for Beginners - TrainingRuben Vezzoli
 
Florian Pertynski session at Google Partner Summit Review
Florian Pertynski session at Google Partner Summit Review Florian Pertynski session at Google Partner Summit Review
Florian Pertynski session at Google Partner Summit Review IIHEvents
 
How to design and implement a data ops architecture with sdc and gcp
How to design and implement a data ops architecture with sdc and gcpHow to design and implement a data ops architecture with sdc and gcp
How to design and implement a data ops architecture with sdc and gcpJoseph Arriola
 
Google Analytics for Beginners.pdf
Google Analytics for Beginners.pdfGoogle Analytics for Beginners.pdf
Google Analytics for Beginners.pdfAnaGabunia3
 
Ga premium bigquery-integration
Ga premium bigquery-integrationGa premium bigquery-integration
Ga premium bigquery-integrationStefan Xhunga
 
Ai based analytics in the cloud
Ai based analytics in the cloudAi based analytics in the cloud
Ai based analytics in the cloudSvetlin Stanchev
 
Gits class #22: [ONLINE] Analyze Your User's Activities Using BigQuery and Da...
Gits class #22: [ONLINE] Analyze Your User's Activities Using BigQuery and Da...Gits class #22: [ONLINE] Analyze Your User's Activities Using BigQuery and Da...
Gits class #22: [ONLINE] Analyze Your User's Activities Using BigQuery and Da...GITS Indonesia
 
What's in my SEO Toolbox: Linkbuilding Edition - SMX Milan 2014
What's in my SEO Toolbox: Linkbuilding Edition - SMX Milan 2014What's in my SEO Toolbox: Linkbuilding Edition - SMX Milan 2014
What's in my SEO Toolbox: Linkbuilding Edition - SMX Milan 2014Bastian Grimm
 
Turbocharging Google Analytics
Turbocharging Google AnalyticsTurbocharging Google Analytics
Turbocharging Google AnalyticsDana DiTomaso
 
Log analysis and pro use cases for search marketers online version (1)
Log analysis and pro use cases for search marketers online version (1)Log analysis and pro use cases for search marketers online version (1)
Log analysis and pro use cases for search marketers online version (1)David Sottimano
 

Similar to Get more from Analytics with Google BigQuery - Javier Ramirez - Datawaki- BBVACI (20)

Big query the first step - (MOSG)
Big query the first step - (MOSG)Big query the first step - (MOSG)
Big query the first step - (MOSG)
 
Supercharge your data analytics with BigQuery
Supercharge your data analytics with BigQuerySupercharge your data analytics with BigQuery
Supercharge your data analytics with BigQuery
 
Google Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress MeetupGoogle Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
 
Top 10 Tips for Google Tag Manager
Top 10 Tips for Google Tag ManagerTop 10 Tips for Google Tag Manager
Top 10 Tips for Google Tag Manager
 
Why Big Query is so Powerful - Trusted Conf
Why Big Query is so Powerful - Trusted ConfWhy Big Query is so Powerful - Trusted Conf
Why Big Query is so Powerful - Trusted Conf
 
Structured Data & Schema.org - SMX Milan 2014
Structured Data & Schema.org - SMX Milan 2014Structured Data & Schema.org - SMX Milan 2014
Structured Data & Schema.org - SMX Milan 2014
 
Big query
Big queryBig query
Big query
 
Usando metodologías ágiles en UX
Usando metodologías ágiles en UX Usando metodologías ágiles en UX
Usando metodologías ágiles en UX
 
SEO Tactics for Startups - Making Your Funnel Work program - Startupbootcamp
SEO Tactics for Startups - Making Your Funnel Work program - StartupbootcampSEO Tactics for Startups - Making Your Funnel Work program - Startupbootcamp
SEO Tactics for Startups - Making Your Funnel Work program - Startupbootcamp
 
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
 
Google Analytics for Beginners - Training
Google Analytics for Beginners - TrainingGoogle Analytics for Beginners - Training
Google Analytics for Beginners - Training
 
Florian Pertynski session at Google Partner Summit Review
Florian Pertynski session at Google Partner Summit Review Florian Pertynski session at Google Partner Summit Review
Florian Pertynski session at Google Partner Summit Review
 
How to design and implement a data ops architecture with sdc and gcp
How to design and implement a data ops architecture with sdc and gcpHow to design and implement a data ops architecture with sdc and gcp
How to design and implement a data ops architecture with sdc and gcp
 
Google Analytics for Beginners.pdf
Google Analytics for Beginners.pdfGoogle Analytics for Beginners.pdf
Google Analytics for Beginners.pdf
 
Ga premium bigquery-integration
Ga premium bigquery-integrationGa premium bigquery-integration
Ga premium bigquery-integration
 
Ai based analytics in the cloud
Ai based analytics in the cloudAi based analytics in the cloud
Ai based analytics in the cloud
 
Gits class #22: [ONLINE] Analyze Your User's Activities Using BigQuery and Da...
Gits class #22: [ONLINE] Analyze Your User's Activities Using BigQuery and Da...Gits class #22: [ONLINE] Analyze Your User's Activities Using BigQuery and Da...
Gits class #22: [ONLINE] Analyze Your User's Activities Using BigQuery and Da...
 
What's in my SEO Toolbox: Linkbuilding Edition - SMX Milan 2014
What's in my SEO Toolbox: Linkbuilding Edition - SMX Milan 2014What's in my SEO Toolbox: Linkbuilding Edition - SMX Milan 2014
What's in my SEO Toolbox: Linkbuilding Edition - SMX Milan 2014
 
Turbocharging Google Analytics
Turbocharging Google AnalyticsTurbocharging Google Analytics
Turbocharging Google Analytics
 
Log analysis and pro use cases for search marketers online version (1)
Log analysis and pro use cases for search marketers online version (1)Log analysis and pro use cases for search marketers online version (1)
Log analysis and pro use cases for search marketers online version (1)
 

More from javier ramirez

¿Se puede vivir del open source? T3chfest
¿Se puede vivir del open source? T3chfest¿Se puede vivir del open source? T3chfest
¿Se puede vivir del open source? T3chfestjavier ramirez
 
QuestDB: The building blocks of a fast open-source time-series database
QuestDB: The building blocks of a fast open-source time-series databaseQuestDB: The building blocks of a fast open-source time-series database
QuestDB: The building blocks of a fast open-source time-series databasejavier ramirez
 
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...javier ramirez
 
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...javier ramirez
 
Deduplicating and analysing time-series data with Apache Beam and QuestDB
Deduplicating and analysing time-series data with Apache Beam and QuestDBDeduplicating and analysing time-series data with Apache Beam and QuestDB
Deduplicating and analysing time-series data with Apache Beam and QuestDBjavier ramirez
 
Your Database Cannot Do this (well)
Your Database Cannot Do this (well)Your Database Cannot Do this (well)
Your Database Cannot Do this (well)javier ramirez
 
Your Timestamps Deserve Better than a Generic Database
Your Timestamps Deserve Better than a Generic DatabaseYour Timestamps Deserve Better than a Generic Database
Your Timestamps Deserve Better than a Generic Databasejavier ramirez
 
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...javier ramirez
 
QuestDB-Community-Call-20220728
QuestDB-Community-Call-20220728QuestDB-Community-Call-20220728
QuestDB-Community-Call-20220728javier ramirez
 
Processing and analysing streaming data with Python. Pycon Italy 2022
Processing and analysing streaming  data with Python. Pycon Italy 2022Processing and analysing streaming  data with Python. Pycon Italy 2022
Processing and analysing streaming data with Python. Pycon Italy 2022javier ramirez
 
QuestDB: ingesting a million time series per second on a single instance. Big...
QuestDB: ingesting a million time series per second on a single instance. Big...QuestDB: ingesting a million time series per second on a single instance. Big...
QuestDB: ingesting a million time series per second on a single instance. Big...javier ramirez
 
Servicios e infraestructura de AWS y la próxima región en Aragón
Servicios e infraestructura de AWS y la próxima región en AragónServicios e infraestructura de AWS y la próxima región en Aragón
Servicios e infraestructura de AWS y la próxima región en Aragónjavier ramirez
 
Primeros pasos en desarrollo serverless
Primeros pasos en desarrollo serverlessPrimeros pasos en desarrollo serverless
Primeros pasos en desarrollo serverlessjavier ramirez
 
How AWS is reinventing the cloud
How AWS is reinventing the cloudHow AWS is reinventing the cloud
How AWS is reinventing the cloudjavier ramirez
 
Analitica de datos en tiempo real con Apache Flink y Apache BEAM
Analitica de datos en tiempo real con Apache Flink y Apache BEAMAnalitica de datos en tiempo real con Apache Flink y Apache BEAM
Analitica de datos en tiempo real con Apache Flink y Apache BEAMjavier ramirez
 
Getting started with streaming analytics
Getting started with streaming analyticsGetting started with streaming analytics
Getting started with streaming analyticsjavier ramirez
 
Getting started with streaming analytics: Setting up a pipeline
Getting started with streaming analytics: Setting up a pipelineGetting started with streaming analytics: Setting up a pipeline
Getting started with streaming analytics: Setting up a pipelinejavier ramirez
 
Getting started with streaming analytics: Deep Dive
Getting started with streaming analytics: Deep DiveGetting started with streaming analytics: Deep Dive
Getting started with streaming analytics: Deep Divejavier ramirez
 
Getting started with streaming analytics: streaming basics (1 of 3)
Getting started with streaming analytics: streaming basics (1 of 3)Getting started with streaming analytics: streaming basics (1 of 3)
Getting started with streaming analytics: streaming basics (1 of 3)javier ramirez
 
Monitorización de seguridad y detección de amenazas con AWS
Monitorización de seguridad y detección de amenazas con AWSMonitorización de seguridad y detección de amenazas con AWS
Monitorización de seguridad y detección de amenazas con AWSjavier ramirez
 

More from javier ramirez (20)

¿Se puede vivir del open source? T3chfest
¿Se puede vivir del open source? T3chfest¿Se puede vivir del open source? T3chfest
¿Se puede vivir del open source? T3chfest
 
QuestDB: The building blocks of a fast open-source time-series database
QuestDB: The building blocks of a fast open-source time-series databaseQuestDB: The building blocks of a fast open-source time-series database
QuestDB: The building blocks of a fast open-source time-series database
 
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...
 
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
 
Deduplicating and analysing time-series data with Apache Beam and QuestDB
Deduplicating and analysing time-series data with Apache Beam and QuestDBDeduplicating and analysing time-series data with Apache Beam and QuestDB
Deduplicating and analysing time-series data with Apache Beam and QuestDB
 
Your Database Cannot Do this (well)
Your Database Cannot Do this (well)Your Database Cannot Do this (well)
Your Database Cannot Do this (well)
 
Your Timestamps Deserve Better than a Generic Database
Your Timestamps Deserve Better than a Generic DatabaseYour Timestamps Deserve Better than a Generic Database
Your Timestamps Deserve Better than a Generic Database
 
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...
 
QuestDB-Community-Call-20220728
QuestDB-Community-Call-20220728QuestDB-Community-Call-20220728
QuestDB-Community-Call-20220728
 
Processing and analysing streaming data with Python. Pycon Italy 2022
Processing and analysing streaming  data with Python. Pycon Italy 2022Processing and analysing streaming  data with Python. Pycon Italy 2022
Processing and analysing streaming data with Python. Pycon Italy 2022
 
QuestDB: ingesting a million time series per second on a single instance. Big...
QuestDB: ingesting a million time series per second on a single instance. Big...QuestDB: ingesting a million time series per second on a single instance. Big...
QuestDB: ingesting a million time series per second on a single instance. Big...
 
Servicios e infraestructura de AWS y la próxima región en Aragón
Servicios e infraestructura de AWS y la próxima región en AragónServicios e infraestructura de AWS y la próxima región en Aragón
Servicios e infraestructura de AWS y la próxima región en Aragón
 
Primeros pasos en desarrollo serverless
Primeros pasos en desarrollo serverlessPrimeros pasos en desarrollo serverless
Primeros pasos en desarrollo serverless
 
How AWS is reinventing the cloud
How AWS is reinventing the cloudHow AWS is reinventing the cloud
How AWS is reinventing the cloud
 
Analitica de datos en tiempo real con Apache Flink y Apache BEAM
Analitica de datos en tiempo real con Apache Flink y Apache BEAMAnalitica de datos en tiempo real con Apache Flink y Apache BEAM
Analitica de datos en tiempo real con Apache Flink y Apache BEAM
 
Getting started with streaming analytics
Getting started with streaming analyticsGetting started with streaming analytics
Getting started with streaming analytics
 
Getting started with streaming analytics: Setting up a pipeline
Getting started with streaming analytics: Setting up a pipelineGetting started with streaming analytics: Setting up a pipeline
Getting started with streaming analytics: Setting up a pipeline
 
Getting started with streaming analytics: Deep Dive
Getting started with streaming analytics: Deep DiveGetting started with streaming analytics: Deep Dive
Getting started with streaming analytics: Deep Dive
 
Getting started with streaming analytics: streaming basics (1 of 3)
Getting started with streaming analytics: streaming basics (1 of 3)Getting started with streaming analytics: streaming basics (1 of 3)
Getting started with streaming analytics: streaming basics (1 of 3)
 
Monitorización de seguridad y detección de amenazas con AWS
Monitorización de seguridad y detección de amenazas con AWSMonitorización de seguridad y detección de amenazas con AWS
Monitorización de seguridad y detección de amenazas con AWS
 

Recently uploaded

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 

Recently uploaded (20)

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 

Get more from Analytics with Google BigQuery - Javier Ramirez - Datawaki- BBVACI

  • 1. javier ramirez @supercoco9 Get more from Analytics with Google BigQuery datawaki
  • 2. about me 19 years working on software: banking, e-commerce, government, CMS, start-ups... founder of https://datawaki.com https://teowaki.com Google Developer Expert on the Cloud Platform mail: hello@datawaki.com twitter: @supercoco9 datawaki
  • 3. BigQuery is awes.. I use Google Analytics
  • 4. javier ramirez @supercoco9 https://datawaki.com Isn't Google Analytics good enough?
  • 5. javier ramirez @supercoco9 https://datawaki.com Google Analytics is great but... It lets you access aggregated data and reports, not individual sessions/visits data. Even premium accounts get sampled reports when there are too many data (and not all the reports can be unsampled).
  • 6. javier ramirez @supercoco9 https://datawaki.com Google Analytics is great but... If you need to manage many different segments, and if you want to combine segments, it can get tricky. Moreover, you can only segment or create reports using the pre-defined filters, which might or not be enough for you*. *even if segments have experienced a huge improvement with Universal Analytics
  • 7. javier ramirez @supercoco9 https://datawaki.com Google Analytics is great but... It's not easy to cross data in Analytics with data from other sources (CRM, invoicing system...) Now you can use Import Data from Universal Analytics, but there are many constraints to what you can do
  • 8. javier ramirez @supercoco9 https://datawaki.com Google Analytics is great but... Good for knowing what's happening in your application, but difficult for: * business intelligence/big data (data mining, find patterns...) * machine learning (classify information, predict future trends...)
  • 9. javier ramirez @supercoco9 https://datawaki.com big data -analysed and organised into information- has big value
  • 11. data that exceeds the processing capacity of conventional database systems. The data is too big, moves too fast, or doesn’t fit the structures of your database architectures. Ed Dumbill program chair for the O’Reilly Strata Conference javier ramirez @supercoco9 https://datawaki.com
  • 12. bigdata is cool but... expensive cluster hard to set up and monitor not interactive enough
  • 13. What if I could... ..query billions of rows in seconds.. ..using a SQL-like web interface.. ..on a fully managed cloud.. ..paying only when I use it? javier ramirez @supercoco9 https://datawaki.com
  • 14. Designed to run analytics over huge volumes of raw data, and to integrate with other data sources javier ramirez @supercoco9 https://datawaki.com Google BigQuery
  • 15. Google BigQuery Data analysis as a service https://cloud.google.com/products/bigquery/ javier ramirez @supercoco9 https://datawaki.com
  • 17. Google Analytics Premium users get free daily exports from GA to BigQuery. javier ramirez @supercoco9 https://datawaki.com Google BigQuery + GA Premium
  • 18. All your raw data. Unsampled. Use it however you want. BOOM! javier ramirez @supercoco9 https://datawaki.com Google BigQuery + GA Premium
  • 19.
  • 22. US Cellular Case Study 5th largest US telecommunications company over 10.6 million customers They didn't know how many offline (in-store + telesales) sales were originated by online media. After combining GA data with other internal data, they can more accurately attribute sales to digital channel (website, social, search and display) It helps them optimize their campaign and forecast sales
  • 24. it's just SQL javier ramirez @supercoco9 https://datawaki.com
  • 25. SELECT trafficSource.source, SUM( totals.transactions ) AS total_transactions FROM playground.ga_sessions_20140621 GROUP BY trafficSource.source ORDER BY total_transactions; basic queries (metric/dimension) SELECT device.isMobile, SUM ( totals.pageviews ) AS total_pageviews FROM playground.ga_sessions_20140621 GROUP BY device.isMobile ORDER BY total_pageviews;
  • 26. SELECT IF(DOMAIN(trafficSource.source) is null, trafficSource.source, DOMAIN(trafficSource.source)) AS normalized_source, SUM ( totals.transactions ) AS total_transactions FROM playground.ga_sessions_20140621 GROUP BY normalized_source ORDER BY total_transactions; basic queries with a twist
  • 27. SELECT ( SUM(total_transactionrevenue_per_user) / SUM(total_visits_per_user) ) AS avg_revenue_by_user_per_visit FROM ( SELECT SUM(totals.visits) AS total_visits_per_user, SUM( totals.transactionRevenue ) AS total_transactionrevenue_per_user, visitorId FROM playground.ga_sessions_20140621 WHERE totals.visits>0 AND totals.transactions>=1 AND totals.transactionRevenue IS NOT NULL GROUP BY visitorId ) ; Average amount spent per visit
  • 29. SELECT hits.item.productName AS other_purchased_products, COUNT(hits.item.productName) AS quantity FROM playground.ga_sessions_20140621 WHERE fullVisitorId IN ( SELECT fullVisitorId FROM playground.ga_sessions_20140621 WHERE hits.item.productName CONTAINS 'Light Helmet' AND totals.transactions>=1 GROUP BY fullVisitorId ) AND hits.item.productName IS NOT NULL AND hits.item.productName !='Light Helmet' GROUP BY other_purchased_products ORDER BY quantity DESC; Users who bought product A, also bought product B
  • 30. SELECT prod_name, count(*) as transactions FROM ( SELECT fullVisitorId, min(date) AS date, visitId, hits.item.productName as prod_name FROM ( SELECT fullVisitorId, date, visitId, totals.transactions, hits.item.productName FROM (TABLE_DATE_RANGE([dataset.ga_sessions_], TIMESTAMP('2014-06-01'), TIMESTAMP('2014-06-14'))) ) WHERE fullVisitorId IN ( SELECT fullVisitorId FROM (TABLE_DATE_RANGE([dataset.ga_sessions_], TIMESTAMP('2014-06-01'), TIMESTAMP('2014-06-14'))) GROUP BY fullVisitorId HAVING SUM(totals.transactions) > 1 ) AND hits.item.productName IS NOT NULL GROUP BY fullVisitorId, visitId, prod_name ORDER BY fullVisitorId DESC ) GROUP BY prod_name ORDER BY transactions DESC; * example query from the lunametrics blog. Check them out for more awesomeness Products that are purchased and lead to other products being purchased
  • 31. SELECT fullvisitorID, visitID, visitNumber, hits.page.pagePath FROM playground.ga_sessions_20140621 where hits.type='PAGE' order by fullvisitorID, visitID, hits.hitnumber asc Identify user path/user actions
  • 32. individual users data is awesome Cross CRM data with individual users actions to see how your response to incidents affect your users. Use the “frequently bought together” query and find users who didn't buy the related products. Send an e-mail campaign with an offer for those products.
  • 33. integrating with external data sources * Connectors/REST API * Export into GCS * Import into BigQuery javier ramirez @supercoco9 https://datawaki.com
  • 35. BigQuery pricing $20 per stored TB $5 per processed TB *the 1st TB every month is free of charge ** GA premium get $500 free credit monthly javier ramirez @supercoco9 https://datawaki.com
  • 36. for GA premium users BigQuery is effectively for free *unless you upload huge external data or make huge queries javier ramirez @supercoco9 https://datawaki.com
  • 37. What if I don't have a GA Premium Account?
  • 38. just send your own data javier ramirez @supercoco9 https://datawaki.com define a data structure that fits your needs (or replicate the one GA provides) use a JS snippet to send data to your server, then to BigQuery
  • 39. or use javier ramirez @supercoco9 https://datawaki.com Just add an extra snippet to your GA datawaki
  • 40. javier ramirez @supercoco9 https://datawaki.com send data from any other source (CRM, back-end, sensors, mobile apps, log system, external tools...) datawaki
  • 42. javier ramirez @supercoco9 https://datawaki.com * Get full access to your data * Receive reports by e-mail * Get individual or group alerts * if there is a purchase over $1000 * if a user has visited a product page over 20 times in one week and didn't buy * if a product is seen over 200 times one hour * every time a product reaches 5000 views datawaki
  • 43. Want to know more? https://cloud.google.com/products/bigquery/ Need help? https://teowaki.com/services Thanks! Gracias Javier Ramírez @supercoco9 datawaki