Snowplow: open source
game analytics powered
by AWS
Hello! We’re Alex and Yali.
We created Snowplow
• We cofounded Snowplow
• Open source event data pipeline built on
AWS tech
• Collect granular, rich, event-level data
across digital platforms
• Validate, enrich, model and deliver that
data to the places it can be analysed and
acted on
Wonder at what the data made possible
drove us to create Snowplow
• Digital event data is rich, behavioral information on how millions of people do
things (play, work, socialize, flirt, unwind etc.) collected at scale
• Endless possibilities to ask and answer different questions, build intelligence
and act on that intelligence
• Packaged solutions do a poor job of enabling companies to realise all the
different possibilities presented by this data
• Lots of companies build their own event data pipelines to realise those
possibilities. If we can build a standard pipeline, companies can focus on
doing stuff with the data
A call to arms for games
analysts
Games companies are typically
very analytically sophisticated
• At an (often early) stage, invest in an event data
warehouse / data pipeline
• Analytics is often very specific to each game:
packaged solutions can only get you so far
• Data sophistication: competitive advantage
• Larger game studios typically have very large data
teams (engineering, science and analysis) and
significant analytics infrastructure that they’ve built
But you don’t need to build your own
event data pipeline from scratch
• We have a tried and tested open-source stack that you can
deploy directly to your own AWS account
• Built on top of AWS services incl. Kinesis, Lambda, Redshift,
Elasticsearch, S3, EMR
• Use your data engineers to build analyses specific to your game,
not to re-build the pipe!
Building high quality event data
pipelines is hard
• Data quality
• Schema evolution
• Enrichment
• Data modeling
Today Snowplow is
used by games
studios…
…And companies in
other sectors
Snowplow and our early
gaming influences
Early work with games studios heavily
influenced our thinking
Flexible data schemas
that evolve!
Event grammar:
events vs entities
Evolving data models:
understanding
sequences of play
Game analytics
has grown up
Game analytics encompasses a lot
• Product analytics: use data to improve the game
• Customer acquisition analytics: sustainably drive user growth
• Game health analytics: monitor the game
• Data-driven applications within the game e.g. player-matching
• Plenty more that is specific to your game
We distinguish between analytics on
read vs analytics on write
Analytics on Read
• Decide on how you want to process the
data at the point of query
• Prioritise having the flexibility to query the
data in a rich / varied way
• De-prioritise query latency
• Example: product analytics
Analytics on Write
• Define in advance how the data will be
queried
• Prioritise low latency
• De-prioritise query flexibility
• Example: game health monitoring
Different architectures are appropriate for the above two cases
With Snowplow, we meet both
requirements via a Lambda Architecture
• Analytics on write: Kinesis + AWS Lambda / Spark Streaming
• Analytics on read: Redshift / Spark / Athena
Analytics on read
Analytics on read example: A/B testing
to drive product development
• Limitless possibilities for experiments
• Wide set of metrics that you might be
looking to influence with each
experiment
• Tracking the experiments should be
easy
• All enabled by the flexibility to
compute segments and metrics after
the fact (at query time)
Delivering the A/B testing framework
with Redshift and/or Spark on EMR
Process
• Product manager defines A/B test in
advance incl. KPI and success
threshold
• Rolling program of tests run each week
• Test history documented
Technology
• Event tracked to indicate that a user is
assigned to a specific group and a
particular experiment is run
• KPI can be measured after the fact
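A minimal sketch of measuring a KPI after the fact, in plain Python rather than Redshift or Spark. The event names (`experiment_assigned`, `purchase`), the field names and the conversion KPI are assumptions made for illustration; they are not taken from the slides or from Snowplow's schemas.

```python
from collections import defaultdict

# Hypothetical event stream: assignment events mixed with ordinary game events.
events = [
    {"event_name": "experiment_assigned", "user_name": "u1",
     "experiment": "new_tutorial", "group": "control"},
    {"event_name": "experiment_assigned", "user_name": "u2",
     "experiment": "new_tutorial", "group": "variant"},
    {"event_name": "experiment_assigned", "user_name": "u3",
     "experiment": "new_tutorial", "group": "variant"},
    {"event_name": "purchase", "user_name": "u2"},
    {"event_name": "purchase", "user_name": "u2"},
    {"event_name": "level_complete", "user_name": "u1"},
]


def conversion_by_group(events, experiment, kpi_event):
    """Share of users in each group who fired kpi_event at least once."""
    # Step 1: recover group membership from the assignment events.
    group_of = {}
    for e in events:
        if e["event_name"] == "experiment_assigned" and e["experiment"] == experiment:
            group_of[e["user_name"]] = e["group"]

    # Step 2: find assigned users who fired the KPI event (ordering is ignored here).
    converted = {
        e["user_name"]
        for e in events
        if e["event_name"] == kpi_event and e["user_name"] in group_of
    }

    # Step 3: conversion rate per group.
    members = defaultdict(set)
    for user, group in group_of.items():
        members[group].add(user)
    return {g: len(users & converted) / len(users) for g, users in members.items()}


rates = conversion_by_group(events, "new_tutorial", "purchase")
print(rates)
```

Because the KPI is just an argument, a different metric can be evaluated over the same history without re-running the experiment, for example `conversion_by_group(events, "new_tutorial", "level_complete")`.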
Analytics on read example 2:
level optimisation analytics
Delivering level analytics with Redshift
and/or Spark on EMR
Process
• Define key metrics to understand player
engagement with each level
• Build out data modeling process to compute
level aggregation on the underlying event
stream
• Extend over time: build out more
sophisticated metrics as understanding of
play evolves
Technology
• Attach level metadata to all events
• Aggregate event-stream in Redshift /
Spark
• Recompute over historical data as new
metrics are developed
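The aggregation step can be sketched as follows, again in plain Python as a stand-in for a Redshift query or Spark job. The event names (`level_start`, `level_complete`) and the two metrics are illustrative assumptions; the only idea taken from the slides is that every event carries its level metadata.

```python
from collections import defaultdict

# Hypothetical events; each one carries the level it happened on.
events = [
    {"event_name": "level_start", "level_name": "dungeon-1", "user_name": "u1"},
    {"event_name": "level_complete", "level_name": "dungeon-1", "user_name": "u1"},
    {"event_name": "level_start", "level_name": "dungeon-1", "user_name": "u2"},
    {"event_name": "level_start", "level_name": "dungeon-2", "user_name": "u1"},
]


def level_metrics(events):
    """Aggregate the event stream into one row of metrics per level."""
    starters = defaultdict(set)
    completers = defaultdict(set)
    for e in events:
        if e["event_name"] == "level_start":
            starters[e["level_name"]].add(e["user_name"])
        elif e["event_name"] == "level_complete":
            completers[e["level_name"]].add(e["user_name"])

    return {
        level: {
            "players": len(users),
            "completion_rate": len(completers[level] & users) / len(users),
        }
        for level, users in starters.items()
    }


metrics = level_metrics(events)
print(metrics)
```

Adding a new metric later means extending `level_metrics` and re-running it over the stored events, which is the recompute-over-history step in the list above.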
AWS provides a rich and growing toolkit
for analytics on read
• EMR enabling Hadoop, Spark, Flink
• Athena
• Redshift
• Elasticsearch Service
Analytics on write
Analytics on write example 1:
Surface aggregate play data in the game
• https://next.codecombat.com/play/dungeon
Delivering aggregate play data into the game with
Kinesis, Lambda and DynamoDB
Example: calculating # of users live on each level now
Elegantly handle computing complex metrics (count distincts) in real-time
Example event in the Kinesis event stream:
{…},
{ event_name: e,
level_name: l,
user_name: u,
timestamp: t },
{…}
Pipeline:
1. Kinesis event stream
2. AWS Lambda: compute player state
3. DynamoDB + stream: player state table, emitting an event stream of updates to player state
4. AWS Lambda: compute level state
5. DynamoDB: level state table
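The two-step flow above (player state first, then level state) can be sketched in Python. This is an illustration under assumptions: two in-memory dicts stand in for the DynamoDB tables, direct function calls stand in for the Lambda invocations and the DynamoDB stream, and players who leave the game are not expired.

```python
# In-memory stand-ins for the two DynamoDB tables (assumption for illustration).
player_state = {}  # user_name -> level the player is currently on
level_state = {}   # level_name -> number of players currently on that level


def compute_player_state(event):
    """Step 1: update the player's current level; return the change, or None."""
    user, new_level = event["user_name"], event["level_name"]
    old_level = player_state.get(user)
    if old_level == new_level:
        return None  # player has not moved, nothing to propagate
    player_state[user] = new_level
    return {"user_name": user, "old_level": old_level, "new_level": new_level}


def compute_level_state(change):
    """Step 2: move the player from the old level's count to the new one."""
    if change["old_level"] is not None:
        level_state[change["old_level"]] -= 1
    level_state[change["new_level"]] = level_state.get(change["new_level"], 0) + 1


def handle(event):
    """Run both steps for one incoming event."""
    change = compute_player_state(event)
    if change is not None:
        compute_level_state(change)


for e in [
    {"event_name": "level_start", "level_name": "dungeon-1", "user_name": "u1", "timestamp": 1},
    {"event_name": "level_start", "level_name": "dungeon-1", "user_name": "u2", "timestamp": 2},
    {"event_name": "move", "level_name": "dungeon-1", "user_name": "u1", "timestamp": 3},
    {"event_name": "level_start", "level_name": "dungeon-2", "user_name": "u1", "timestamp": 4},
]:
    handle(e)

print(level_state)
```

Keeping one row per player is what makes the distinct count cheap: a level's counter only changes when a player's state changes, so repeated events from the same player on the same level are never double counted.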
Analytics on write example 2:
Tiered support based on player LTV
Triage user based on expected LTV
1. Standard user: minimise support cost
2. Silver user: personalised service
3. Platinum user: concierge service
Delivering tiered support using Kinesis, Lambda,
DynamoDB and API Gateway
Example: computing customer lifetime value and serving from customer API
Example event in the Kinesis event stream:
{…},
{ event_name: e,
user_name: u,
transaction_value: v,
timestamp: t },
{…}
Pipeline:
1. Kinesis event stream
2. AWS Lambda: compute player lifetime value
3. DynamoDB + stream: player state table
4. API Gateway: serve player state
5. Triage player support tier
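A sketch of the lifetime value and triage steps, assuming that spend to date is used as a simple proxy for expected LTV. The threshold values and function names are hypothetical (the slides give no figures), and a dict stands in for the player state table that the API would read from.

```python
from collections import defaultdict

# Hypothetical tier thresholds in the game's currency; not taken from the slides.
SILVER_THRESHOLD = 50.0
PLATINUM_THRESHOLD = 500.0

# Stand-in for the player state table: user_name -> spend to date.
lifetime_value = defaultdict(float)


def record_transaction(event):
    """Fold one transaction event into the player's running lifetime value."""
    lifetime_value[event["user_name"]] += event["transaction_value"]


def support_tier(user_name):
    """What a customer API could return when a support ticket is opened."""
    ltv = lifetime_value.get(user_name, 0.0)  # .get avoids creating empty rows
    if ltv >= PLATINUM_THRESHOLD:
        return "platinum"
    if ltv >= SILVER_THRESHOLD:
        return "silver"
    return "standard"


for e in [
    {"event_name": "purchase", "user_name": "u1", "transaction_value": 20.0, "timestamp": 1},
    {"event_name": "purchase", "user_name": "u1", "transaction_value": 40.0, "timestamp": 2},
    {"event_name": "purchase", "user_name": "u2", "transaction_value": 600.0, "timestamp": 3},
]:
    record_transaction(e)

print(support_tier("u1"), support_tier("u2"), support_tier("u3"))
```

Because the value is maintained on write, the tier lookup at support time is a single key read rather than a query over the player's full transaction history.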
AWS provides a rich and growing toolkit
for analytics on write
Stream processing frameworks
• Spark Streaming on EMR
• Kinesis Client Library
Serverless event processing
• AWS Lambda
• Kinesis Analytics
Design considerations for
game analytics
1. Keep your analytics stack
independent from your game’s stack
Evolve game and analytics independently
• Helpful for larger teams
• Reduce fragility
Best of breed components for analytics and game
• Limited overlap between best tools for game engines and
best for event analytics
Handle order of magnitude different scale requirements
• Game event volumes will dwarf active game data
2. Develop your analytics on read first,
then migrate them to on write
• Example: customer acquisition model: set bid prices for different
user cohorts
• Model developed, tested and trained on historical data in data
warehouse
• Model then put live on real-time data in-stream
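One way to make that migration straightforward is to write the logic as a single update function that can be folded over history (on read) and then applied event by event (on write). The sketch below uses revenue per cohort as a deliberately simple stand-in for the acquisition model; the `cohort` field and the sample values are assumptions for illustration.

```python
from functools import reduce


def update(state, event):
    """Fold one event into per-cohort revenue totals and return the new state."""
    new_state = dict(state)
    cohort = event["cohort"]
    new_state[cohort] = new_state.get(cohort, 0.0) + event["transaction_value"]
    return new_state


# Hypothetical historical events sitting in the data warehouse.
history = [
    {"cohort": "paid_social", "transaction_value": 5.0},
    {"cohort": "organic", "transaction_value": 10.0},
    {"cohort": "paid_social", "transaction_value": 2.5},
]

# On read: develop and test by recomputing over the full history.
batch_state = reduce(update, history, {})

# On write: apply the same function to each event as it arrives in-stream.
live_state = {}
for event in history:
    live_state = update(live_state, event)

print(batch_state)
```

Since both paths share `update`, the in-stream result can be checked against the warehouse result before the live version is trusted.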
3. Have a formal framework for
managing change
• Change is inevitable through the lifetime of the game:
• The game evolves
• Analysts and scientists ask new questions of the game
• The analytics team must agree a framework to handle:
• Updates to the in-game event and entity schemas (affects the
developers)
• Evolution of the event data modeling (affects the wider company)
A call to arms for games
analysts
Standardise on your event data pipeline
• Why re-invent the wheel?
• Deploy our tried and tested open-source stack, directly in your
AWS account
• Use your data engineers to build analyses specific to your game,
not to re-build the pipe!
Learn more
• http://snowplowanalytics.com
• https://github.com/snowplow/snowplow
Thank you for attending #AmazonDevDay, please take a moment to
complete our survey for a chance to win the grand prize.
bit.ly/DevDaySurvey
Q&A will be in a room on the third floor

Halmar dropshipping via API with DroFx
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 

Snowplow: open source game analytics powered by AWS

  • 1. Snowplow: open source game analytics powered by AWS
  • 2. Hello! We’re Alex and Yali. We created Snowplow • We cofounded Snowplow • Open source event data pipeline built on AWS tech • Collect granular, rich, event-level data across digital platforms • Validate, enrich, model and deliver that data to the places it can be analysed and acted on
  • 3. Wonder at what the data made possible drove us to create Snowplow • Digital event data is rich, behavioral information on how millions of people do things (play, work, socialize, flirt, unwind etc.) collected at scale • Endless possibilities to ask and answer different questions, build intelligence and act on that intelligence • Packaged solutions do a poor job of enabling companies to realise all the different possibilities presented by this data • Lots of companies build their own event data pipelines to realise those possibilities. If we can build a standard pipeline, companies can focus on doing stuff with the data
  • 4. A call to arms for games analysts
  • 5. Games companies are typically very analytically sophisticated • At an (often early) stage, they invest in an event data warehouse / data pipeline • Analytics is often very specific to each game: packaged solutions can only get you so far • Data sophistication is a competitive advantage • Larger game studios typically have very large data teams (engineering, science and analysis) and significant analytics infrastructure that they’ve built
  • 6. But you don’t need to build your own event data pipeline from scratch • We have a tried and tested open-source stack that you can deploy directly to your own AWS account • Built on top of AWS services including Kinesis, Lambda, Redshift, Elasticsearch, S3 and EMR • Use your data engineers to build analyses specific to your game, not to re-build the pipe!
  • 7. Building high quality event data pipelines is hard • Data quality • Schema evolution • Enrichment • Data modeling
  • 8. Today Snowplow is used by games studios… and by companies in other sectors
  • 9. Snowplow and our early gaming influences
  • 10. Early work with games studios heavily influenced our thinking • Flexible data schemas that evolve! • Event grammar: events vs entities • Evolving data models: understanding sequences of play
  • 12. Game analytics encompasses a lot • Product analytics: use data to improve the game • Customer acquisition analytics: sustainably drive user growth • Game health analytics: monitor the game • Data-driven applications within the game e.g. player-matching • Plenty more that is specific to your game
  • 13. We distinguish between analytics on read vs analytics on write. Analytics on read: • Decide how you want to process the data at the point of query • Prioritise having the flexibility to query the data in a rich / varied way • De-prioritise query latency • Example: product analytics. Analytics on write: • Define in advance how the data will be queried • Prioritise low latency • De-prioritise query flexibility • Example: game health monitoring. Different architectures are appropriate for these two cases
  • 14. With Snowplow, we meet both requirements via a Lambda Architecture • Analytics on write: Kinesis + AWS Lambda / Spark Streaming • Analytics on read: Redshift / Spark / Athena
  • 16. Analytics on read example: A/B testing to drive product development • Limitless possibilities for experiments • Wide set of metrics that you might be looking to influence with each experiment • Tracking the experiments should be easy • All enabled by the flexibility to compute segments and metrics after the fact (at query time)
  • 17. Delivering the A/B testing framework with Redshift and/or Spark on EMR. Process: • Product manager defines A/B test in advance, including KPI and success threshold • Rolling program of tests run each week • Test history documented. Technology: • Event tracked to indicate that a user is assigned to a specific group and a particular experiment is run • KPI can be measured after the fact
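
To illustrate the "KPI can be measured after the fact" point on slide 17, here is a minimal in-memory sketch in Python. It is not Snowplow code and not the Redshift or Spark job the slide refers to: the event names (`experiment_assigned`, `level_completed`), the field names and the sample data are all assumptions made for illustration. The only thing tracked up front is the assignment event; the KPI is chosen at query time.

```python
from collections import defaultdict

# Hypothetical event stream. Field and event names are assumptions for this sketch.
events = [
    {"user": "u1", "name": "experiment_assigned", "experiment": "new_tutorial", "group": "control"},
    {"user": "u2", "name": "experiment_assigned", "experiment": "new_tutorial", "group": "variant"},
    {"user": "u3", "name": "experiment_assigned", "experiment": "new_tutorial", "group": "variant"},
    {"user": "u4", "name": "experiment_assigned", "experiment": "new_tutorial", "group": "control"},
    {"user": "u2", "name": "level_completed"},
    {"user": "u3", "name": "level_completed"},
    {"user": "u4", "name": "level_completed"},
]


def conversion_by_group(events, experiment, kpi_event):
    """Share of assigned users in each group who fired kpi_event at least once."""
    # Pass 1: find which group each user was assigned to for this experiment.
    group_of = {}
    for e in events:
        if e["name"] == "experiment_assigned" and e["experiment"] == experiment:
            group_of[e["user"]] = e["group"]

    assigned = defaultdict(set)
    for user, group in group_of.items():
        assigned[group].add(user)

    # Pass 2: the KPI is picked here, at query time, not at tracking time.
    converted = defaultdict(set)
    for e in events:
        if e["name"] == kpi_event and e["user"] in group_of:
            converted[group_of[e["user"]]].add(e["user"])

    return {group: len(converted[group]) / len(users) for group, users in assigned.items()}


rates = conversion_by_group(events, "new_tutorial", "level_completed")
```

Passing a different `kpi_event` re-evaluates the same experiment against another metric without any change to tracking. The sketch ignores event ordering and statistical significance, both of which a real analysis would need to handle.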
  • 18. Analytics on read example 2: level optimisation analytics
  • 19. Delivering level analytics with Redshift and/or Spark on EMR. Process: • Define key metrics to understand player engagement with each level • Build out data modeling process to compute level aggregation on the underlying event stream • Extend over time: build out more sophisticated metrics as understanding of play evolves. Technology: • Attach level metadata to all events • Aggregate event-stream in Redshift / Spark • Recompute over historical data as new metrics are developed
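
The level aggregation described on slide 19 can be sketched in a few lines of Python. This is an illustration only, standing in for the Redshift / Spark aggregation; the event names (`level_started`, `level_completed`) and the `level` field are assumptions, not part of the deck. Because every event carries its level metadata, the metrics can be recomputed over the full history whenever a new one is added.

```python
from collections import Counter

# Hypothetical events, each carrying level metadata as the slide suggests.
level_events = [
    {"user": "u1", "name": "level_started", "level": "dungeon_1"},
    {"user": "u2", "name": "level_started", "level": "dungeon_1"},
    {"user": "u1", "name": "level_completed", "level": "dungeon_1"},
    {"user": "u1", "name": "level_started", "level": "dungeon_2"},
    {"user": "u1", "name": "level_completed", "level": "dungeon_2"},
]


def level_metrics(events):
    """Aggregate the raw event stream into per-level starts, completions and rate."""
    starts = Counter()
    completions = Counter()
    for e in events:
        if e["name"] == "level_started":
            starts[e["level"]] += 1
        elif e["name"] == "level_completed":
            completions[e["level"]] += 1
    return {
        level: {
            "starts": n,
            "completions": completions[level],
            "completion_rate": completions[level] / n,
        }
        for level, n in starts.items()
    }


metrics = level_metrics(level_events)
```

Adding a new metric (for example, time spent per level) means extending `level_metrics` and rerunning it over the stored events, which is the "recompute over historical data" step.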
  • 20. AWS provides a rich and growing toolkit for analytics on read • EMR enabling Hadoop, Spark, Flink • Athena • Redshift • Elasticsearch Service
  • 22. Analytics on write example 1: Surface aggregate play data in the game • https://next.codecombat.com/play/dungeon
  • 23. Delivering aggregate play data into the game with Kinesis, Lambda and DynamoDB • Example: calculating the number of users live on each level now • Elegantly handles computing complex metrics (count distincts) in real time • Pipeline: Kinesis event stream of events {…}, { event_name: e, level_name: l, user_name: u, timestamp: t }, {…} → AWS Lambda (compute player state) → DynamoDB player state table + stream (event stream of updates to player state) → AWS Lambda (compute level state) → DynamoDB level state table
  • 24. Analytics on write example 2: Tiered support based on player LTV • Triage users based on expected LTV: 1. Standard user: minimise support cost 2. Silver user: personalised service 3. Platinum user: concierge service
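
The two-stage design on slide 23 can be sketched without any AWS services. In the Python below, plain dictionaries stand in for the two DynamoDB tables and ordinary function calls stand in for the two Lambda functions; the function names and sample events are assumptions for illustration. The point it shows is why the intermediate player state table makes a count distinct cheap: the second stage only ever sees changes of level, so it can increment and decrement counters instead of recounting users.

```python
from collections import defaultdict

# In-memory stand-ins for the two tables on the slide (assumption: the real
# pipeline would use DynamoDB; nothing here calls AWS).
player_state = {}                # user_name -> level the user is currently on
level_state = defaultdict(int)   # level_name -> number of users on that level


def update_player_state(event):
    """Stage 1: record the user's level and emit a change record if it moved."""
    user, new_level = event["user_name"], event["level_name"]
    old_level = player_state.get(user)
    if old_level == new_level:
        return None  # repeat event on the same level: nothing to propagate
    player_state[user] = new_level
    return {"old_level": old_level, "new_level": new_level}


def update_level_state(change):
    """Stage 2: apply a player-state change to the per-level counters."""
    if change["old_level"] is not None:
        level_state[change["old_level"]] -= 1
    level_state[change["new_level"]] += 1


def process(events):
    for event in events:
        change = update_player_state(event)
        if change is not None:
            update_level_state(change)


process([
    {"event_name": "play", "level_name": "dungeon_1", "user_name": "alice", "timestamp": 1},
    {"event_name": "play", "level_name": "dungeon_1", "user_name": "bob", "timestamp": 2},
    {"event_name": "play", "level_name": "dungeon_1", "user_name": "alice", "timestamp": 3},
    {"event_name": "play", "level_name": "dungeon_2", "user_name": "alice", "timestamp": 4},
])
```

The sketch never expires idle players and ignores out-of-order timestamps, so "live now" here really means "last seen on"; a production version would need a timeout or an explicit leave event.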
  • 25. Delivering tiered support using Kinesis, Lambda, DynamoDB and API Gateway • Example: computing customer lifetime value and serving it from a customer API • Pipeline: Kinesis event stream of events {…}, { event_name: e, user_name: u, transaction_value: v, timestamp: t }, {…} → AWS Lambda (compute player lifetime value) → DynamoDB player state table + stream → API Gateway (serve player state) → triage player support tier
  • 26. AWS provides a rich and growing toolkit for analytics on write. Stream processing frameworks: • Spark Streaming on EMR • Kinesis Client Library. Serverless event processing: • AWS Lambda • Kinesis Analytics
  • 28. 1. Keep your analytics stack independent from your game’s stack • Evolve game and analytics independently: helpful for larger teams, reduces fragility • Best of breed components for analytics and game: limited overlap between the best tools for game engines and the best for event analytics • Handle order of magnitude different scale requirements: game event volumes will dwarf active game data
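
A minimal Python sketch of the tiering logic on slides 24 and 25, under loudly stated assumptions: a dictionary stands in for the DynamoDB player state table, `support_tier` stands in for the lookup behind API Gateway, and the two thresholds are invented for illustration. The slide talks about expected LTV; the sketch uses accumulated spend to date as a simple stand-in, not a predictive model.

```python
from collections import defaultdict

# Hypothetical tier thresholds; the deck does not give any figures.
SILVER_THRESHOLD = 50.0
PLATINUM_THRESHOLD = 500.0

# Stand-in for the player state table: user_name -> spend to date.
lifetime_value = defaultdict(float)


def record_event(event):
    """Update the running lifetime value when an event carries a transaction."""
    value = event.get("transaction_value")
    if value is not None:
        lifetime_value[event["user_name"]] += value


def support_tier(user_name):
    """What the customer API would return when a support ticket is opened."""
    ltv = lifetime_value.get(user_name, 0.0)
    if ltv >= PLATINUM_THRESHOLD:
        return "platinum"
    if ltv >= SILVER_THRESHOLD:
        return "silver"
    return "standard"


for event in [
    {"event_name": "purchase", "user_name": "alice", "transaction_value": 20.0, "timestamp": 1},
    {"event_name": "purchase", "user_name": "bob", "transaction_value": 600.0, "timestamp": 2},
    {"event_name": "level_completed", "user_name": "carol", "timestamp": 3},
    {"event_name": "purchase", "user_name": "alice", "transaction_value": 40.0, "timestamp": 4},
]:
    record_event(event)
```

Because the value is updated on write, the lookup at support time is a single key read, which is what makes it practical to triage a ticket the moment it arrives.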
  • 29. 2. Develop your analytics on read first, then migrate them to on write • Example: customer acquisition model: set bid prices for different user cohorts • Model developed, tested and trained on historical data in data warehouse • Model then put live on real-time data in-stream
  • 30. 3. Have a formal framework for managing change • Change is inevitable through the lifetime of the game: • The game evolves • Analysts and scientists ask new questions of the game • The analytics team must agree a framework to handle: • Updates to the in-game event and entity schemas (affects the developers) • Evolution of the event data modeling (affects the wider company)
  • 31. A call to arms for games analysts
  • 32. Standardise on your event data pipeline • Why re-invent the wheel? • Deploy our tried and tested open-source stack directly in your AWS account • Use your data engineers to build analyses specific to your game, not to re-build the pipe!
  • 33. Learn more • http://snowplowanalytics.com • https://github.com/snowplow/snowplow
  • 34. Thank you for attending #AmazonDevDay, please take a moment to complete our survey for a chance to win the grand prize. bit.ly/DevDaySurvey Q&A will be in a room on the third floor