19. CEP over GCP Stack
[Architecture diagram. Stages: Capture, Process, Store, Analyze, Use. Components shown: Google Analytics Premium, Firebase, Google Stackdriver, Storage Transfer Service, Cloud Pub/Sub, Cloud Dataflow, Cloud Dataproc, BigQuery Storage (tables), Cloud Bigtable (NoSQL), Cloud Storage (files), BigQuery Analytics, Cloud Datalab, Cloud ML. Processing modes: stream, batch. Consumers: data scientists, business analysts. Outputs: real-time analytics, real-time dashboard, real-time alerts.]
Here are some gaming logs.
Each square represents an event where a user scored some points for their team.
The game gets popular.
Start organizing the events into a repeated structure.
The repeated structure is just a cheap way of representing an infinite data source.
Game logs are continuous.
Distributed systems can cause ambiguity...
Let’s look at some points that were scored at 8am.
<animate> The red score happened at 8am and was received quickly.
<animate> The yellow score also happened at 8am, but was received at 8:30 due to network congestion.
<animate> The green element was hours late. This was someone playing in airplane mode on a plane; the score had to wait for the plane to land.
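A minimal Python sketch of the three 8am scores may help show the gap between when an event happened and when it was received. The `ScoreEvent` type, the helper names, the calendar date, and the exact delays for the red and green events (5 seconds and 3 hours) are assumptions made for illustration; the notes only say "received quickly" and "hours late".

```python
from datetime import datetime, timedelta
from typing import NamedTuple


class ScoreEvent(NamedTuple):
    """Hypothetical record for one scored-points event."""
    label: str               # colour used on the slide
    event_time: datetime     # when the points were scored
    arrival_time: datetime   # when the pipeline received the record


def arrival_order(events):
    """Return the events in the order the pipeline receives them."""
    return sorted(events, key=lambda e: e.arrival_time)


def delay(event):
    """Skew between when the event happened and when it arrived."""
    return event.arrival_time - event.event_time


eight_am = datetime(2016, 1, 1, 8, 0)  # arbitrary date, assumed

events = [
    # airplane mode: "hours late" (3 hours is an assumed value)
    ScoreEvent("green", eight_am, eight_am + timedelta(hours=3)),
    # "received quickly" (5 seconds is an assumed value)
    ScoreEvent("red", eight_am, eight_am + timedelta(seconds=5)),
    # network congestion: received at 8:30
    ScoreEvent("yellow", eight_am, eight_am + timedelta(minutes=30)),
]

for e in arrival_order(events):
    print(f"{e.label}: scored {e.event_time:%H:%M}, "
          f"received {e.arrival_time:%H:%M}, delay {delay(e)}")
```

All three events share the same event time, yet they reach the pipeline minutes or hours apart, so arrival order alone says little about when the points were actually scored.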
So now we’ve got an unordered, infinite data set. How do we process it...