Storing time series data with Apache Cassandra

@PatrickMcFadin
Patrick McFadin 
Chief Evangelist for Apache Cassandra, DataStax
Storing Time Series Data with
1

My Background
…ran into this problem

Gave it my best shot
shard 1 shard 2 shard 3 shard 4
router
client
Patrick,
All your wildest
dreams will come
true.

Dynamo Paper(2007)
• How do we build a data store that is:
• Reliable
• Performant
• “Always On”
• Nothing new and shiny
Evolutionary. Real. Computer Science
Also the basis for Riak and Voldemort

BigTable(2006)
• Richer data model
• 1 key. Lots of values
• Fast sequential access
• 38 Papers cited

Cassandra(2008)
• Distributed features of Dynamo
• Data Model and storage from
BigTable
• February 17, 2010 it graduated to
a top-level Apache project

A Data Ocean or Pond., Lake
An In-Memory Database
A Key-Value Store
A magical database unicorn that farts rainbows

Cassandra for Applications
APACHE
CASSANDRA

Row
Column
1
Partition
Key 1
Column
2
Column
3
Column
4

Partition
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4

Table Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 2
Column
2
Column
3
Column
4
Column
1
Column
2
Column
3
Column
4
Column
1
Column
2
Column
3
Column
4
Column
1
Column
2
Column
3
Column
4
Partition
Key 2
Partition
Key 2
Partition
Key 2

Keyspace
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 2
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 2
Column
2
Column
3
Column
4
Column
1
Partition
Key 2
Column
2
Column
3
Column
4
Column
1
Partition
Key 2
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 2
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 2
Column
2
Column
3
Column
4
Column
1
Partition
Key 2
Column
2
Column
3
Column
4
Column
1
Partition
Key 2
Column
2
Column
3
Column
4
Table 1 Table 2
Keyspace 1

Token
Server
•Each partition is a 128 bit value
•Consistent hash between 2-63
and 264
•Each node owns a range of those
values
•The token is the beginning of that
range to the next node’s token value
•Virtual Nodes break these down
further
Data
Token Range
0 …

Cluster Server
Token Range
0 0-100
0-100

Cluster Server
Token Range
0 0-50
51 51-100
Server
0-50
51-100

Cluster Server
Token Range
0 0-25
26 26-50
51 51-75
76 76-100
Server
ServerServer
0-25
76-100
26-5051-75

Replication
10.0.0.1
00-25
DC1
DC1: RF=1
Node Primary
10.0.0.1 00-25
10.0.0.2 26-50
10.0.0.3 51-75
10.0.0.4 76-100
10.0.0.1
00-25
10.0.0.4
76-100
10.0.0.2
26-50
10.0.0.3
51-75

Replication
10.0.0.1
00-25
10.0.0.4
76-100
10.0.0.2
26-50
10.0.0.3
51-75
DC1
DC1: RF=2
Node Primary Replica
10.0.0.1 00-25 76-100
10.0.0.2 26-50 00-25
10.0.0.3 51-75 26-50
10.0.0.4 76-100 51-75
76-100
00-25
26-50
51-75

Replication
DC1
DC1: RF=3
Node Primary Replica Replica
10.0.0.1 00-25 76-100 51-75
10.0.0.2 26-50 00-25 76-100
10.0.0.3 51-75 26-50 00-25
10.0.0.4 76-100 51-75 26-50
10.0.0.1
00-25
10.0.0.4
76-100
10.0.0.2
26-50
10.0.0.3
51-75
76-100
51-75
00-25
76-100
26-50
00-25
51-75
26-50

Consistency
DC1
DC1: RF=3
10.0.0.1 00-25 76-100 51-75
10.0.0.2 26-50 00-25 76-100
10.0.0.3 51-75 26-50 00-25
10.0.0.4 76-100 51-75 26-50
10.0.0.1
00-25
10.0.0.4
76-100
10.0.0.2
26-50
10.0.0.3
51-75
76-100
51-75
00-25
76-100
26-50
00-25
51-75
26-50
Client
Write to
partition 15

Consistency level
Consistency Level Number of Nodes Acknowledged
One One - Read repair triggered
Local One One - Read repair in local DC
Quorum 51%
Local Quorum 51% in local DC

Consistency
DC1
DC1: RF=3
10.0.0.1 00-25 76-100 51-75
10.0.0.2 26-50 00-25 76-100
10.0.0.3 51-75 26-50 00-25
10.0.0.4 76-100 51-75 26-50
10.0.0.1
00-25
10.0.0.4
76-100
10.0.0.2
26-50
10.0.0.3
51-75
76-100
51-75
00-25
76-100
26-50
00-25
51-75
26-50
Client
Write to
partition 15
CL= One

Consistency
DC1
DC1: RF=3
10.0.0.1 00-25 76-100 51-75
10.0.0.2 26-50 00-25 76-100
10.0.0.3 51-75 26-50 00-25
10.0.0.4 76-100 51-75 26-50
10.0.0.1
00-25
10.0.0.4
76-100
10.0.0.2
26-50
10.0.0.3
51-75
76-100
51-75
00-25
76-100
26-50
00-25
51-75
26-50
Client
Write to
partition 15
CL= Quorum

Multi-datacenter
DC1
DC1: RF=3
10.0.0.1 00-25 76-100 51-75
10.0.0.2 26-50 00-25 76-100
10.0.0.3 51-75 26-50 00-25
10.0.0.4 76-100 51-75 26-50
10.0.0.1
00-25
10.0.0.4
76-100
10.0.0.2
26-50
10.0.0.3
51-75
76-100
51-75
00-25
76-100
26-50
00-25
51-75
26-50
Client
Write to
partition 15
DC2
10.1.0.1
00-25
10.1.0.4
76-100
10.1.0.2
26-50
10.1.0.3
51-75
76-100
51-75
00-25
76-100
26-50
00-25
51-75
26-50
10.0.0.1 00-25 76-100 51-75
10.0.0.2 26-50 00-25 76-100
10.0.0.3 51-75 26-50 00-25
10.0.0.4 76-100 51-75 26-50
DC2: RF=3

Cassandra Query Language - CQL

Table
CREATE TABLE weather_station ( 
id text, 
name text, 
country_code text, 
state_code text, 
call_sign text, 
lat double, 
long double, 
elevation double, 
PRIMARY KEY(id) 
);
Table Name
Column Name
Column CQL Type
Primary Key Designation Partition Key

Table
CREATE TABLE daily_aggregate_precip ( 
wsid text, 
year int, 
month int, 
day int, 
precipitation counter, 
PRIMARY KEY ((wsid), year, month, day) 
) WITH CLUSTERING ORDER BY (year DESC, month DESC, day DESC);
Partition Key
Clustering Columns
Order Override

Insert
INSERT INTO weather_station (id, call_sign, country_code, elevation, lat, long, name, state_code) 
VALUES ('727930:24233', 'KSEA', 'US', 121.9, 47.467, -122.32, 'SEATTLE SEATTLE-TACOMA INTL A', ‘WA');
Table Name Fields
Values
Partition Key: Required

Delete
DELETE FROM weather_station 
WHERE id = '727930:24233';
Table Name
Primary Key: Required

Collections
Set
id text, 
name text, 
state_code text, 
call_sign text, 
lat double, 
long double, 
equipment set<text> 
PRIMARY KEY(id) 
);
equipment set<text>
CQL Type: For Ordering
Column Name

Collections
Set
List
id text, 
name text, 
state_code text, 
call_sign text, 
lat double, 
long double, 
equipment set<text>, 
service_dates list<timestamp>, 
PRIMARY KEY(id) 
);
equipment set<text>
service_dates list<timestamp>
CQL Type
Column Name
Column Name

Collections
Set
List
Map
id text, 
name text, 
state_code text, 
call_sign text, 
lat double, 
long double, 
equipment set<text>, 
service_dates list<timestamp>, 
service_notes map<timestamp,text>, 
PRIMARY KEY(id) 
);
equipment set<text>
service_dates list<timestamp>
service_notes map<timestamp,text>
CQL Type
Column Name
Column Name
CQL Key Type CQL Value Type
Column Name

UDF and UDA
User Defined Function
CREATE OR REPLACE AGGREGATE group_and_count(text) 
SFUNC state_group_and_count 
STYPE map<text, int> 
INITCOND {};
CREATE FUNCTION state_group_and_count( state map<text, int>, type text ) 
CALLED ON NULL INPUT 
RETURNS map<text, int> 
LANGUAGE java AS ' 
Integer count = (Integer) state.get(type);
if (count == null)
count = 1;
else count++;
state.put(type, count);
return state; ' ;
User Defined Aggregate
As of Cassandra 2.2

Example: Weather Station
• Weather station collects data
• Cassandra stores in sequence
• Application reads in sequence

Queries supported
CREATE TABLE raw_weather_data ( 
wsid text, 
year int, 
month int, 
day int, 
hour int, 
temperature double, 
dewpoint double, 
pressure double, 
wind_direction int, 
wind_speed double, 
sky_condition int, 
sky_condition_text text, 
one_hour_precip double, 
six_hour_precip double, 
PRIMARY KEY ((wsid), year, month, day, hour) 
) WITH CLUSTERING ORDER BY (year DESC, month DESC, day DESC, hour DESC);
Get weather data given
•Weather Station ID
•Weather Station ID and Time
•Weather Station ID and Range of Time

Primary Key
wsid text, 
year int, 
month int, 
day int, 
hour int, 
dewpoint double, 
pressure double, 

Primary key relationship
PRIMARY KEY ((wsid),year,month,day,hour)

Partition Key

Partition Key Clustering Columns

10010:99999

2005:12:1:10
-5.6
10010:99999
-5.3-4.9-5.1
2005:12:1:9 2005:12:1:8 2005:12:1:7

Partition keys
10010:99999 Murmur3 Hash Token = 7224631062609997448
722266:13850 Murmur3 Hash Token = -6804302034103043898
INSERT INTO raw_weather_data(wsid,year,month,day,hour,temperature)
VALUES (‘10010:99999’,2005,12,1,7,-5.6);
VALUES (‘722266:13850’,2005,12,1,7,-5.6);
Consistent hash. 128 bit number
between 2-63
and 264

Partition keys
For this example, let’s make it a
reasonable number
VALUES (‘10010:99999’,2005,12,1,7,-5.6);
VALUES (‘722266:13850’,2005,12,1,7,-5.6);

Data Locality
DC1
DC1: RF=3
10.0.0.1 00-25 76-100 51-75
10.0.0.2 26-50 00-25 76-100
10.0.0.3 51-75 26-50 00-25
10.0.0.4 76-100 51-75 26-50
10.0.0.1
00-25
10.0.0.4
76-100
10.0.0.2
26-50
10.0.0.3
51-75
76-100
51-75
00-25
76-100
26-50
00-25
51-75
26-50
Client
Read partition
15
DC2
10.1.0.1
00-25
10.1.0.4
76-100
10.1.0.2
26-50
10.1.0.3
51-75
76-100
51-75
00-25
76-100
26-50
00-25
51-75
26-50
10.0.0.1 00-25 76-100 51-75
10.0.0.2 26-50 00-25 76-100
10.0.0.3 51-75 26-50 00-25
10.0.0.4 76-100 51-75 26-50
DC2: RF=3
Client
Read partition
15

Data Locality
wsid=‘10010:99999’ ?
1000 Node Cluster
You are here!

Writes
wsid text, 
year int, 
month int, 
day int, 
hour int, 
dewpoint double, 
pressure double, 

Writes
wsid text, 
year int, 
month int, 
day int, 
hour int, 
INSERT INTO raw_weather_data(wsid,year,month,day,hour,temperature) 
VALUES (‘10010:99999’,2005,12,1,10,-5.6);
VALUES (‘10010:99999’,2005,12,1,9,-5.1);
VALUES (‘10010:99999’,2005,12,1,8,-4.9);
VALUES (‘10010:99999’,2005,12,1,7,-5.3);

Write Path
Client
VALUES (‘10010:99999’,2005,12,1,7,-5.3);
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Memtable
SSTable
SSTable
SSTable
SSTable
Node
Commit Log Data * Compaction *

Date Tiered Compaction Strategy
•Group similar time blocks
•Never compact again
•Used for high density
SSTable
SSTable
SSTable
T=2015-01-01 -> 2015-01-5
T=2015-01-06 -> 2015-01-10
T=2015-01-11 -> 2015-01-15

Storage Model - Logical View
2005:12:1:10
-5.6
2005:12:1:9
-5.1
2005:12:1:8
-4.9
10010:99999
10010:99999
10010:99999
wsid hour temperature
2005:12:1:7
-5.3
10010:99999
SELECT wsid, hour, temperature 
FROM raw_weather_data 
WHERE wsid=‘10010:99999’ 
AND year = 2005 AND month = 12 AND day = 1;

2005:12:1:10
-5.6 -5.3-4.9-5.1
Storage Model - Disk Layout
2005:12:1:9 2005:12:1:8
10010:99999
2005:12:1:7
Merged, Sorted and Stored Sequentially
WHERE wsid=‘10010:99999’ 

2005:12:1:10
-5.6
2005:12:1:11
-4.9 -5.3-4.9-5.1
2005:12:1:9 2005:12:1:8
10010:99999
2005:12:1:7
WHERE wsid=‘10010:99999’ 

2005:12:1:10
-5.6
2005:12:1:11
-4.9 -5.3-4.9-5.1
2005:12:1:9 2005:12:1:8
10010:99999
2005:12:1:7
WHERE wsid=‘10010:99999’ 
2005:12:1:12
-5.4

Read Path
Client
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Column
1
Partition
Key 1
Column
2
Column
3
Column
4
Memtable
SSTable
SSTable
SSTable
Node
Data
SELECT wsid,hour,temperature 
WHERE wsid='10010:99999' 
AND year = 2005 AND month = 12 AND day = 1  
AND hour >= 7 AND hour <= 10;

Query patterns
• Range queries
• “Slice” operation on disk
Single seek on disk
10010:99999
Partition key for locality
SELECT wsid,hour,temperature 
WHERE wsid='10010:99999' 
AND year = 2005 AND month = 12 AND day = 1  
2005:12:1:10
-5.6 -5.3-4.9-5.1
2005:12:1:9 2005:12:1:8 2005:12:1:7

Query patterns
• Range queries
• “Slice” operation on disk
Programmers like this
Sorted by event_time
2005:12:1:10
-5.6
2005:12:1:9
-5.1
2005:12:1:8
-4.9
10010:99999
10010:99999
10010:99999
weather_station hour temperature
2005:12:1:7
-5.3
10010:99999
SELECT weatherstation,hour,temperature
FROM temperature
WHERE weatherstation_id=‘10010:99999'
AND year = 2005 AND month = 12 AND day = 1

Thank you!
Bring the questions
Follow me on twitter
@PatrickMcFadin

Storing time series data with Apache Cassandra

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Ähnlich wie Storing time series data with Apache Cassandra

Ähnlich wie Storing time series data with Apache Cassandra (20)

Mehr von Patrick McFadin

Mehr von Patrick McFadin (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Storing time series data with Apache Cassandra