SlideShare ist ein Scribd-Unternehmen logo
1 von 102
$ whoami
Name: Zvi Avraham
Title: Founder & CEO
Company: ZADATA Ltd
Email: zvi@zadata.com
ZΛDΛTΛ © 2015
$ whois ZADATA.com
Marketplace and Platform for
Realtime Data Feeds
ZΛDΛTΛ © 2015
Market for Realtime Data Feeds
Data
Sellers
Data
Buyers
ZΛDΛTΛ © 2015
70% of fees
Data Scientists,
Data Analysts,
Researchers
Quants,
Algo-traders,
Financial Analysts
Businesses
Content & App Publishers,
Utilities / M2M / Internet-of-Things
Independent Developers & ISVs: Web/Mobile/Devices
Can be both Data Sellers & Data Buyers
ZΛDΛTΛ
30% commission
+ acess fees
Realtime Data
Analytics
Subscription fees
Data Feeds Historical Data
Crowdsourcing
& Sensor Apps
Connected Apps
Data Sources & Destinations
ZΛDΛTΛ © 2015
ZΛDΛTΛ web apps
Physical
Alt. Finance
Social Media
analytics apps
mobile apps
Sports
Entertainment
Traffic & Transit
End-to-End Platform
for Apps & Devices
ZΛDΛTΛ © 2015
ZΛDΛTΛ
aggregate
collect broadcast
analyze
ZΛDΛTΛ © 2013
Time is
PK of the
Universe
ZΛDΛTΛ © 2013
Location
is 2i
ZΛDΛTΛ © 2013
Timeseries
Data
ZΛDΛTΛ © 2013
Financial
Timeseries
ZΛDΛTΛ © 2013
Monitoring Data
ZΛDΛTΛ © 2013
Quantified Self
IoT & M2M
ZΛDΛTΛ © 2013
NIKE+
ZΛDΛTΛ © 2013
Sensor Data
ZΛDΛTΛ © 2013
Smart Grid – Meter Data
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
Logs – Timestamped Events
ZΛDΛTΛ © 2015
http://shop.oreilly.com/product/0636920034339.do
Drinking from
Twitter FirehoseZADATA © 2013
ZΛDΛTΛ © 2013
What is Timeseries?
a sequence of measurements
taken at discrete
time intervals
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
Timeseries
Databases
ZΛDΛTΛ © 2015
http://shop.oreilly.com/product/0636920035435.do
Just use SQL?
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
Timeseries/metrics for Riak
• 4 closed source implementations:
– Boundary Kobayashi
– Hosted Graphite
– Kivra Metyr
– Temetra (smart meter data)
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
TimeSpace DB
ZΛDΛTΛ © 2013
What is TimeSpace DB?
Scalable
GPU-accelerated*
Geospatial
Timeseries*
Database*ZΛDΛTΛ © 2013
TSDB (TimeSeries DataBase )
The TSDB project was divided into two parts – Storage (A) & Analysis (B)
A distributed system for storage and analysis of
geo-temporal data collected from different sources.
Dr. Yehuda Ben-Shimol Mr Zvi Avraham
Eyal Segal
Shahar Ben-David
Alon Rolnik
Ron Schmid Morag
ZΛDΛTΛ © 2013
Why YADB?
Yet Another Database?
ZΛDΛTΛ © 2013
1st Prototype used Graphite
• Graphite problems
–1-sec resolution
–Losing datapoints
–Designed for Monitoring
–Only support simple timeseries of
•{timestamp, numeric_value}
ZΛDΛTΛ © 2013
Requirements
• 1-ms resolution
• Support for geolocation
• Multiple pre-defined schemas
– Not only {timestamp, numeric_value}
• Ability to run pre-defined functions on stored data
• Use Data Locality – running Computations near Data
• Use efficient compute language – OpenCL
• Near realtime queries shouldn’t interfere with online
• Both REST APIs & Push (Graphite protocol, MQTT)
• Bulk import/export (CSV, JSON)
ZΛDΛTΛ © 2013
Query/Workload types
OLTP / Online OLAP / Analytics
Pre-defined queries
UDF queries
- GET
- PUT
- UPDATE
- DELETE
- 2i
- Full-text search
- Statistics
- Aggregations
- Rollups
- Reporting
- Scan
- etc.
Ad-hoc queries - SQL injection ;-)
- http://TrySQL.com ?
SELECT *
FROM …
WHERE …
Online Cluster
NoSQL –
Dynamo-style
Analytics Cluster
MPP or
in-memory
batch
import
GET
PUT
DELETE
Pre-defiined
queries
Ad-hoc
queries
Map/Reduce
Pre-defined
queris
Query/Workload types
OLTP / Online OLAP / Analytics
Pre-defined queries
UDF queries
- GET
- PUT
- UPDATE
- DELETE
- 2i
- Full-text search
- Statistics
- Aggregations
- Rollups
- Reporting
- Scan
- etc.
Ad-hoc queries - SQL injection ;-)
- http://TrySQL.com ?
SELECT *
FROM …
WHERE …
Online Cluster
NoSQL –
Dynamo-style
Analytics Cluster
MPP or
in-memory
batch
import
GET
PUT
DELETE
Pre-defiined
queries
Ad-hoc
queries
Map/Reduce
Pre-defined
queris
OLTP vs OLAP DBs
OLTP OLAP
Online Analytics
Realtime Interactive
Pre-defined queries Ad-hoc queries
Low predictable latency Latency small enough, so analyst will not
lose concentration
Many clients Not many clients
Read and/or Write-intensive Batch import / ETL
Online Cluster
NoSQL –
Dynamo-style
Analytics Cluster
MPP or
in-memory
batch
import
GET
PUT
DELETE
Pre-defiined
queries
Ad-hoc
queries
Map/Reduce
Pre-defined
queris
No Need in Analytics Cluster
ZΛDΛTΛ © 2013
Online Cluster
NoSQL –
Dynamo-style
Analytics Cluster
MPP or
in-memory
batch import
GET
PUT
DELETE
Pre-defiined
queries
Ad-hoc
queries
Map/Reduce
Pre-defined
queries
Online & pre-defined analytics
in the same DB cluster
• Each node have
dedicated Compute
Device for M/R
• M/R run on either
dedicated CPU cores or
on GPUs or Accelerators
(like Xeon Phi)
ZΛDΛTΛ © 2013
Online Cluster
NoSQL – Dynamo-
style
batch import
GET
PUT
DELETE
Ad-hoc
queries
Predef. queries:
M/R in OpenCL
In-Memory
Analytics DB
TimeSpace DB Stack
CPU
OpenCL
Erlang VM
Stats/Timeseries
Application
Riak
CPUCPU
CPUCPUGPU
CPUCPUFPGA
CPUCPUAccelerator
Geo NLP/Search
TimeSpace DB
Open, heterogeneous
CPU+GPU, standard
Compute Language
Open, reliable, cross-
platform software for
concurrent, distributed
computing
ZΛDΛTΛ © 2013
Modular,
Ops-friendly,
Distributed
K/V Store
OPENCL
OpenCL – Open Compute Language
ZΛDΛTΛ © 2013
Accelerators
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
OpenCL HW Model
ZΛDΛTΛ © 2013
OpenCL Datatypes
ZΛDΛTΛ © 2013
ND-Range
N-dimensional index space
ZΛDΛTΛ © 2013
Example of OpenCL kernel
ZΛDΛTΛ © 2013
vs
Erlang/OTP OpenCL
Parallelism Task-parallel Data-parallel(*)
T-put Moderate to bad(*) Optimized for
high t-put
Latency Optimized for
low latency
bad
Floating Point / HPC bad excellent
Self-hosted Yes No –
requires host code
IO & Network Yes No
ZΛDΛTΛ © 2013
Data Representation
• Timeseries divided to number of “tablets”
• Each “tablet” has header & payload
• Everything is in binary format
– Binary
– Little Endian
– Aligned for OpenCL data types
– Essentially unpacked OpenCL struct
ZΛDΛTΛ © 2013
Marshaling Erlang ↔ OpenCL
• Erlang Binary Syntax &
Binary Comprehensions to
marshal & unmarshal
“tablets”
• Arrays of unpacked aligned
OpenCL structs
• No need in parsing
ZΛDΛTΛ © 2013
Row vs Column
ZΛDΛTΛ © 2013
Record Column
Row vs. Column DB
ZΛDΛTΛ © 2013
AoS vs SoA
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
Raw & Aggregated Schemas - OpenCL
ZΛDΛTΛ © 2013
Raw & Aggregated Schemas - Erlang
ZΛDΛTΛ © 2013
3 W-s
• WHAT?
– Topic name
• WHEN?
– timestamp
• WHERE?
– Location (lat, lon)
• PAYLOAD:
– Value(s)
ZΛDΛTΛ © 2013
Metadata
Data
3 W-s
• WHAT?
– “/weather/us/ca/san-francisco/temp_c”
• WHEN?
– 2013-10-23T07:31:00.150Z
• WHERE?
– (37.7756, -122.4193)
• PAYLOAD:
– 20.35
ZΛDΛTΛ © 2013
Other Pre-defined Schemas
• Raw & Aggregated Geospatial Timeseries
– Timestamp, Lat, Lon, Value
– Timestamp, Bounding Box, Count, Min, Max, Sum
• Financial Timeseries
– Timestamp, Bid, Ask, Last, Volume
– Timestamp, Open, High, Low, Close, Volume
• Raw & Aggregated Analytics data
– Clickstream, CTR, etc.
• Twitter data (see demo)
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
& OpenCL!
ZΛDΛTΛ © 2013
Multiple Storage Backends
• ETS
– In-memory, mostly for testing
• Riak PB
– Using Riak as external DB
• Riak Local Client
– Native Erlang client
– Usefull in M/R & riak-core
• DynamoDB
– Using AWS DynamoDB as external DB
ZΛDΛTΛ © 2013
Multiple Storage Backends
• ETS
– In-memory, mostly for testing
• Riak PB
– Using Riak as external DB
• Riak Local Client – Data Locality
– Native Erlang client
– Usefull in M/R & riak-core
• DynamoDB
– Using AWS DynamoDB as external DB
ZΛDΛTΛ © 2013
Riak Storage
• LevelDB backend, since we need 2i
• Unlike BitCask, no auto expiration in LevelDB,
so we have a process deleting old “tablets”
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
Geohash
ZΛDΛTΛ © 2013
Dimensionality Reduction
using Z-curve
ZΛDΛTΛ © 2013
Semantic Keys for Tablets
• Tablet Key:
– “timeframe|topic_name|first_timestamp”
• 2i for time range:
– Integer index: [first_timestamp, last_timestamp]
• 2i for location bounding box:
– Binary index: [southwest_geohash, northeast_geohash]
• where timeframe:
– raw|sec|min|hour|day
ZΛDΛTΛ © 2013
Problem
• Max recommended Riak Object size is 5MB
– (theoretical limit is 50MB)
• But OpenCL need much larger buffers to be
efficient!
ZΛDΛTΛ © 2013
TimeSpace DB API
• API calls:
– Insert value with timestamp
– Insert value with timestamp & location
– Insert bulk (from CSV)
– Fetch/Delete by time range
– Fetch/Delete by time range & location bounding box
• Common params for all API calls:
– Topic name
– Timeframe (raw/sec/min/hour/day)
ZΛDΛTΛ © 2013
TimeSpace DB API (2)
• Rollups for time range
– Convert from one timeframe to another
– Store result in a new timeseries topic
• Reduce for time range
– Calculate statistics (min/max/sum/avg/etc.)
• Run OpenCL kernel on time range using M/R:
– i.e. Sentiment Analysis for tweets
– Calculate Correlations between timeseries
– Etc.
ZΛDΛTΛ © 2013
Riak Map/Reduce Languages
• JavaScript
– Slow (not V8)
• Erlang
– ~ 6 time faster than JS, but still slow
• OpenCL (“OpenCL from Erlang”)
– as fast as it gets, but
– Has overhead for small buffers
– Can interfere with Erlang VM Scheduling, if
running on host CPUs
ZΛDΛTΛ © 2013
DEMO – TWITTER DATA SCAN
ZΛDΛTΛ © 2013
Geo on GPU
• Check if a point is inside a bounding box
• Check if a point is inside a circle
• Clustering of nearby points
– Using Naïve Grid-based Clustering + CoGs
ZΛDΛTΛ © 2013
Naïve Grid-based
Clustering
ZΛDΛTΛ © 2013
NLP on GPU
• Implemented using Prime Encoding:
–Full-text Search
–Sentiment Analysis
• By counting negative vs. positive words
–Language Detection
• By counting language-specific stopwords
ZΛDΛTΛ © 2013
Prime Encoding
• Assign primes to each unique token in corpus:
– most frequent word is assigned “2”
– the next most frequent “3”, and so on
• To encode a tweet, calculate product of
primes for each token in the tweet:
– the product stored in ulong (64-bit unsinged int)
– If there is overflow, then start new 64-bit product
• Erlang:
-spec prime_encode(Str::binary()) -> [cl_ulong()].
ZΛDΛTΛ © 2013
Prime Encoding
Check if a word in a tweet
• If at least one of the products divides without
reminder to prime of the search token
– then tweet has this token
• If non of the products can be divided without
reminder
– Then this token is not in a tweet
• Erlang:
-spec prime_test(Tweet::[cl_ulong()],
Token::cl_ulong()) -> boolean().
ZΛDΛTΛ © 2013
TWITTER DATA SCHEMA
ZΛDΛTΛ © 2013
Anatomy
of a
tweet
• JSON
• ~ 4KB
• Need to parse
• Many duplications
• Nested objects
• Only few fields
actually needed
ZΛDΛTΛ © 2013
Tweet Schema – OpenCL struct
ZΛDΛTΛ © 2013
Only 112B (with calculated fields)
Tweet Schema – Erlang Record
ZΛDΛTΛ © 2013
CSV API & Export example
• Twitter data exported as CSV:
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
ZΛDΛTΛ © 2013
Full Scan + Location Clustering
(in pseudo SQL)
then run Location Clustering before returning
results to the demo app in browser
ZΛDΛTΛ © 2013
Future Directions (1)
• To Open Source or not to Open Source?
• Benchmarks, Benchmarks, Benchmarks!
• Build library of reusable OpenCL kernels
• Kernels optimized for specific devices:
–Xeon Phi, NVIDIA Tesla, AMD, latest CPUs
ZΛDΛTΛ © 2013
Future Directions (2)
• Migrate to AoS / true Column Store
• Implement NULL-columns / fields
• Consider using Parse Transforms or Elixir
metaprograming for various OpenCL & Erlang
code generation (schemas, marshalling, etc.)
ZΛDΛTΛ © 2013
Future Directions (3)
• Workarounds for Riak’s max 5MB/object limit
• Riak Core
• Consider using Riak Pipe instead of M/R
• CRDT in Riak 2.0
– counters, sets, maps, etc.
ZΛDΛTΛ © 2013
Thank You! Now Q&A
All images are taken from Google Image search and various other places on the Internet
© Copyright of corresponding owners

Weitere ähnliche Inhalte

Was ist angesagt?

New Directions for Mahout
New Directions for MahoutNew Directions for Mahout
New Directions for MahoutTed Dunning
 
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...DataWorks Summit/Hadoop Summit
 
YARN - Presented At Dallas Hadoop User Group
YARN - Presented At Dallas Hadoop User GroupYARN - Presented At Dallas Hadoop User Group
YARN - Presented At Dallas Hadoop User GroupRommel Garcia
 
Pig on Tez: Low Latency Data Processing with Big Data
Pig on Tez: Low Latency Data Processing with Big DataPig on Tez: Low Latency Data Processing with Big Data
Pig on Tez: Low Latency Data Processing with Big DataDataWorks Summit
 
Unified, Efficient, and Portable Data Processing with Apache Beam
Unified, Efficient, and Portable Data Processing with Apache BeamUnified, Efficient, and Portable Data Processing with Apache Beam
Unified, Efficient, and Portable Data Processing with Apache BeamDataWorks Summit/Hadoop Summit
 
Architecting a Scalable Hadoop Platform: Top 10 considerations for success
Architecting a Scalable Hadoop Platform: Top 10 considerations for successArchitecting a Scalable Hadoop Platform: Top 10 considerations for success
Architecting a Scalable Hadoop Platform: Top 10 considerations for successDataWorks Summit
 
High Speed Continuous & Reliable Data Ingest into Hadoop
High Speed Continuous & Reliable Data Ingest into HadoopHigh Speed Continuous & Reliable Data Ingest into Hadoop
High Speed Continuous & Reliable Data Ingest into HadoopDataWorks Summit
 
Sherlock: an anomaly detection service on top of Druid
Sherlock: an anomaly detection service on top of Druid Sherlock: an anomaly detection service on top of Druid
Sherlock: an anomaly detection service on top of Druid DataWorks Summit
 
Quick Introduction to Apache Tez
Quick Introduction to Apache TezQuick Introduction to Apache Tez
Quick Introduction to Apache TezGetInData
 
SQL and Machine Learning on Hadoop using HAWQ
SQL and Machine Learning on Hadoop using HAWQSQL and Machine Learning on Hadoop using HAWQ
SQL and Machine Learning on Hadoop using HAWQpivotalny
 
Build a Time Series Application with Apache Spark and Apache HBase
Build a Time Series Application with Apache Spark and Apache  HBaseBuild a Time Series Application with Apache Spark and Apache  HBase
Build a Time Series Application with Apache Spark and Apache HBaseCarol McDonald
 
Apache Tez : Accelerating Hadoop Query Processing
Apache Tez : Accelerating Hadoop Query ProcessingApache Tez : Accelerating Hadoop Query Processing
Apache Tez : Accelerating Hadoop Query ProcessingBikas Saha
 
Apache Hadoop YARN - Enabling Next Generation Data Applications
Apache Hadoop YARN - Enabling Next Generation Data ApplicationsApache Hadoop YARN - Enabling Next Generation Data Applications
Apache Hadoop YARN - Enabling Next Generation Data ApplicationsHortonworks
 
Apache Tez – Present and Future
Apache Tez – Present and FutureApache Tez – Present and Future
Apache Tez – Present and FutureJianfeng Zhang
 
Data Analysis with TensorFlow in PostgreSQL
Data Analysis with TensorFlow in PostgreSQLData Analysis with TensorFlow in PostgreSQL
Data Analysis with TensorFlow in PostgreSQLEDB
 
Hive+Tez: A performance deep dive
Hive+Tez: A performance deep diveHive+Tez: A performance deep dive
Hive+Tez: A performance deep divet3rmin4t0r
 
Observing Intraday Indicators Using Real-Time Tick Data on Apache Superset an...
Observing Intraday Indicators Using Real-Time Tick Data on Apache Superset an...Observing Intraday Indicators Using Real-Time Tick Data on Apache Superset an...
Observing Intraday Indicators Using Real-Time Tick Data on Apache Superset an...DataWorks Summit
 
GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop)
GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop)GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop)
GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop)Apache Apex
 
Design Patterns For Real Time Streaming Data Analytics
Design Patterns For Real Time Streaming Data AnalyticsDesign Patterns For Real Time Streaming Data Analytics
Design Patterns For Real Time Streaming Data AnalyticsDataWorks Summit
 

Was ist angesagt? (20)

New Directions for Mahout
New Directions for MahoutNew Directions for Mahout
New Directions for Mahout
 
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
 
YARN - Presented At Dallas Hadoop User Group
YARN - Presented At Dallas Hadoop User GroupYARN - Presented At Dallas Hadoop User Group
YARN - Presented At Dallas Hadoop User Group
 
Pig on Tez: Low Latency Data Processing with Big Data
Pig on Tez: Low Latency Data Processing with Big DataPig on Tez: Low Latency Data Processing with Big Data
Pig on Tez: Low Latency Data Processing with Big Data
 
Unified, Efficient, and Portable Data Processing with Apache Beam
Unified, Efficient, and Portable Data Processing with Apache BeamUnified, Efficient, and Portable Data Processing with Apache Beam
Unified, Efficient, and Portable Data Processing with Apache Beam
 
Architecting a Scalable Hadoop Platform: Top 10 considerations for success
Architecting a Scalable Hadoop Platform: Top 10 considerations for successArchitecting a Scalable Hadoop Platform: Top 10 considerations for success
Architecting a Scalable Hadoop Platform: Top 10 considerations for success
 
High Speed Continuous & Reliable Data Ingest into Hadoop
High Speed Continuous & Reliable Data Ingest into HadoopHigh Speed Continuous & Reliable Data Ingest into Hadoop
High Speed Continuous & Reliable Data Ingest into Hadoop
 
Sherlock: an anomaly detection service on top of Druid
Sherlock: an anomaly detection service on top of Druid Sherlock: an anomaly detection service on top of Druid
Sherlock: an anomaly detection service on top of Druid
 
Quick Introduction to Apache Tez
Quick Introduction to Apache TezQuick Introduction to Apache Tez
Quick Introduction to Apache Tez
 
SQL and Machine Learning on Hadoop using HAWQ
SQL and Machine Learning on Hadoop using HAWQSQL and Machine Learning on Hadoop using HAWQ
SQL and Machine Learning on Hadoop using HAWQ
 
Build a Time Series Application with Apache Spark and Apache HBase
Build a Time Series Application with Apache Spark and Apache  HBaseBuild a Time Series Application with Apache Spark and Apache  HBase
Build a Time Series Application with Apache Spark and Apache HBase
 
Apache Tez : Accelerating Hadoop Query Processing
Apache Tez : Accelerating Hadoop Query ProcessingApache Tez : Accelerating Hadoop Query Processing
Apache Tez : Accelerating Hadoop Query Processing
 
Apache Hadoop YARN - Enabling Next Generation Data Applications
Apache Hadoop YARN - Enabling Next Generation Data ApplicationsApache Hadoop YARN - Enabling Next Generation Data Applications
Apache Hadoop YARN - Enabling Next Generation Data Applications
 
Apache Tez – Present and Future
Apache Tez – Present and FutureApache Tez – Present and Future
Apache Tez – Present and Future
 
Data Analysis with TensorFlow in PostgreSQL
Data Analysis with TensorFlow in PostgreSQLData Analysis with TensorFlow in PostgreSQL
Data Analysis with TensorFlow in PostgreSQL
 
Hive+Tez: A performance deep dive
Hive+Tez: A performance deep diveHive+Tez: A performance deep dive
Hive+Tez: A performance deep dive
 
Observing Intraday Indicators Using Real-Time Tick Data on Apache Superset an...
Observing Intraday Indicators Using Real-Time Tick Data on Apache Superset an...Observing Intraday Indicators Using Real-Time Tick Data on Apache Superset an...
Observing Intraday Indicators Using Real-Time Tick Data on Apache Superset an...
 
February 2014 HUG : Hive On Tez
February 2014 HUG : Hive On TezFebruary 2014 HUG : Hive On Tez
February 2014 HUG : Hive On Tez
 
GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop)
GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop)GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop)
GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop)
 
Design Patterns For Real Time Streaming Data Analytics
Design Patterns For Real Time Streaming Data AnalyticsDesign Patterns For Real Time Streaming Data Analytics
Design Patterns For Real Time Streaming Data Analytics
 

Andere mochten auch

Intercom's first pitch deck!
Intercom's first pitch deck!Intercom's first pitch deck!
Intercom's first pitch deck!Eoghan McCabe
 
Introduction MQTT in English
Introduction MQTT in EnglishIntroduction MQTT in English
Introduction MQTT in EnglishEric Xiao
 
[http://1PU.SH] Building Wireless Sensor Networks with MQTT-SN, RaspberryPi a...
[http://1PU.SH] Building Wireless Sensor Networks with MQTT-SN, RaspberryPi a...[http://1PU.SH] Building Wireless Sensor Networks with MQTT-SN, RaspberryPi a...
[http://1PU.SH] Building Wireless Sensor Networks with MQTT-SN, RaspberryPi a...Zvi Avraham
 
Ethereum VM and DSLs for Smart Contracts (updated on May 12th 2015)
Ethereum VM and DSLs for Smart Contracts (updated on May 12th 2015)Ethereum VM and DSLs for Smart Contracts (updated on May 12th 2015)
Ethereum VM and DSLs for Smart Contracts (updated on May 12th 2015)Zvi Avraham
 
OSv at Cassandra Summit
OSv at Cassandra SummitOSv at Cassandra Summit
OSv at Cassandra SummitDon Marti
 
State of Bitcoin and Blockchain 2016
State of Bitcoin and Blockchain 2016State of Bitcoin and Blockchain 2016
State of Bitcoin and Blockchain 2016CoinDesk
 
Culture Code: Creating A Lovable Company
Culture Code: Creating A Lovable CompanyCulture Code: Creating A Lovable Company
Culture Code: Creating A Lovable CompanyHubSpot
 
Habits at Work - Merci Victoria Grace, Growth, Slack - 2016 Habit Summit
Habits at Work - Merci Victoria Grace, Growth, Slack - 2016 Habit SummitHabits at Work - Merci Victoria Grace, Growth, Slack - 2016 Habit Summit
Habits at Work - Merci Victoria Grace, Growth, Slack - 2016 Habit SummitHabit Summit
 

Andere mochten auch (10)

Erlang on OSv
Erlang on OSvErlang on OSv
Erlang on OSv
 
Intercom's first pitch deck!
Intercom's first pitch deck!Intercom's first pitch deck!
Intercom's first pitch deck!
 
Introduction MQTT in English
Introduction MQTT in EnglishIntroduction MQTT in English
Introduction MQTT in English
 
Machine Learning, Stock Market and Chaos
Machine Learning, Stock Market and Chaos Machine Learning, Stock Market and Chaos
Machine Learning, Stock Market and Chaos
 
[http://1PU.SH] Building Wireless Sensor Networks with MQTT-SN, RaspberryPi a...
[http://1PU.SH] Building Wireless Sensor Networks with MQTT-SN, RaspberryPi a...[http://1PU.SH] Building Wireless Sensor Networks with MQTT-SN, RaspberryPi a...
[http://1PU.SH] Building Wireless Sensor Networks with MQTT-SN, RaspberryPi a...
 
Ethereum VM and DSLs for Smart Contracts (updated on May 12th 2015)
Ethereum VM and DSLs for Smart Contracts (updated on May 12th 2015)Ethereum VM and DSLs for Smart Contracts (updated on May 12th 2015)
Ethereum VM and DSLs for Smart Contracts (updated on May 12th 2015)
 
OSv at Cassandra Summit
OSv at Cassandra SummitOSv at Cassandra Summit
OSv at Cassandra Summit
 
State of Bitcoin and Blockchain 2016
State of Bitcoin and Blockchain 2016State of Bitcoin and Blockchain 2016
State of Bitcoin and Blockchain 2016
 
Culture Code: Creating A Lovable Company
Culture Code: Creating A Lovable CompanyCulture Code: Creating A Lovable Company
Culture Code: Creating A Lovable Company
 
Habits at Work - Merci Victoria Grace, Growth, Slack - 2016 Habit Summit
Habits at Work - Merci Victoria Grace, Growth, Slack - 2016 Habit SummitHabits at Work - Merci Victoria Grace, Growth, Slack - 2016 Habit Summit
Habits at Work - Merci Victoria Grace, Growth, Slack - 2016 Habit Summit
 

Ähnlich wie TimeSpaceDB

How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxjKool
 
How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxDataStax
 
A Journey from Oracle to PostgreSQL
A Journey from Oracle to PostgreSQLA Journey from Oracle to PostgreSQL
A Journey from Oracle to PostgreSQLEDB
 
IBM Internet-of-Things architecture and capabilities
IBM Internet-of-Things architecture and capabilitiesIBM Internet-of-Things architecture and capabilities
IBM Internet-of-Things architecture and capabilitiesIBM_Info_Management
 
Jethro data meetup index base sql on hadoop - oct-2014
Jethro data meetup    index base sql on hadoop - oct-2014Jethro data meetup    index base sql on hadoop - oct-2014
Jethro data meetup index base sql on hadoop - oct-2014Eli Singer
 
IBM IoT Architecture and Capabilities at the Edge and Cloud
IBM IoT Architecture and Capabilities at the Edge and Cloud IBM IoT Architecture and Capabilities at the Edge and Cloud
IBM IoT Architecture and Capabilities at the Edge and Cloud Pradeep Natarajan
 
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...DataStax
 
Building Scalable Applications using Pivotal Gemfire/Apache Geode
Building Scalable Applications using Pivotal Gemfire/Apache GeodeBuilding Scalable Applications using Pivotal Gemfire/Apache Geode
Building Scalable Applications using Pivotal Gemfire/Apache Geodeimcpune
 
Visual Mapping of Clickstream Data
Visual Mapping of Clickstream DataVisual Mapping of Clickstream Data
Visual Mapping of Clickstream DataDataWorks Summit
 
Ibm_IoT_Architecture_and_Capabilities
Ibm_IoT_Architecture_and_CapabilitiesIbm_IoT_Architecture_and_Capabilities
Ibm_IoT_Architecture_and_CapabilitiesIBM_Info_Management
 
Cloud expo june 2013: Building a Real Time Analytics Platform on Big Data in ...
Cloud expo june 2013: Building a Real Time Analytics Platform on Big Data in ...Cloud expo june 2013: Building a Real Time Analytics Platform on Big Data in ...
Cloud expo june 2013: Building a Real Time Analytics Platform on Big Data in ...Sanjay Sharma
 
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014Jaroslav Gergic
 
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Real time big data analytics with Storm by Ron Bodkin of Think Big AnalyticsReal time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Real time big data analytics with Storm by Ron Bodkin of Think Big AnalyticsData Con LA
 
Impala tech-talk by Dimitris Tsirogiannis
Impala tech-talk by Dimitris TsirogiannisImpala tech-talk by Dimitris Tsirogiannis
Impala tech-talk by Dimitris TsirogiannisFelicia Haggarty
 
unix_linux_ORATOP_TechDays2016_presentations
unix_linux_ORATOP_TechDays2016_presentationsunix_linux_ORATOP_TechDays2016_presentations
unix_linux_ORATOP_TechDays2016_presentationsgarosgaros
 
SQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for ImpalaSQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for Impalamarkgrover
 
Open Sourcing GemFire - Apache Geode
Open Sourcing GemFire - Apache GeodeOpen Sourcing GemFire - Apache Geode
Open Sourcing GemFire - Apache GeodeApache Geode
 
An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)Anthony Baker
 

Ähnlich wie TimeSpaceDB (20)

How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStax
 
How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStax
 
Greenplum Architecture
Greenplum ArchitectureGreenplum Architecture
Greenplum Architecture
 
A Journey from Oracle to PostgreSQL
A Journey from Oracle to PostgreSQLA Journey from Oracle to PostgreSQL
A Journey from Oracle to PostgreSQL
 
IBM Internet-of-Things architecture and capabilities
IBM Internet-of-Things architecture and capabilitiesIBM Internet-of-Things architecture and capabilities
IBM Internet-of-Things architecture and capabilities
 
Jethro data meetup index base sql on hadoop - oct-2014
Jethro data meetup    index base sql on hadoop - oct-2014Jethro data meetup    index base sql on hadoop - oct-2014
Jethro data meetup index base sql on hadoop - oct-2014
 
IBM IoT Architecture and Capabilities at the Edge and Cloud
IBM IoT Architecture and Capabilities at the Edge and Cloud IBM IoT Architecture and Capabilities at the Edge and Cloud
IBM IoT Architecture and Capabilities at the Edge and Cloud
 
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
 
Building Scalable Applications using Pivotal Gemfire/Apache Geode
Building Scalable Applications using Pivotal Gemfire/Apache GeodeBuilding Scalable Applications using Pivotal Gemfire/Apache Geode
Building Scalable Applications using Pivotal Gemfire/Apache Geode
 
Visual Mapping of Clickstream Data
Visual Mapping of Clickstream DataVisual Mapping of Clickstream Data
Visual Mapping of Clickstream Data
 
OpenDataPlane Project
OpenDataPlane ProjectOpenDataPlane Project
OpenDataPlane Project
 
Ibm_IoT_Architecture_and_Capabilities
Ibm_IoT_Architecture_and_CapabilitiesIbm_IoT_Architecture_and_Capabilities
Ibm_IoT_Architecture_and_Capabilities
 
Cloud expo june 2013: Building a Real Time Analytics Platform on Big Data in ...
Cloud expo june 2013: Building a Real Time Analytics Platform on Big Data in ...Cloud expo june 2013: Building a Real Time Analytics Platform on Big Data in ...
Cloud expo june 2013: Building a Real Time Analytics Platform on Big Data in ...
 
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014
 
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Real time big data analytics with Storm by Ron Bodkin of Think Big AnalyticsReal time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
 
Impala tech-talk by Dimitris Tsirogiannis
Impala tech-talk by Dimitris TsirogiannisImpala tech-talk by Dimitris Tsirogiannis
Impala tech-talk by Dimitris Tsirogiannis
 
unix_linux_ORATOP_TechDays2016_presentations
unix_linux_ORATOP_TechDays2016_presentationsunix_linux_ORATOP_TechDays2016_presentations
unix_linux_ORATOP_TechDays2016_presentations
 
SQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for ImpalaSQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for Impala
 
Open Sourcing GemFire - Apache Geode
Open Sourcing GemFire - Apache GeodeOpen Sourcing GemFire - Apache Geode
Open Sourcing GemFire - Apache Geode
 
An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)
 

Mehr von Zvi Avraham

Data isn't the new Oil - it's a new Asset Class!
Data isn't the new Oil - it's a new Asset Class!Data isn't the new Oil - it's a new Asset Class!
Data isn't the new Oil - it's a new Asset Class!Zvi Avraham
 
Functional APIs with Absinthe GraphQL
Functional APIs with Absinthe GraphQLFunctional APIs with Absinthe GraphQL
Functional APIs with Absinthe GraphQLZvi Avraham
 
Erlang - Concurrent Language for Concurrent World
Erlang - Concurrent Language for Concurrent WorldErlang - Concurrent Language for Concurrent World
Erlang - Concurrent Language for Concurrent WorldZvi Avraham
 
Cloud Computing: AWS for Lean Startups
Cloud Computing: AWS for Lean StartupsCloud Computing: AWS for Lean Startups
Cloud Computing: AWS for Lean StartupsZvi Avraham
 
Migration To Multi Core - Parallel Programming Models
Migration To Multi Core - Parallel Programming ModelsMigration To Multi Core - Parallel Programming Models
Migration To Multi Core - Parallel Programming ModelsZvi Avraham
 

Mehr von Zvi Avraham (7)

Data isn't the new Oil - it's a new Asset Class!
Data isn't the new Oil - it's a new Asset Class!Data isn't the new Oil - it's a new Asset Class!
Data isn't the new Oil - it's a new Asset Class!
 
Functional APIs with Absinthe GraphQL
Functional APIs with Absinthe GraphQLFunctional APIs with Absinthe GraphQL
Functional APIs with Absinthe GraphQL
 
Limited supply
Limited supplyLimited supply
Limited supply
 
Erlang - Concurrent Language for Concurrent World
Erlang - Concurrent Language for Concurrent WorldErlang - Concurrent Language for Concurrent World
Erlang - Concurrent Language for Concurrent World
 
Cloud Computing: AWS for Lean Startups
Cloud Computing: AWS for Lean StartupsCloud Computing: AWS for Lean Startups
Cloud Computing: AWS for Lean Startups
 
Erlang OTP
Erlang OTPErlang OTP
Erlang OTP
 
Migration To Multi Core - Parallel Programming Models
Migration To Multi Core - Parallel Programming ModelsMigration To Multi Core - Parallel Programming Models
Migration To Multi Core - Parallel Programming Models
 

Kürzlich hochgeladen

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachBoston Institute of Analytics
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...amitlee9823
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...karishmasinghjnh
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Pooja Nehwal
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 

Kürzlich hochgeladen (20)

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning Approach
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 

TimeSpaceDB

  • 1. $ whoami Name: Zvi Avraham Title: Founder & CEO Company: ZADATA Ltd Email: zvi@zadata.com ZΛDΛTΛ © 2015
  • 2. $ whois ZADATA.com Marketplace and Platform for Realtime Data Feeds ZΛDΛTΛ © 2015
  • 3. Market for Realtime Data Feeds Data Sellers Data Buyers ZΛDΛTΛ © 2015 70% of fees Data Scientists, Data Analysts, Researchers Quants, Algo-traders, Financial Analysts Businesses Content & App Publishers, Utilities / M2M / Internet-of-Things Independent Developers & ISVs: Web/Mobile/Devices Can be both Data Sellers & Data Buyers ZΛDΛTΛ 30% commission + acess fees Realtime Data Analytics Subscription fees Data Feeds Historical Data Crowdsourcing & Sensor Apps Connected Apps
  • 4. Data Sources & Destinations ZΛDΛTΛ © 2015 ZΛDΛTΛ web apps Physical Alt. Finance Social Media analytics apps mobile apps Sports Entertainment Traffic & Transit
  • 5. End-to-End Platform for Apps & Devices ZΛDΛTΛ © 2015 ZΛDΛTΛ aggregate collect broadcast analyze
  • 6. ZΛDΛTΛ © 2013 Time is PK of the Universe
  • 14. ZΛDΛTΛ © 2013 Smart Grid – Meter Data
  • 16. ZΛDΛTΛ © 2013 Logs – Timestamped Events
  • 20. What is Timeseries? a sequence of measurements taken at discrete time intervals ZΛDΛTΛ © 2013
  • 32. Timeseries/metrics for Riak • 4 closed source implementations: – Boundary Kobayashi – Hosted Graphite – Kivra Metyr – Temetra (smart meter data) ZΛDΛTΛ © 2013
  • 38. What is TimeSpace DB? Scalable GPU-accelerated* Geospatial Timeseries* Database*ZΛDΛTΛ © 2013
  • 39. TSDB (TimeSeries DataBase ) The TSDB project was divided into two parts – Storage (A) & Analysis (B) A distributed system for storage and analysis of geo-temporal data collected from different sources. Dr. Yehuda Ben-Shimol Mr Zvi Avraham Eyal Segal Shahar Ben-David Alon Rolnik Ron Schmid Morag
  • 40. ZΛDΛTΛ © 2013 Why YADB? Yet Another Database?
  • 42. 1st Prototype used Graphite • Graphite problems –1-sec resolution –Losing datapoints –Designed for Monitoring –Only support simple timeseries of •{timestamp, numeric_value} ZΛDΛTΛ © 2013
  • 43. Requirements • 1-ms resolution • Support for geolocation • Multiple pre-defined schemas – Not only {timestamp, numeric_value} • Ability to run pre-defined functions on stored data • Use Data Locality – running Computations near Data • Use efficient compute language – OpenCL • Near realtime queries shouldn’t interfere with online • Both REST APIs & Push (Graphite protocol, MQTT) • Bulk import/export (CSV, JSON) ZΛDΛTΛ © 2013
  • 44. Query/Workload types OLTP / Online OLAP / Analytics Pre-defined queries UDF queries - GET - PUT - UPDATE - DELETE - 2i - Full-text search - Statistics - Aggregations - Rollups - Reporting - Scan - etc. Ad-hoc queries - SQL injection ;-) - http://TrySQL.com ? SELECT * FROM … WHERE … Online Cluster NoSQL – Dynamo-style Analytics Cluster MPP or in-memory batch import GET PUT DELETE Pre-defiined queries Ad-hoc queries Map/Reduce Pre-defined queris
  • 45. Query/Workload types OLTP / Online OLAP / Analytics Pre-defined queries UDF queries - GET - PUT - UPDATE - DELETE - 2i - Full-text search - Statistics - Aggregations - Rollups - Reporting - Scan - etc. Ad-hoc queries - SQL injection ;-) - http://TrySQL.com ? SELECT * FROM … WHERE … Online Cluster NoSQL – Dynamo-style Analytics Cluster MPP or in-memory batch import GET PUT DELETE Pre-defiined queries Ad-hoc queries Map/Reduce Pre-defined queris
  • 46. OLTP vs OLAP DBs OLTP OLAP Online Analytics Realtime Interactive Pre-defined queries Ad-hoc queries Low predictable latency Latency small enough, so analyst will not lose concentration Many clients Not many clients Read and/or Write-intensive Batch import / ETL Online Cluster NoSQL – Dynamo-style Analytics Cluster MPP or in-memory batch import GET PUT DELETE Pre-defiined queries Ad-hoc queries Map/Reduce Pre-defined queris
  • 47. No Need in Analytics Cluster ZΛDΛTΛ © 2013 Online Cluster NoSQL – Dynamo-style Analytics Cluster MPP or in-memory batch import GET PUT DELETE Pre-defiined queries Ad-hoc queries Map/Reduce Pre-defined queries
  • 48. Online & pre-defined analytics in the same DB cluster • Each node have dedicated Compute Device for M/R • M/R run on either dedicated CPU cores or on GPUs or Accelerators (like Xeon Phi) ZΛDΛTΛ © 2013 Online Cluster NoSQL – Dynamo- style batch import GET PUT DELETE Ad-hoc queries Predef. queries: M/R in OpenCL In-Memory Analytics DB
  • 49. TimeSpace DB Stack CPU OpenCL Erlang VM Stats/Timeseries Application Riak CPUCPU CPUCPUGPU CPUCPUFPGA CPUCPUAccelerator Geo NLP/Search TimeSpace DB Open, heterogeneous CPU+GPU, standard Compute Language Open, reliable, cross- platform software for concurrent, distributed computing ZΛDΛTΛ © 2013 Modular, Ops-friendly, Distributed K/V Store
  • 50. OPENCL OpenCL – Open Compute Language ZΛDΛTΛ © 2013
  • 56. Example of OpenCL kernel ZΛDΛTΛ © 2013
  • 57. vs Erlang/OTP OpenCL Parallelism Task-parallel Data-parallel(*) T-put Moderate to bad(*) Optimized for high t-put Latency Optimized for low latency bad Floating Point / HPC bad excellent Self-hosted Yes No – requires host code IO & Network Yes No ZΛDΛTΛ © 2013
  • 58. Data Representation • Timeseries divided to number of “tablets” • Each “tablet” has header & payload • Everything is in binary format – Binary – Little Endian – Aligned for OpenCL data types – Essentially unpacked OpenCL struct ZΛDΛTΛ © 2013
  • 59. Marshaling Erlang ↔ OpenCL • Erlang Binary Syntax & Binary Comprehensions to marshal & unmarshal “tablets” • Arrays of unpacked aligned OpenCL structs • No need in parsing ZΛDΛTΛ © 2013
  • 60. Row vs Column ZΛDΛTΛ © 2013 Record Column
  • 61. Row vs. Column DB ZΛDΛTΛ © 2013
  • 66. Raw & Aggregated Schemas - OpenCL ZΛDΛTΛ © 2013
  • 67. Raw & Aggregated Schemas - Erlang ZΛDΛTΛ © 2013
  • 68. 3 W-s • WHAT? – Topic name • WHEN? – timestamp • WHERE? – Location (lat, lon) • PAYLOAD: – Value(s) ZΛDΛTΛ © 2013 Metadata Data
  • 69. 3 W-s • WHAT? – “/weather/us/ca/san-francisco/temp_c” • WHEN? – 2013-10-23T07:31:00.150Z • WHERE? – (37.7756, -122.4193) • PAYLOAD: – 20.35 ZΛDΛTΛ © 2013
  • 70. Other Pre-defined Schemas • Raw & Aggregated Geospatial Timeseries – Timestamp, Lat, Lon, Value – Timestamp, Bounding Box, Count, Min, Max, Sum • Financial Timeseries – Timestamp, Bid, Ask, Last, Volume – Timestamp, Open, High, Low, Close, Volume • Raw & Aggregated Analytics data – Clickstream, CTR, etc. • Twitter data (see demo) ZΛDΛTΛ © 2013
  • 74. Multiple Storage Backends • ETS – In-memory, mostly for testing • Riak PB – Using Riak as external DB • Riak Local Client – Native Erlang client – Usefull in M/R & riak-core • DynamoDB – Using AWS DynamoDB as external DB ZΛDΛTΛ © 2013
  • 75. Multiple Storage Backends • ETS – In-memory, mostly for testing • Riak PB – Using Riak as external DB • Riak Local Client – Data Locality – Native Erlang client – Usefull in M/R & riak-core • DynamoDB – Using AWS DynamoDB as external DB ZΛDΛTΛ © 2013
  • 76. Riak Storage • LevelDB backend, since we need 2i • Unlike BitCask, no auto expiration in LevelDB, so we have a process deleting old “tablets” ZΛDΛTΛ © 2013
  • 80. Semantic Keys for Tablets • Tablet Key: – “timeframe|topic_name|first_timestamp” • 2i for time range: – Integer index: [first_timestamp, last_timestamp] • 2i for location bounding box: – Binary index: [southwest_geohash, northeast_geohash] • where timeframe: – raw|sec|min|hour|day ZΛDΛTΛ © 2013
  • 81. Problem • Max recommended Riak Object size is 5MB – (theoretical limit is 50MB) • But OpenCL need much larger buffers to be efficient! ZΛDΛTΛ © 2013
  • 82. TimeSpace DB API • API calls: – Insert value with timestamp – Insert value with timestamp & location – Insert bulk (from CSV) – Fetch/Delete by time range – Fetch/Delete by time range & location bounding box • Common params for all API calls: – Topic name – Timeframe (raw/sec/min/hour/day) ZΛDΛTΛ © 2013
  • 83. TimeSpace DB API (2) • Rollups for time range – Convert from one timeframe to another – Store result in a new timeseries topic • Reduce for time range – Calculate statistics (min/max/sum/avg/etc.) • Run OpenCL kernel on time range using M/R: – i.e. Sentiment Analysis for tweets – Calculate Correlations between timeseries – Etc. ZΛDΛTΛ © 2013
  • 84. Riak Map/Reduce Languages • JavaScript – Slow (not V8) • Erlang – ~ 6 time faster than JS, but still slow • OpenCL (“OpenCL from Erlang”) – as fast as it gets, but – Has overhead for small buffers – Can interfere with Erlang VM Scheduling, if running on host CPUs ZΛDΛTΛ © 2013
  • 85. DEMO – TWITTER DATA SCAN ZΛDΛTΛ © 2013
  • 86. Geo on GPU • Check if a point is inside a bounding box • Check if a point is inside a circle • Clustering of nearby points – Using Naïve Grid-based Clustering + CoGs ZΛDΛTΛ © 2013
  • 88. NLP on GPU • Implemented using Prime Encoding: –Full-text Search –Sentiment Analysis • By counting negative vs. positive words –Language Detection • By counting language-specific stopwords ZΛDΛTΛ © 2013
  • 89. Prime Encoding • Assign primes to each unique token in corpus: – most frequent word is assigned “2” – the next most frequent “3”, and so on • To encode a tweet, calculate product of primes for each token in the tweet: – the product stored in ulong (64-bit unsinged int) – If there is overflow, then start new 64-bit product • Erlang: -spec prime_encode(Str::binary()) -> [cl_ulong()]. ZΛDΛTΛ © 2013
  • 90. Prime Encoding Check if a word in a tweet • If at least one of the products divides without reminder to prime of the search token – then tweet has this token • If non of the products can be divided without reminder – Then this token is not in a tweet • Erlang: -spec prime_test(Tweet::[cl_ulong()], Token::cl_ulong()) -> boolean(). ZΛDΛTΛ © 2013
  • 92. Anatomy of a tweet • JSON • ~ 4KB • Need to parse • Many duplications • Nested objects • Only few fields actually needed ZΛDΛTΛ © 2013
  • 93. Tweet Schema – OpenCL struct ZΛDΛTΛ © 2013 Only 112B (with calculated fields)
  • 94. Tweet Schema – Erlang Record ZΛDΛTΛ © 2013
  • 95. CSV API & Export example • Twitter data exported as CSV: ZΛDΛTΛ © 2013
  • 98. Full Scan + Location Clustering (in pseudo SQL) then run Location Clustering before returning results to the demo app in browser ZΛDΛTΛ © 2013
  • 99. Future Directions (1) • To Open Source or not to Open Source? • Benchmarks, Benchmarks, Benchmarks! • Build library of reusable OpenCL kernels • Kernels optimized for specific devices: –Xeon Phi, NVIDIA Tesla, AMD, latest CPUs ZΛDΛTΛ © 2013
  • 100. Future Directions (2) • Migrate to AoS / true Column Store • Implement NULL-columns / fields • Consider using Parse Transforms or Elixir metaprograming for various OpenCL & Erlang code generation (schemas, marshalling, etc.) ZΛDΛTΛ © 2013
  • 101. Future Directions (3) • Workarounds for Riak’s max 5MB/object limit • Riak Core • Consider using Riak Pipe instead of M/R • CRDT in Riak 2.0 – counters, sets, maps, etc. ZΛDΛTΛ © 2013
  • 102. Thank You! Now Q&A All images are taken from Google Image search and various other places on the Internet © Copyright of corresponding owners