3. #DevoxxFR 3
Open Source Engines & Tools Commercial Engines & Applications
Enterprise-Grade Platform Services
DataProcessing
Web-Scale Storage
MapR-FS MapR-DB
Search and
Others
Real Time Unified Security Multi-tenancy Disaster Recovery Global NamespaceHigh Availability
MapR Streams
Cloud and
Managed
Services
Search and
Others
UnifiedManagementandMonitoring
Search and
Others
Event StreamingDatabase
Custom
Apps
HDFS API POSIX, NFS HBase API JSON API Kafka API
MapR Converged Data Platform
4. #DevoxxFR 4
Streaming technology is enabling the obvious:
continuous processing on data
that is continuously produced
Hint: you already have streaming data
18. #DevoxxFR 18
Deployment
Local Cluster Cloud
Single JVM Standalone,YARN, Mesos AWS, Google
Core
Runtime
Distributed Streaming Dataflow
DataSet API
Batch Processing
API
&
Libraries
19. #DevoxxFR
Flink Architecture
19
Deployment
Local Cluster Cloud
Single JVM Standalone,YARN, Mesos AWS, Google
Core
Runtime
Distributed Streaming Dataflow
DataSet API
Batch Processing
API
&
Libraries
FlinkML
Machine Learning
Gelly
Graph Processing
Table
Relational
20. #DevoxxFR
Flink Architecture
20
Deployment
Local Cluster Cloud
Single JVM Standalone,YARN, Mesos AWS, Google
Core
Runtime
Distributed Streaming Dataflow
DataSet API
Batch Processing
DataStream API
Stream Processing
API
&
Libraries
FlinkML
Machine Learning
Gelly
Graph Processing
Table
Relational
21. #DevoxxFR
Flink Architecture
21
Deployment
Local Cluster Cloud
Single JVM Standalone,YARN, Mesos AWS, Google
Core
Runtime
Distributed Streaming Dataflow
DataSet API
Batch Processing
DataStream API
Stream Processing
API
&
Libraries
FlinkML
Machine Learning
Gelly
Graph Processing
Table
Relational
CEP
Event Processing
Table
Relational
39. #DevoxxFR
What Is Event-Time Processing
39
1977 1980 1983 1999 2002 2005 2015
Processing Time
Episode
IV
Episode
V
Episode
VI
Episode
I
Episode
II
Episode
III
Episode
VII
Event Time
42. #DevoxxFR
Complex Event Processing
42
• Analyzing a stream of events and drawing conclusions
• “if A and then B ! infer event C”
• Demanding requirements on stream processor
• Low latency!
• Exactly-once semantics & event-time support
44. #DevoxxFR
Order Events
44
Process is reflected in a stream of order events
Order(orderId, tStamp, “received”)
Shipment(orderId, tStamp, “shipped”)
Delivery(orderId, tStamp,
“delivered”)
orderId: Identifies the order
tStamp: Time at which the event happened
46. #DevoxxFR
CEP to the Rescue
46
Define processing and delivery intervals (SLAs)
ProcessSucc(orderId, tStamp, duration)
ProcessWarn(orderId, tStamp)
DeliverySucc(orderId, tStamp, duration)
DeliveryWarn(orderId, tStamp)
orderId: Identifies the order
tStamp: Time when the event happened
duration: Duration of the processing/delivery
54. #DevoxxFR
The End
54
• Process events in real time and/or batch
• Complex Event Processing (CEP)
• Many other things to discover
• Deployment
• High Availability
• Table/Relational API
• … https://mapr.com/ebooks/