SlideShare ist ein Scribd-Unternehmen logo
1 von 58
Downloaden Sie, um offline zu lesen
Streaming Pipelines in Kubernetes
Using Apache Pulsar, Heron and
BookKeeper
Karthik	Ramasamy	
Cofounder
Streamlio
2
Information Age
Event	Data	is	key	
Ká !
3
Increasingly Connected World
Internet of Things
30	B	connected	devices	by	2020
Health Care
153	Exabytes	(2013)	->	2314	Exabytes	(2020)
Machine Data
40%	of	digital	universe	by	2020
Connected Vehicles
Data	transferred	per	vehicle	per	month	
4	MB	->	5	GB
Digital Assistants (Predictive Analytics)
$2B	(2012)	->	$6.5B	(2019)	[1]	
Siri/Cortana/Google	Now
Augmented/Virtual Reality
$150B	by	2020	[2]	
Oculus/HoloLens/Magic	Leap
Ñ
!+
>
4
Traditional Batch Processing
Challenges	
Introduces	too	much	“decision	latency”	
Responses	are	delivered	“a[er	the	fact”	
Maximum	value	of	the	idenfied	situaon	is	lost	
Decisions	are	made	on	old	and	stale	data	
Data	at	Rest
Store Analyze Act
5
The New Era: Streaming Data/Fast Data
Events	are	analyzed	and	processed	in	real-me	as	they	arrive	
Decisions	are	mely,	contextual	and	based	on	fresh	data	
Decision	latency	is	eliminated	
Data	in	moon
Modern	Data	Pipelines Streaming	Data	Pipelines
6
Streaming Data Pipeline
A	Streaming	Data	Pipeline		
captures	events	for	analysis,		
process	as	they	arrive	and		
produce	con=nuous	results
7
Streaming Data Pipeline
Data	Lake
Product	Features
Adhoc	Features
Counng
Machine	Learning
Extract	Transform	Load	(ETL)
Click	Data
Engagement	Data
Email	sends
User	events
Use Cases
9
Product Safety
Observa=ons	
Fight	spammy	content,	engagements,	and	behaviors	in	Twieer	
Spam	campaign	comes	in	large	batch	
Despite	randomized	tweaks,	enough	similarity	among	spammy	enes	are	
preserved	
Requirement	
Real	me	-	a	compeon	with	spammers	(i.e)	“detect”	vs	“mutate”	
Generic	-	need	to	support	all	common	feature	representaons
10
Product Safety - System Overview
KV	store	for	
clustering
Messaging	
System	
(Event	Bus)
Similarity	Clustering	(Heron)
11
Real Time Ads
KV	store
Messaging	
System	
(Event	Bus)
Ads	Serving	(Heron)
Ads	Predicon	(Heron)
Impressions
Spend
Ads	Analycs	(Heron)
Engagements
Spend
Ads	Requests
Ads	Responses
Impressions
Spend
12
Connected Cars
KV	store	for	
clustering
Messaging	
System
Traffic	Paeerns
Messaging	
System Data	Capture/Filter
Fuel	Efficiency
13
Streaming Use Cases
Algorithmic	trading	
Online	fraud	detecon	
Geo	fencing	
Proximity/locaon	tracking	
Intrusion	detecon	systems	
Traffic	management
Real	me	recommendaons	
Churn	detecon	
Internet	of	things	
Social	media/data	analycs	
Gaming	data	feed	
IoT	Edge	Analycs
Architectural Pattern
15
Recurring Pattern
ProcessMessaging
Storage
Data	Ingeson Data	Processing
Results	StorageData	Storage
Data	
Serving
16
State of the World
Aggregation
Systems
Messaging
Systems
Result
Engine
HDFS
Queryable
Engines
17
Towards Unification and Simplification
Interactive
Querying
Storm API Streamlets SQL
Application
Builder
Pulsar
API
BK/
HDFS
API
Kubernetes
Metadata
Management
Operational
Monitoring
Chargeback
Security
Authentication
Quota
Management
Kafka
API
Apache Pulsar
19
Why Apache Pulsar?
Ordering	
Guaranteed	ordering
Mul=-tenancy	
A	single	cluster	can	support	many	
tenants	and	use	cases
High	throughput	
Can	reach	1.8	M	messages/s	in	a	
single	par==on
Durability	
Data	replicated	and	synced	to	disk
Geo-replica=on	
Out	of	box	support	for	geographically	
distributed	applica=ons
Unified	messaging	model	
Support	both	Topic	&	Queue	
seman=c	in	a	single	model
Delivery	Guarantees	
At	least	once,	at	most	once	and	effec=vely	
once
Low	Latency	
Low	publish	latency	of	5ms	at	99pct
Highly	scalable	
Can	support	millions	of	topics
20
Pulsar Architecture
Separa=on	of	Storage	and	Serving
SERVING
Brokers can be added independently
Traffic can be shifted quickly across brokers
STORAGE	
Bookies can be added independently
New bookies will ramp up traffic quickly
21
Segment Centric Architecture
22
Broker Failure Recovery
• Topic is reassigned to
an available broker
based on load
• Can reconstruct the
previous state
consistently
• No data needs to be
copied
• Failover handled
transparently by client
library
23
Bookie Failure Recovery
• After a write failure,
BookKeeper will
immediately switch write to
a new bookie, within the
same segment.
24
Bookie Failure Recovery
• In background, starts a
many-to-many recovery
process to regain the
configured replication
factor
25
Pulsar Architecture
Geo	Replica=on
GEO REPLICATION
Asynchronous replication
Integrated in the broker message flow
Simple configuration to add/remove regions
Topic	(T1) Topic	(T1)
Topic	(T1)
Subscripon	
(S1)
Subscripon	
(S1)
Producer		
(P1)
Consumer		
(C1)
Producer		
(P3)
Producer		
(P2)
Consumer		
(C2)
Data	Center	A Data	Center	B
Data	Center	C
26
Pulsar in Production
3+	years	
Serves	2.3	million	topics	
100	billion	messages/day	
Average	latency	<	5	ms	
99%	15	ms	(strong	durability	guarantees)	
Zero	data	loss	
80+	applicaons	
Self	served	provisioning	
Full-mesh	cross-datacenter	replicaon	-	
8+	data	centers
Apache Heron
28
Heron Design Goals
Efficiency	
Reduce	resource	consumpon
Support	for	diverse	workloads	
Throughput	vs	latency	sensive
Support	for	mul=ple	seman=cs	
Atmost	once,	Atleast	once,	
Effecvely	once
Na=ve	Mul=-Language	Support	
C++,	Java,	Python
Task	Isola=on	
Ease	of	debug-ability/isolaon/profiling	
Support	for	back	pressure	
Topologies	should	be	self	adjusng
Use	of	containers	
Runs	in	schedulers	-	Kubernetes	&	DCOS	&	
many	more
Mul=-level	APIs	
Procedural,	Funconal	and	Declarave	for	
diverse	applicaons
Diverse	deployment	models	
Run	as	a	service	or	pure	library
29
Heron Data Model
%
%
%
%
%
Spout 1
Spout 2
Bolt 1
Bolt 2
Bolt 3
Bolt 4
Bolt 5
30
Writing Heron Topologies
Procedural - Low Level API
Directly	write	your	spouts	and	bolts
Functional - Mid Level API
Use	of	maps,	flat	maps,	transform,	windows
Declarative - SQL (in the works)
Use	of	declarave	language	-	specify	what	you	
want,	system	will	figure	it	out.
,
%
31
Topology Execution
Topology Master
ZK
Cluster
Stream
Manager
I1 I2 I3 I4
Stream
Manager
I1 I2 I3 I4
Logical Plan,
Physical Plan and
Execution State
Sync Physical Plan
DATA CONTAINER DATA CONTAINER
Metrics
Manager
Metrics
Manager
Metrics
Manager
Health
Manager
MASTER
CONTAINER
32
Heron @Twitter
LARGEST	CLUSTER
100’s	of	TOPOLOGIES
BILLIONS	OF	MESSAGES100’s	OF	TERABYTESREDUCED	INCIDENTS
GOOD	NIGHT	SLEEP
3X - 5X reduction in resource usage
33
Heron Topology Complexity
34
Heron Topology Scale
CONTAINERS - 1 TO 600 INSTANCES - 10 TO 6000
35
Heron Happy Facts :)
!	No	more	pages	during	midnight	for	Heron	team	
"	Very	rare	incidents	for	Heron	customer	teams	
#	Easy	to	debug	during	incident	for	quick	turn	around	
$	Reduced	resource	ulizaon	saving	cost
Apache BookKeeper
37
Observations
Computaon	across	batch/streaming	is	similar	
Expressed	as	DAGS	
Run	in	parallel	on	the	cluster	
Intermediate	results	need	not	be	materialized	
Funconal/Declarave	APIs	
Storage	is	the	key	
Messaging/Storage	are	two	faces	of	the	same	coin	
They	serve	the	same	data
38
Storage Requirements
Requirements	for	a	real-=me	storage	placorm
Be	able	to	write	and	read	streams	of	records	with	low	latency,	storage	durability	
Data	storage	should	be	durable,	consistent	and	fault	tolerant	
Enable	clients	to	stream	or	tail	ledgers	to	propagate	data	as	they’re	wrieen	
Store	and	provide	access	to	both	historic	and	real-me	data
39
Apache BookKeeper - Stream Storage
A	storage	for	log	streams
Replicated,	durable	storage	of	log	streams	
Provide	fast	tailing/streaming	facility	
Opmized	for	immutable	data	
Low-latency	durability	
Simple	repeatable	read	consistency	
High	write	and	read	availability
40
Record
Smallest	I/O	and	Address	Unit
A	sequence	of	invisible	records	
A	record	is	sequence	of	bytes	
The	smallest	I/O	unit,	as	well	as	the	unit	of	address	
Each	record	contains	sequence	numbers	for	addressing
41
Logs
Two	Storage	Primi=ves
Ledger:	A	finite	sequence	of	records.	
Stream:	An	infinite	sequence	of	records.
42
Ledger
Finite	sequence	of	records
Ledger:	A	finite	sequence	of	records	that	gets	terminated	
A	client	explicitly	close	it	
A	writer	who	writes	records	into	it	has	crashed.
43
Stream
Infinite	sequence	of	records
Stream:	An	unbounded,	infinite	sequence	of	records	
Physically	comprised	of	mulple	ledgers
44
Bookies
Stores	fragment	of	records
Bookie	-	A	storage	server	to	store	data	records	
Ensemble:	A	group	of	bookies	storing	the	data	records	of	a	ledger	
Individual	bookies	store	fragments	of	ledgers
45
Using BookKeeper
Durable	Messaging,	Scalable	Compute	and	Stream	Storage
46
Companies using BookKeeper
Enterprise	Grade	Stream	Storage
47
Companies using the projects
Enterprise	Grade	Stream	Storage
48
Experiences with Kubernetes
Use	of	naked	pods	for	Heron	containers	
Never	gets	rescheduled	
Have	to	create	names	similar	to	Heron	container	(topology-name-shard-id)	
Have	to	keep	track	of	every	pod	
Cannot	use	deployment	
Reproducible	shard	id	
Deployments	produce	different	ids	a[er	a	pod	failure	
Topology	names	have	to	be	lowercase	
First	Implementa=on
49
Experiences with Kubernetes
Use	of	stateful	sets		
Nice	mapping	form	stateful	sets	to	container	shard	ids	
Inially	used	defaults	
Very	long	startup	me	
Very	long	shutdown	me	
Long	pod	restart	me	(during	machine	failure	&	pod	failure)
Second	Implementa=on
50
Experiences with Kubernetes
Invesgaon	pointed	out	us	to		
Default	pod	management	policy	is	set	to	OrderedReady	
Default	rescheduling	pod	on	a	node	failure	is	set	to	5	mins	
Soluons	
Sets	the	toleraons	to	10	seconds	for	restarts	
node.kubernetes.io/not-ready	
node.alpha.kubernetes.io/notReady	
node.alpha.kubernetes.io/unreachable
Second	Implementa=on
51
Experiences with Kubernetes
Dynamically	updang	the	job	
Patching	stateful	sets	-	
Kubernetes	Java	Client	library	has	bug	
Soluons	
Waing	for	a	fix!
Second	Implementa=on
52
Experiences with Kubernetes
Submidng	User	jobs
Heron	API		
Server
Job	1
Job	2
Job	3User
Needs	a	way	to	talk	to	Kubernetes	
API	server
Solu=on	-	Run	a	slim	side	container	with	kubectl	proxy
53
Ingesting Data to Pulsar
Pulsar	Broker	1 Pulsar	Broker	2 Pulsar	Broker	3
Bookie	1 Bookie	2 Bookie	3 Bookie	4 Bookie	5
Apache	BookKeeper
Apache	Pulsar
Producer	 Consumer	
Internal	IPs
54
Ingesting Data to Pulsar
Pulsar	Broker	1 Pulsar	Broker	2 Pulsar	Broker	3
Bookie	1 Bookie	2 Bookie	3 Bookie	4 Bookie	5
Apache	BookKeeper
Apache	Pulsar
Producer	 Consumer	
Load	Balancer
Pulsar	Proxy	1 Pulsar	Proxy	3Pulsar	Proxy	1
55
Finally…
57
GET	IN	TOUCH
C O N T A C T 	 U S
@kramasamy
955	Alma	Street,	Palo	Alto,	CA
karthik@streaml.io
Streaming Pipelines in Kubernetes Using Apache Pulsar, Heron and BookKeeper

Weitere ähnliche Inhalte

Was ist angesagt?

Innovating With Data and Analytics
Innovating With Data and AnalyticsInnovating With Data and Analytics
Innovating With Data and AnalyticsVMware Tanzu
 
Big data privacy issues in public social media
Big data privacy issues in public social mediaBig data privacy issues in public social media
Big data privacy issues in public social mediaSupriya Radhakrishna
 
Cloud Computing, SDN, Big Data and Internet of Everything - Lew Tucker
Cloud Computing, SDN, Big Data and Internet of Everything - Lew TuckerCloud Computing, SDN, Big Data and Internet of Everything - Lew Tucker
Cloud Computing, SDN, Big Data and Internet of Everything - Lew TuckerLew Tucker
 
The Evolution of Big Data Frameworks
The Evolution of Big Data FrameworksThe Evolution of Big Data Frameworks
The Evolution of Big Data FrameworkseXascale Infolab
 
Trivento summercamp masterclass 9/9/2016
Trivento summercamp masterclass 9/9/2016Trivento summercamp masterclass 9/9/2016
Trivento summercamp masterclass 9/9/2016Stavros Kontopoulos
 
Trivento summercamp fast data 9/9/2016
Trivento summercamp fast data 9/9/2016Trivento summercamp fast data 9/9/2016
Trivento summercamp fast data 9/9/2016Stavros Kontopoulos
 
Big Data for Defense and Security
Big Data for Defense and SecurityBig Data for Defense and Security
Big Data for Defense and SecurityEMC
 
Big data overview external
Big data overview externalBig data overview external
Big data overview externalBrett Colbert
 
re:Invent re:Cap - Big Data & IoT at Any Scale
re:Invent re:Cap - Big Data & IoT at Any Scalere:Invent re:Cap - Big Data & IoT at Any Scale
re:Invent re:Cap - Big Data & IoT at Any ScaleAdrian Hornsby
 
Hyper-Converged Infrastructure: Big Data and IoT opportunities and challenges...
Hyper-Converged Infrastructure: Big Data and IoT opportunities and challenges...Hyper-Converged Infrastructure: Big Data and IoT opportunities and challenges...
Hyper-Converged Infrastructure: Big Data and IoT opportunities and challenges...Andrei Khurshudov
 
Voxxed days thessaloniki 21/10/2016 - Streaming Engines for Big Data
Voxxed days thessaloniki 21/10/2016 - Streaming Engines for Big DataVoxxed days thessaloniki 21/10/2016 - Streaming Engines for Big Data
Voxxed days thessaloniki 21/10/2016 - Streaming Engines for Big DataStavros Kontopoulos
 
Big data-analytics-cpe8035
Big data-analytics-cpe8035Big data-analytics-cpe8035
Big data-analytics-cpe8035Neelam Rawat
 
Bigdatacooltools
BigdatacooltoolsBigdatacooltools
Bigdatacooltoolssuresh sood
 
The rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingThe rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingMinhazul Arefin
 
How IOT & Big Data will shape up Future Economies?
How IOT & Big Data will shape up Future Economies?How IOT & Big Data will shape up Future Economies?
How IOT & Big Data will shape up Future Economies?Srinath Perera
 
Impact of big data on analytics
Impact of big data on analyticsImpact of big data on analytics
Impact of big data on analyticsCapgemini
 

Was ist angesagt? (20)

Innovating With Data and Analytics
Innovating With Data and AnalyticsInnovating With Data and Analytics
Innovating With Data and Analytics
 
Big data privacy issues in public social media
Big data privacy issues in public social mediaBig data privacy issues in public social media
Big data privacy issues in public social media
 
Cloud Computing, SDN, Big Data and Internet of Everything - Lew Tucker
Cloud Computing, SDN, Big Data and Internet of Everything - Lew TuckerCloud Computing, SDN, Big Data and Internet of Everything - Lew Tucker
Cloud Computing, SDN, Big Data and Internet of Everything - Lew Tucker
 
Bigdata analytics
Bigdata analyticsBigdata analytics
Bigdata analytics
 
The Evolution of Big Data Frameworks
The Evolution of Big Data FrameworksThe Evolution of Big Data Frameworks
The Evolution of Big Data Frameworks
 
Big Data and Fast Data combined – is it possible?
Big Data and Fast Data combined – is it possible?Big Data and Fast Data combined – is it possible?
Big Data and Fast Data combined – is it possible?
 
Trivento summercamp masterclass 9/9/2016
Trivento summercamp masterclass 9/9/2016Trivento summercamp masterclass 9/9/2016
Trivento summercamp masterclass 9/9/2016
 
Trivento summercamp fast data 9/9/2016
Trivento summercamp fast data 9/9/2016Trivento summercamp fast data 9/9/2016
Trivento summercamp fast data 9/9/2016
 
Big Data for Defense and Security
Big Data for Defense and SecurityBig Data for Defense and Security
Big Data for Defense and Security
 
Big data overview external
Big data overview externalBig data overview external
Big data overview external
 
re:Invent re:Cap - Big Data & IoT at Any Scale
re:Invent re:Cap - Big Data & IoT at Any Scalere:Invent re:Cap - Big Data & IoT at Any Scale
re:Invent re:Cap - Big Data & IoT at Any Scale
 
Hyper-Converged Infrastructure: Big Data and IoT opportunities and challenges...
Hyper-Converged Infrastructure: Big Data and IoT opportunities and challenges...Hyper-Converged Infrastructure: Big Data and IoT opportunities and challenges...
Hyper-Converged Infrastructure: Big Data and IoT opportunities and challenges...
 
Voxxed days thessaloniki 21/10/2016 - Streaming Engines for Big Data
Voxxed days thessaloniki 21/10/2016 - Streaming Engines for Big DataVoxxed days thessaloniki 21/10/2016 - Streaming Engines for Big Data
Voxxed days thessaloniki 21/10/2016 - Streaming Engines for Big Data
 
Big data-analytics-cpe8035
Big data-analytics-cpe8035Big data-analytics-cpe8035
Big data-analytics-cpe8035
 
Bigdatacooltools
BigdatacooltoolsBigdatacooltools
Bigdatacooltools
 
The rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingThe rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computing
 
How IOT & Big Data will shape up Future Economies?
How IOT & Big Data will shape up Future Economies?How IOT & Big Data will shape up Future Economies?
How IOT & Big Data will shape up Future Economies?
 
A Big Data Concept
A Big Data ConceptA Big Data Concept
A Big Data Concept
 
Impact of big data on analytics
Impact of big data on analyticsImpact of big data on analytics
Impact of big data on analytics
 
Datapreneurs
DatapreneursDatapreneurs
Datapreneurs
 

Ähnlich wie Streaming Pipelines in Kubernetes Using Apache Pulsar, Heron and BookKeeper

DataEd Online: Demystifying Big Data
DataEd Online: Demystifying Big DataDataEd Online: Demystifying Big Data
DataEd Online: Demystifying Big DataDATAVERSITY
 
Data-Ed: Demystifying Big Data
Data-Ed: Demystifying Big DataData-Ed: Demystifying Big Data
Data-Ed: Demystifying Big DataData Blueprint
 
Analytics in IoT
Analytics in IoTAnalytics in IoT
Analytics in IoTwesley Dias
 
Event Data Processing with Streamlio
Event Data Processing with StreamlioEvent Data Processing with Streamlio
Event Data Processing with StreamlioStreamlio
 
Harness the Power of Big Data with Oracle
Harness the Power of Big Data with OracleHarness the Power of Big Data with Oracle
Harness the Power of Big Data with OracleSai Janakiram Penumuru
 
Modern data integration | Diyotta
Modern data integration | Diyotta Modern data integration | Diyotta
Modern data integration | Diyotta diyotta
 
Sean gately internet of things
Sean gately   internet of thingsSean gately   internet of things
Sean gately internet of thingsProductCamp SoCal
 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its ChallengesKathirvel Ayyaswamy
 
Presentación Marcos Grilanda /Amazon Web Services - eCommerce Day Santiago 2017
Presentación Marcos Grilanda /Amazon Web Services - eCommerce Day Santiago 2017Presentación Marcos Grilanda /Amazon Web Services - eCommerce Day Santiago 2017
Presentación Marcos Grilanda /Amazon Web Services - eCommerce Day Santiago 2017eCommerce Institute
 
Move It Don't Lose It: Is Your Big Data Collecting Dust?
Move It Don't Lose It: Is Your Big Data Collecting Dust?Move It Don't Lose It: Is Your Big Data Collecting Dust?
Move It Don't Lose It: Is Your Big Data Collecting Dust?Jennifer Walker
 
Guest Lecture: Introduction to Big Data at Indian Institute of Technology
Guest Lecture: Introduction to Big Data at Indian Institute of TechnologyGuest Lecture: Introduction to Big Data at Indian Institute of Technology
Guest Lecture: Introduction to Big Data at Indian Institute of TechnologyNishant Gandhi
 
Matthew Johnston - Big Data Futures Outlook BCM
Matthew Johnston - Big Data Futures Outlook BCMMatthew Johnston - Big Data Futures Outlook BCM
Matthew Johnston - Big Data Futures Outlook BCMHoi Lan Leong
 
Revolution of Quantum Computing in AI Era
Revolution of Quantum Computing in AI EraRevolution of Quantum Computing in AI Era
Revolution of Quantum Computing in AI EraPrinceBarpaga
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...Geoffrey Fox
 

Ähnlich wie Streaming Pipelines in Kubernetes Using Apache Pulsar, Heron and BookKeeper (20)

Big Data et eGovernment
Big Data et eGovernmentBig Data et eGovernment
Big Data et eGovernment
 
DataEd Online: Demystifying Big Data
DataEd Online: Demystifying Big DataDataEd Online: Demystifying Big Data
DataEd Online: Demystifying Big Data
 
Data-Ed: Demystifying Big Data
Data-Ed: Demystifying Big DataData-Ed: Demystifying Big Data
Data-Ed: Demystifying Big Data
 
Analytics in IoT
Analytics in IoTAnalytics in IoT
Analytics in IoT
 
Event Data Processing with Streamlio
Event Data Processing with StreamlioEvent Data Processing with Streamlio
Event Data Processing with Streamlio
 
S B Goyal
S B GoyalS B Goyal
S B Goyal
 
Harness the Power of Big Data with Oracle
Harness the Power of Big Data with OracleHarness the Power of Big Data with Oracle
Harness the Power of Big Data with Oracle
 
Modern data integration | Diyotta
Modern data integration | Diyotta Modern data integration | Diyotta
Modern data integration | Diyotta
 
Sean gately internet of things
Sean gately   internet of thingsSean gately   internet of things
Sean gately internet of things
 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its Challenges
 
Presentación Marcos Grilanda /Amazon Web Services - eCommerce Day Santiago 2017
Presentación Marcos Grilanda /Amazon Web Services - eCommerce Day Santiago 2017Presentación Marcos Grilanda /Amazon Web Services - eCommerce Day Santiago 2017
Presentación Marcos Grilanda /Amazon Web Services - eCommerce Day Santiago 2017
 
Information Overload Phenomena
Information Overload PhenomenaInformation Overload Phenomena
Information Overload Phenomena
 
Move It Don't Lose It: Is Your Big Data Collecting Dust?
Move It Don't Lose It: Is Your Big Data Collecting Dust?Move It Don't Lose It: Is Your Big Data Collecting Dust?
Move It Don't Lose It: Is Your Big Data Collecting Dust?
 
Beyond Online PDFs
Beyond Online PDFs Beyond Online PDFs
Beyond Online PDFs
 
Guest Lecture: Introduction to Big Data at Indian Institute of Technology
Guest Lecture: Introduction to Big Data at Indian Institute of TechnologyGuest Lecture: Introduction to Big Data at Indian Institute of Technology
Guest Lecture: Introduction to Big Data at Indian Institute of Technology
 
Matthew Johnston - Big Data Futures Outlook BCM
Matthew Johnston - Big Data Futures Outlook BCMMatthew Johnston - Big Data Futures Outlook BCM
Matthew Johnston - Big Data Futures Outlook BCM
 
Revolution of Quantum Computing in AI Era
Revolution of Quantum Computing in AI EraRevolution of Quantum Computing in AI Era
Revolution of Quantum Computing in AI Era
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
 
Big Data
Big DataBig Data
Big Data
 
IBM Research Future Internet
IBM Research Future InternetIBM Research Future Internet
IBM Research Future Internet
 

Mehr von Karthik Ramasamy

Scaling Apache Pulsar to 10 PB/day
Scaling Apache Pulsar to 10 PB/dayScaling Apache Pulsar to 10 PB/day
Scaling Apache Pulsar to 10 PB/dayKarthik Ramasamy
 
Pulsar summit-keynote-final
Pulsar summit-keynote-finalPulsar summit-keynote-final
Pulsar summit-keynote-finalKarthik Ramasamy
 
Apache Pulsar Seattle - Meetup
Apache Pulsar Seattle - MeetupApache Pulsar Seattle - Meetup
Apache Pulsar Seattle - MeetupKarthik Ramasamy
 
Unifying Messaging, Queueing & Light Weight Compute Using Apache Pulsar
Unifying Messaging, Queueing & Light Weight Compute Using Apache PulsarUnifying Messaging, Queueing & Light Weight Compute Using Apache Pulsar
Unifying Messaging, Queueing & Light Weight Compute Using Apache PulsarKarthik Ramasamy
 
Creating Data Fabric for #IOT with Apache Pulsar
Creating Data Fabric for #IOT with Apache PulsarCreating Data Fabric for #IOT with Apache Pulsar
Creating Data Fabric for #IOT with Apache PulsarKarthik Ramasamy
 
Linked In Stream Processing Meetup - Apache Pulsar
Linked In Stream Processing Meetup - Apache PulsarLinked In Stream Processing Meetup - Apache Pulsar
Linked In Stream Processing Meetup - Apache PulsarKarthik Ramasamy
 
Exactly once in Apache Heron
Exactly once in Apache HeronExactly once in Apache Heron
Exactly once in Apache HeronKarthik Ramasamy
 
Twitter's Real Time Stack - Processing Billions of Events Using Distributed L...
Twitter's Real Time Stack - Processing Billions of Events Using Distributed L...Twitter's Real Time Stack - Processing Billions of Events Using Distributed L...
Twitter's Real Time Stack - Processing Billions of Events Using Distributed L...Karthik Ramasamy
 
Storm@Twitter, SIGMOD 2014 paper
Storm@Twitter, SIGMOD 2014 paperStorm@Twitter, SIGMOD 2014 paper
Storm@Twitter, SIGMOD 2014 paperKarthik Ramasamy
 
Storm@Twitter, SIGMOD 2014
Storm@Twitter, SIGMOD 2014Storm@Twitter, SIGMOD 2014
Storm@Twitter, SIGMOD 2014Karthik Ramasamy
 

Mehr von Karthik Ramasamy (11)

Scaling Apache Pulsar to 10 PB/day
Scaling Apache Pulsar to 10 PB/dayScaling Apache Pulsar to 10 PB/day
Scaling Apache Pulsar to 10 PB/day
 
Apache Pulsar @Splunk
Apache Pulsar @SplunkApache Pulsar @Splunk
Apache Pulsar @Splunk
 
Pulsar summit-keynote-final
Pulsar summit-keynote-finalPulsar summit-keynote-final
Pulsar summit-keynote-final
 
Apache Pulsar Seattle - Meetup
Apache Pulsar Seattle - MeetupApache Pulsar Seattle - Meetup
Apache Pulsar Seattle - Meetup
 
Unifying Messaging, Queueing & Light Weight Compute Using Apache Pulsar
Unifying Messaging, Queueing & Light Weight Compute Using Apache PulsarUnifying Messaging, Queueing & Light Weight Compute Using Apache Pulsar
Unifying Messaging, Queueing & Light Weight Compute Using Apache Pulsar
 
Creating Data Fabric for #IOT with Apache Pulsar
Creating Data Fabric for #IOT with Apache PulsarCreating Data Fabric for #IOT with Apache Pulsar
Creating Data Fabric for #IOT with Apache Pulsar
 
Linked In Stream Processing Meetup - Apache Pulsar
Linked In Stream Processing Meetup - Apache PulsarLinked In Stream Processing Meetup - Apache Pulsar
Linked In Stream Processing Meetup - Apache Pulsar
 
Exactly once in Apache Heron
Exactly once in Apache HeronExactly once in Apache Heron
Exactly once in Apache Heron
 
Twitter's Real Time Stack - Processing Billions of Events Using Distributed L...
Twitter's Real Time Stack - Processing Billions of Events Using Distributed L...Twitter's Real Time Stack - Processing Billions of Events Using Distributed L...
Twitter's Real Time Stack - Processing Billions of Events Using Distributed L...
 
Storm@Twitter, SIGMOD 2014 paper
Storm@Twitter, SIGMOD 2014 paperStorm@Twitter, SIGMOD 2014 paper
Storm@Twitter, SIGMOD 2014 paper
 
Storm@Twitter, SIGMOD 2014
Storm@Twitter, SIGMOD 2014Storm@Twitter, SIGMOD 2014
Storm@Twitter, SIGMOD 2014
 

Kürzlich hochgeladen

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 

Kürzlich hochgeladen (20)

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 

Streaming Pipelines in Kubernetes Using Apache Pulsar, Heron and BookKeeper