SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
WEBINAR
How To Get Monitoring Right For Streaming 

And Fast Data Systems Built With Spark, 

Mesos, Akka, Cassandra and Kafka
Paul Jasek, Senior Director of Global Solution Architects
Agenda
1. Fast Data & Streaming Applications
2. The Challenges of Monitoring Fast Data Applications	
3. What To Look For In a Fast Data Application
4. Intelligent End-To-End Monitoring from Lightbend	
5. Live Demo
6. Questions
reactivemanifesto.org
Reactive Underpinnings: Fast Data and streaming applications
often incorporate, or are based on, Reactive principles
• Real-time	personalization		
• Real-time	decision-making		
• IoT	data	processing		
• Legacy	batch	processing	
modernization
Growing Number Of Use Cases Across Industries
• Serve	existing	customers	better	
and	reduce	churn	
• Attract	new	ones	and	drive	
growth	
• Launch	new	products	more	
easily		
• Enter	new	markets	more	quickly.
• Rapidly Evolving Ecosystem
• Understanding the Data Pipeline
• Dynamic Architectures
• Intricately Interconnected
• Distributed And Clustered
The Challenges of Monitoring Fast Data Applications
Apache Spark, As An Illustrative Example
The Challenges of Monitoring Fast Data Applications
Concern Questions	To	Ask
Data	Health	(for	a	
given	application)
• Throughput:	is	data	processing	occurring	at	the	expected	rate?		
• Latency:	is	data	processing	occurring	within	the	expected	timeframe?		
• Error/quality:	are	there	problems	with	the	data	being	produced?		
• Input	data:	are	input	data	streams	flowing	into	Spark	behaving	normally?	For	instance,	what	are	the	
throughput	rates	for	Kafka	topics	feeding	into	the	Spark	job?
Dependency	Health • Are	the	systems	feeding	input	into	the	storm	job	(such	as	Kafka)	healthy?		
• Are	the	systems	that	the	application	is	dependent	on,	such	as	Memcache	or	other	API	endpoints,	
healthy?	
Service	Health • Is	the	Spark	master	operating	normally?	If	not,	engineering	will	be	unable	to	re-balance	workloads	or	
restart	jobs.	
Application	Health • Are	the	application	KPIs	within	normal	operating	parameters?	
Topology	Health • Are	there	resources	assigned	to	the	given	Spark	topology?		
• •	Are	the	Spark	tasks	and	executors	well-distributed	amongst	the	Spark	cluster?		
• •	Are	the	performance	counters	(emitted,	failed,	latency,	etc.)	for	the	given	Spark	topology	normal?	
Node	System	Health • Are	the	key	system	metrics	(load,	CPU,	memory,	net-i/o,	disk-i/o,	disk	free)	operating	normally?
Can traditional monitoring tools help?
Why traditional monitoring tools won’t help you
• Built	to	monitor	monolithic	
applications	
• Can	only	be	used	to	extract	
metrics	and	trace	information	
based	on	a	synchronous	flow	
• Not	built	for	asynchronous	
flows	(i.e.	in	Fast	Data	and	
streaming	applications)	
• Cannot	easily	handle	streaming	
systems	running	on	distributed	
clusters
• Deep Telemetry
• Domain Expertise	
• Automated Discovery
• Real-Time Topology Visualization
• Intelligent, Rapid Troubleshooting
What users need to effectively monitor Fast Data and
streaming applications
• Lightbend Monitoring takes a modern approach to instrumenting and
visualizing distributed streaming systems	
• Helps users not just in production but also in development (so they can
build their applications right from Day 1) 	
• Shows the end-to-end status of applications, data frameworks, and the
associated infrastructure in a single view.
Intelligent, End-To-End Monitoring
• Deep Telemetry	
• Domain Expertise
• Intelligent Anomaly
Detection
• Fine-Grained
Visibility, with Drill-
Down Capabilities
Data-Science Driven Anomaly Detection
• Automated Topology
Discovery
• Automatic Metric
Collection
• Real-Time Topology
Visualization
Automated Discovery, Configuration & Topology
Visualization
• Single Pane of Glass
Visibility
• Rapid Root Cause
Analysis
• Reduced Mean-Time-
To-Repair (MTTR)
Intelligent, Rapid Troubleshooting
• Dramatically reduce the time and cost to identify and remediate issues across
application life-cycle.

• Create happier, more satisfied customers – and lower churn	
• Lower HW/infrastructure costs and reduce concerns about chargebacks & SLA
penalties
• Deliver rapid time to value because everything you need for monitoring is packaged
into an easy-to-use solution
Benefits for your business
On to the demo…
Upgrade your grey matter!

Get the free O’Reilly book by Dr. Dean Wampler, 

VP of Fast Data Engineering at Lightbend
bit.ly/lightbend-fast-data
End-To-End Monitoring For Your Fast Data 

And Streaming Applications From Lightbend
SET UP A 20-MIN DEMO

Weitere ähnliche Inhalte

Mehr von Lightbend

Machine Learning At Speed: Operationalizing ML For Real-Time Data Streams
Machine Learning At Speed: Operationalizing ML For Real-Time Data StreamsMachine Learning At Speed: Operationalizing ML For Real-Time Data Streams
Machine Learning At Speed: Operationalizing ML For Real-Time Data Streams
Lightbend
 

Mehr von Lightbend (20)

Detecting Real-Time Financial Fraud with Cloudflow on Kubernetes
Detecting Real-Time Financial Fraud with Cloudflow on KubernetesDetecting Real-Time Financial Fraud with Cloudflow on Kubernetes
Detecting Real-Time Financial Fraud with Cloudflow on Kubernetes
 
Cloudstate - Towards Stateful Serverless
Cloudstate - Towards Stateful ServerlessCloudstate - Towards Stateful Serverless
Cloudstate - Towards Stateful Serverless
 
Digital Transformation from Monoliths to Microservices to Serverless and Beyond
Digital Transformation from Monoliths to Microservices to Serverless and BeyondDigital Transformation from Monoliths to Microservices to Serverless and Beyond
Digital Transformation from Monoliths to Microservices to Serverless and Beyond
 
Akka Anti-Patterns, Goodbye: Six Features of Akka 2.6
Akka Anti-Patterns, Goodbye: Six Features of Akka 2.6Akka Anti-Patterns, Goodbye: Six Features of Akka 2.6
Akka Anti-Patterns, Goodbye: Six Features of Akka 2.6
 
Lessons From HPE: From Batch To Streaming For 20 Billion Sensors With Lightbe...
Lessons From HPE: From Batch To Streaming For 20 Billion Sensors With Lightbe...Lessons From HPE: From Batch To Streaming For 20 Billion Sensors With Lightbe...
Lessons From HPE: From Batch To Streaming For 20 Billion Sensors With Lightbe...
 
How to build streaming data pipelines with Akka Streams, Flink, and Spark usi...
How to build streaming data pipelines with Akka Streams, Flink, and Spark usi...How to build streaming data pipelines with Akka Streams, Flink, and Spark usi...
How to build streaming data pipelines with Akka Streams, Flink, and Spark usi...
 
Microservices, Kubernetes, and Application Modernization Done Right
Microservices, Kubernetes, and Application Modernization Done RightMicroservices, Kubernetes, and Application Modernization Done Right
Microservices, Kubernetes, and Application Modernization Done Right
 
Full Stack Reactive In Practice
Full Stack Reactive In PracticeFull Stack Reactive In Practice
Full Stack Reactive In Practice
 
Akka and Kubernetes: A Symbiotic Love Story
Akka and Kubernetes: A Symbiotic Love StoryAkka and Kubernetes: A Symbiotic Love Story
Akka and Kubernetes: A Symbiotic Love Story
 
Scala 3 Is Coming: Martin Odersky Shares What To Know
Scala 3 Is Coming: Martin Odersky Shares What To KnowScala 3 Is Coming: Martin Odersky Shares What To Know
Scala 3 Is Coming: Martin Odersky Shares What To Know
 
Migrating From Java EE To Cloud-Native Reactive Systems
Migrating From Java EE To Cloud-Native Reactive SystemsMigrating From Java EE To Cloud-Native Reactive Systems
Migrating From Java EE To Cloud-Native Reactive Systems
 
Running Kafka On Kubernetes With Strimzi For Real-Time Streaming Applications
Running Kafka On Kubernetes With Strimzi For Real-Time Streaming ApplicationsRunning Kafka On Kubernetes With Strimzi For Real-Time Streaming Applications
Running Kafka On Kubernetes With Strimzi For Real-Time Streaming Applications
 
Designing Events-First Microservices For A Cloud Native World
Designing Events-First Microservices For A Cloud Native WorldDesigning Events-First Microservices For A Cloud Native World
Designing Events-First Microservices For A Cloud Native World
 
Scala Security: Eliminate 200+ Code-Level Threats With Fortify SCA For Scala
Scala Security: Eliminate 200+ Code-Level Threats With Fortify SCA For ScalaScala Security: Eliminate 200+ Code-Level Threats With Fortify SCA For Scala
Scala Security: Eliminate 200+ Code-Level Threats With Fortify SCA For Scala
 
How To Build, Integrate, and Deploy Real-Time Streaming Pipelines On Kubernetes
How To Build, Integrate, and Deploy Real-Time Streaming Pipelines On KubernetesHow To Build, Integrate, and Deploy Real-Time Streaming Pipelines On Kubernetes
How To Build, Integrate, and Deploy Real-Time Streaming Pipelines On Kubernetes
 
A Glimpse At The Future Of Apache Spark 3.0 With Deep Learning And Kubernetes
A Glimpse At The Future Of Apache Spark 3.0 With Deep Learning And KubernetesA Glimpse At The Future Of Apache Spark 3.0 With Deep Learning And Kubernetes
A Glimpse At The Future Of Apache Spark 3.0 With Deep Learning And Kubernetes
 
Akka and Kubernetes: Reactive From Code To Cloud
Akka and Kubernetes: Reactive From Code To CloudAkka and Kubernetes: Reactive From Code To Cloud
Akka and Kubernetes: Reactive From Code To Cloud
 
Hands On With Spark: Creating A Fast Data Pipeline With Structured Streaming ...
Hands On With Spark: Creating A Fast Data Pipeline With Structured Streaming ...Hands On With Spark: Creating A Fast Data Pipeline With Structured Streaming ...
Hands On With Spark: Creating A Fast Data Pipeline With Structured Streaming ...
 
How Akka Works: Visualize And Demo Akka With A Raspberry-Pi Cluster
How Akka Works: Visualize And Demo Akka With A Raspberry-Pi ClusterHow Akka Works: Visualize And Demo Akka With A Raspberry-Pi Cluster
How Akka Works: Visualize And Demo Akka With A Raspberry-Pi Cluster
 
Machine Learning At Speed: Operationalizing ML For Real-Time Data Streams
Machine Learning At Speed: Operationalizing ML For Real-Time Data StreamsMachine Learning At Speed: Operationalizing ML For Real-Time Data Streams
Machine Learning At Speed: Operationalizing ML For Real-Time Data Streams
 

Kürzlich hochgeladen

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Kürzlich hochgeladen (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

How To Get Monitoring Right For Streaming & Fast Data Systems Built With Spark, Mesos, Akka, Cassandra & Kafka

  • 1. WEBINAR How To Get Monitoring Right For Streaming 
 And Fast Data Systems Built With Spark, 
 Mesos, Akka, Cassandra and Kafka Paul Jasek, Senior Director of Global Solution Architects
  • 2. Agenda 1. Fast Data & Streaming Applications 2. The Challenges of Monitoring Fast Data Applications 3. What To Look For In a Fast Data Application 4. Intelligent End-To-End Monitoring from Lightbend 5. Live Demo 6. Questions
  • 3.
  • 4.
  • 5. reactivemanifesto.org Reactive Underpinnings: Fast Data and streaming applications often incorporate, or are based on, Reactive principles
  • 6. • Real-time personalization • Real-time decision-making • IoT data processing • Legacy batch processing modernization Growing Number Of Use Cases Across Industries • Serve existing customers better and reduce churn • Attract new ones and drive growth • Launch new products more easily • Enter new markets more quickly.
  • 7. • Rapidly Evolving Ecosystem • Understanding the Data Pipeline • Dynamic Architectures • Intricately Interconnected • Distributed And Clustered The Challenges of Monitoring Fast Data Applications
  • 8. Apache Spark, As An Illustrative Example The Challenges of Monitoring Fast Data Applications Concern Questions To Ask Data Health (for a given application) • Throughput: is data processing occurring at the expected rate? • Latency: is data processing occurring within the expected timeframe? • Error/quality: are there problems with the data being produced? • Input data: are input data streams flowing into Spark behaving normally? For instance, what are the throughput rates for Kafka topics feeding into the Spark job? Dependency Health • Are the systems feeding input into the storm job (such as Kafka) healthy? • Are the systems that the application is dependent on, such as Memcache or other API endpoints, healthy? Service Health • Is the Spark master operating normally? If not, engineering will be unable to re-balance workloads or restart jobs. Application Health • Are the application KPIs within normal operating parameters? Topology Health • Are there resources assigned to the given Spark topology? • • Are the Spark tasks and executors well-distributed amongst the Spark cluster? • • Are the performance counters (emitted, failed, latency, etc.) for the given Spark topology normal? Node System Health • Are the key system metrics (load, CPU, memory, net-i/o, disk-i/o, disk free) operating normally?
  • 10. Why traditional monitoring tools won’t help you • Built to monitor monolithic applications • Can only be used to extract metrics and trace information based on a synchronous flow • Not built for asynchronous flows (i.e. in Fast Data and streaming applications) • Cannot easily handle streaming systems running on distributed clusters
  • 11. • Deep Telemetry • Domain Expertise • Automated Discovery • Real-Time Topology Visualization • Intelligent, Rapid Troubleshooting What users need to effectively monitor Fast Data and streaming applications
  • 12. • Lightbend Monitoring takes a modern approach to instrumenting and visualizing distributed streaming systems • Helps users not just in production but also in development (so they can build their applications right from Day 1) • Shows the end-to-end status of applications, data frameworks, and the associated infrastructure in a single view. Intelligent, End-To-End Monitoring
  • 13. • Deep Telemetry • Domain Expertise • Intelligent Anomaly Detection • Fine-Grained Visibility, with Drill- Down Capabilities Data-Science Driven Anomaly Detection
  • 14. • Automated Topology Discovery • Automatic Metric Collection • Real-Time Topology Visualization Automated Discovery, Configuration & Topology Visualization
  • 15. • Single Pane of Glass Visibility • Rapid Root Cause Analysis • Reduced Mean-Time- To-Repair (MTTR) Intelligent, Rapid Troubleshooting
  • 16. • Dramatically reduce the time and cost to identify and remediate issues across application life-cycle.
 • Create happier, more satisfied customers – and lower churn • Lower HW/infrastructure costs and reduce concerns about chargebacks & SLA penalties • Deliver rapid time to value because everything you need for monitoring is packaged into an easy-to-use solution Benefits for your business
  • 17. On to the demo…
  • 18. Upgrade your grey matter!
 Get the free O’Reilly book by Dr. Dean Wampler, 
 VP of Fast Data Engineering at Lightbend bit.ly/lightbend-fast-data
  • 19. End-To-End Monitoring For Your Fast Data 
 And Streaming Applications From Lightbend SET UP A 20-MIN DEMO