SMACK is a combination of Spark, Mesos, Akka, Cassandra, and Kafka. It is a pipelined data architecture for real-time data analysis, integrating each technology at the right place to build an efficient data pipeline.
2. Agenda:
● What is SMACK?
● Why SMACK?
● Brief introduction of technologies
● How to integrate all the technologies into a data pipeline
● Demo
3. What is SMACK?
● Spark: Apache Spark is a fast, general-purpose cluster computing system.
● Mesos: a cluster resource management system that provides efficient resource allocation.
● Akka: a toolkit and runtime for building highly concurrent, distributed, and resilient message-driven applications on the JVM.
● Cassandra: the Apache Cassandra database is the right choice when you need scalability and high availability.
● Kafka: a distributed messaging system for handling real-time data.
4. Why SMACK?
● SMACK provides a pipelined data architecture, which is required for real-time data analysis.
● SMACK integrates each technology at the right place to build an efficient data pipeline.
● SMACK lets you linearly scale your whole cluster without any hassle.
6. Why Spark?
● It is a general-purpose big-data processing engine with four main components: Spark Core, Spark Streaming, Spark MLlib, and Spark GraphX.
● We can process our data with any of these components in real time.
● It provides fault tolerance for real-time applications.
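As a concrete sketch of real-time processing with the Spark Streaming component, here is a minimal word count over micro-batches. This assumes spark-streaming on the classpath and a socket source at localhost:9999; both are illustrative choices, not from the slides.

```scala
// Sketch only: source host/port and batch interval are illustrative.
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

object StreamingWordCount {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("StreamingWordCount")
    val ssc  = new StreamingContext(conf, Seconds(5)) // 5-second micro-batches

    // Read lines from a TCP source and count words in each batch.
    val lines  = ssc.socketTextStream("localhost", 9999)
    val counts = lines.flatMap(_.split("\\s+"))
                      .map(word => (word, 1))
                      .reduceByKey(_ + _)
    counts.print()

    ssc.start()
    ssc.awaitTermination()
  }
}
```

If a Spark Streaming receiver fails, the micro-batch model lets Spark recompute lost work from the lineage, which is where the fault tolerance for real-time applications comes from.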
7. Why Cassandra?
● Cassandra implements "no single point of failure".
● Cassandra's write path is very fast, so it can handle real-time data easily.
● It supports a multi-datacenter architecture, so we can easily use different DCs for different workloads.
[Diagram: a Cassandra cluster spanning an Ingestion DC and an Analysis DC]
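The multi-datacenter layout on the slide maps directly to a keyspace with NetworkTopologyStrategy, which lets each DC keep its own replica count. A minimal sketch, assuming the DataStax Java driver 3.x on the classpath; the keyspace, DC names, and replica counts are illustrative:

```scala
// Sketch only: contact point, keyspace and DC names are assumptions.
import com.datastax.driver.core.Cluster

object MultiDcKeyspace {
  def main(args: Array[String]): Unit = {
    val cluster = Cluster.builder().addContactPoint("127.0.0.1").build()
    val session = cluster.connect()

    // NetworkTopologyStrategy assigns replicas per datacenter, so the
    // ingestion DC and the analysis DC can be tuned independently.
    session.execute(
      """CREATE KEYSPACE IF NOT EXISTS pipeline
        |WITH replication = {
        |  'class': 'NetworkTopologyStrategy',
        |  'ingestion_dc': 3,
        |  'analysis_dc': 2
        |}""".stripMargin)

    cluster.close()
  }
}
```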
9. Models in SMACK
● In SMACK, the model layer is built with Scala and Akka.
● We can use these models to write highly concurrent and parallel applications.
● Example: we can pick Akka modules according to our use case, such as akka-http, the Akka scheduler, and Akka priority mailboxes.
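A minimal Akka actor illustrates the concurrency model behind this layer. This assumes akka-actor (classic actors) on the classpath; the system, actor, and message names are illustrative:

```scala
// Sketch only: all names here are illustrative.
import akka.actor.{Actor, ActorSystem, Props}

class EventHandler extends Actor {
  def receive: Receive = {
    case msg: String => println(s"handled: $msg")
  }
}

object Main extends App {
  val system  = ActorSystem("pipeline")
  val handler = system.actorOf(Props[EventHandler], "eventHandler")

  // Each actor processes one message at a time, so many actors can run
  // concurrently without shared-state locking.
  handler ! "sensor-reading-1"
  handler ! "sensor-reading-2"
}
```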
11. Why Kafka
● To move streams of data efficiently and in real time.
● To provide fault tolerance.
● To create a bridge between two applications.
[Diagram: Streaming Source → Kafka Broker → Spark Receiver]
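The "bridge" role in the diagram comes down to producing records to a topic that another application consumes. A minimal producer sketch, assuming kafka-clients on the classpath and a broker at localhost:9092; the topic name and record are illustrative:

```scala
// Sketch only: broker address, topic, and record values are assumptions.
import java.util.Properties
import org.apache.kafka.clients.producer.{KafkaProducer, ProducerRecord}

object EventProducer {
  def main(args: Array[String]): Unit = {
    val props = new Properties()
    props.put("bootstrap.servers", "localhost:9092")
    props.put("key.serializer",   "org.apache.kafka.common.serialization.StringSerializer")
    props.put("value.serializer", "org.apache.kafka.common.serialization.StringSerializer")

    val producer = new KafkaProducer[String, String](props)
    // The broker stores the record durably, decoupling this producer from
    // whatever consumes the "events" topic (e.g. a Spark receiver).
    producer.send(new ProducerRecord[String, String]("events", "key-1", "hello"))
    producer.close()
  }
}
```

Because the broker persists records, a slow or restarted consumer can resume from its last offset, which is what gives the bridge its fault tolerance.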
12. Architecture of Spark and Cassandra
[Diagram: Spark Workers co-located with a Cassandra cluster — each Spark worker node reads data from its local node, avoiding network latency]
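Reading Cassandra data from the local node is what the spark-cassandra-connector does for you. A minimal sketch, assuming the connector on the classpath; the keyspace and table names are illustrative:

```scala
// Sketch only: connection host, keyspace, and table are assumptions.
import org.apache.spark.{SparkConf, SparkContext}
import com.datastax.spark.connector._

object LocalReads {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("LocalReads")
      .set("spark.cassandra.connection.host", "127.0.0.1")
    val sc = new SparkContext(conf)

    // The connector maps Cassandra token ranges to Spark partitions and
    // prefers placing tasks on workers that own the data, so most reads
    // stay on the local node.
    val rows = sc.cassandraTable("pipeline", "events")
    println(rows.count())
  }
}
```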
13. Spark, Mesos, Cassandra
Mesos slaves and Cassandra nodes are co-located to enforce better data locality for Spark.
[Diagram: Driver Program → Mesos Master → three Mesos slaves, each co-located with a Cassandra node]
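Tying the pieces together, the driver only needs a Mesos master URL for Spark to launch executors on the slaves, which then read from their co-located Cassandra nodes. A minimal configuration sketch; the master URL and connection host are illustrative and assume a running Mesos master with Spark's Mesos support available:

```scala
// Sketch only: master URL and Cassandra host are assumptions.
import org.apache.spark.{SparkConf, SparkContext}

object SparkOnMesos {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("SparkOnMesos")
      .setMaster("mesos://mesos-master:5050") // Mesos allocates the executors
      .set("spark.cassandra.connection.host", "127.0.0.1")
    val sc = new SparkContext(conf)

    // Executors launched on Mesos slaves that also run Cassandra nodes
    // read mostly from their co-located replica, giving the data locality
    // shown in the diagram.
    sc.stop()
  }
}
```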