10. Infrastructure
• Publish-subscribe messaging
• Implemented as distributed commit log
• Fast: hundreds of MB/s of reads and writes from thousands of clients
• Scalable: elastically and transparently
expanded without downtime
• Durable: Messages persisted on disk
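To make the publish side concrete, here is a minimal sketch of a Kafka producer configuration; the broker address (localhost:9092) and the choice of string serializers are assumptions for illustration, not taken from the deck. Setting acks to "all" ties into the durability point above: the producer waits until the message is committed to the log.

```java
import java.util.Properties;

public class KafkaProducerConfig {

    // Minimal producer configuration sketch. The broker address and the
    // serializer classes are assumptions; adjust them to your cluster.
    public static Properties producerProperties() {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092"); // assumed local broker
        props.put("acks", "all"); // wait for the commit log write (durability)
        props.put("key.serializer",
                "org.apache.kafka.common.serialization.StringSerializer");
        props.put("value.serializer",
                "org.apache.kafka.common.serialization.StringSerializer");
        return props;
    }

    public static void main(String[] args) {
        System.out.println(producerProperties().getProperty("acks"));
    }
}
```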
18. Infrastructure
• Shard:
– Group of data records in a stream
– 1 MB write per second
– 2 MB read per second
– 1,000 PUT records per second
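The per-shard limits above translate directly into a sizing rule: a stream needs enough shards to cover the write rate, the read rate, and the record rate independently, so the required count is the maximum of the three. A minimal sketch, assuming the limits quoted on the slide:

```java
public class ShardSizing {

    // Per-shard limits from the slide: 1 MB/s write, 2 MB/s read,
    // 1,000 PUT records per second.
    public static int requiredShards(double writeMBps, double readMBps,
                                     double recordsPerSec) {
        int byWrite = (int) Math.ceil(writeMBps / 1.0);
        int byRead = (int) Math.ceil(readMBps / 2.0);
        int byRecords = (int) Math.ceil(recordsPerSec / 1000.0);
        return Math.max(1, Math.max(byWrite, Math.max(byRead, byRecords)));
    }

    public static void main(String[] args) {
        // 5 MB/s in, 8 MB/s out, 3,500 puts/s -> max(5, 4, 4) = 5 shards
        System.out.println(requiredShards(5, 8, 3500));
    }
}
```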
20. Infrastructure
• Package application with dependencies
• Standardized unit for software development
• Layered filesystem, share common files
• Isolate applications from each other
22. Infrastructure
• Docker container: a stripped-to-basics version of a Linux operating system
• Docker image: the software you load into a container
23. Infrastructure
• Docker image built with a Dockerfile
• Docker images are built using “inheritance”
• A custom image is based on a “base image”
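A hypothetical Dockerfile along these lines; the image and JAR names are invented for illustration, and a real image would also install a JVM first, as the deployment slides later do with Oracle Java 8:

```dockerfile
# "Inheritance": this custom image starts FROM a base image
# and adds its own layers on top of the shared filesystem layers.
FROM phusion/baseimage
# A real image would install a JVM here before adding the application.
# Application layer (JAR name is an assumption):
COPY producer-fat.jar /opt/producer.jar
CMD ["java", "-jar", "/opt/producer.jar"]
```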
26. Software & Frameworks
• Toolkit for reactive applications
• Based on the JVM
• Event driven and non-blocking
• Polyglot (Java, JS, Groovy, Ruby)
• Lightweight and modular
36. Software & Frameworks
• Language-neutral
• Platform-neutral
• Extensible mechanism for serializing structured data
• Support for Java, Python, and C++
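A hypothetical .proto sketch of such a language-neutral definition; the message and field names are invented for illustration. The generated Java, Python, and C++ classes all serialize to the same wire format.

```protobuf
// Illustrative message definition (names are assumptions).
syntax = "proto3";

message SensorRecord {
  // Field numbers, not names, identify fields on the wire,
  // which is what makes the format extensible.
  string sensor_id = 1;
  int64 timestamp = 2;
  double value = 3;
}
```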
56. Deployment
• Why Spotify Kafka Docker Image?
– Kafka depends on ZooKeeper
– The Spotify image runs Kafka and ZooKeeper together
– No dependency on an external ZooKeeper
– Runs out of the box
60. Deployment
• Vert.x Producer Docker Container
– Based on phusion/baseimage
– Installs Oracle Java 8
– Adds the producer fat JAR
– Starts the fat JAR
61. Deployment
• Requirements for AWS:
– VPC
– IAM role for Kinesis access from EC2
– IAM role for Kinesis access from Lambda
– EC2 instance
– Kinesis stream
– Lambda package
64. Deployment
• In AWS
– Create Lambda function
• Upload JAR to S3 bucket
• Specify function
• Add event source (SUMMIT_STREAM)
67. Deployment
• In AWS
– Start an EC2 instance
• t2.small is sufficient
• Install Docker and run the container via EC2 user data
• Important: select correct IAM role
68. Deployment
• EC2 User Data
#!/bin/bash -ex
yum -y update
yum install docker -y
service docker start
docker run autoscaling/ingestion-service
72. Putting it all together
• Integration of Kinesis and Kafka
– A Kinesis consumer processes the records
– Processed records are forwarded to Kafka
– AWS Lambda would be a perfect choice
– Problem: Lambda could not reach resources inside a VPC*
73. Putting it all together
• Integration testing Kinesis and Kafka
– AWS API:
• Create Kinesis stream in @BeforeClass
• Produce data and write into stream
• Delete stream in @AfterClass
76. Putting it all together
• Integration testing Kinesis and Kafka
– Spotify Docker Client
• Run Spotify Kafka container in @BeforeClass
• Produce data and write into stream
• Stop Spotify Kafka container in @AfterClass
78. Putting it all together
• Integration tests: Kinesis and Kafka
– Put messages into Kinesis
– Consume the messages in the application
– Put the messages into Kafka
– Consume the messages from Kafka
– Compare sent and received messages
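The five steps above can be sketched structurally. The two in-memory queues below are stand-ins (assumptions) for the real Kinesis stream and Kafka topic, which the actual tests reach through the AWS SDK and a Kafka consumer:

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.Objects;

public class PipelineCheck {

    // Structural sketch of the round trip: Kinesis -> application -> Kafka,
    // then compare what was sent with what arrived.
    public static boolean roundTrip(String message) {
        Deque<String> kinesis = new ArrayDeque<>(); // stand-in for the Kinesis stream
        Deque<String> kafka = new ArrayDeque<>();   // stand-in for the Kafka topic

        kinesis.add(message);                 // 1. put message into Kinesis
        String consumed = kinesis.poll();     // 2. application consumes it
        kafka.add(consumed);                  // 3. ...and forwards it to Kafka
        String received = kafka.poll();       // 4. test consumes from Kafka
        return Objects.equals(message, received); // 5. compare messages
    }

    public static void main(String[] args) {
        System.out.println(roundTrip("hello"));
    }
}
```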
79. Putting it all together
• Integration tests: Kinesis and Kafka
– After tests: clean up infrastructure
– Very cost effective
– Real world tests without mocking
– Quite fast
80. Recap
• What have we achieved today?
– We created a distributed, message-driven system
– Based on the JVM and Docker
– Running locally and on AWS
84. Resources
• EC2 User Data
– https://gist.github.com/SaschaMoellering/c6ee24ec999325c43e90
• EC2 User Role
– https://gist.github.com/SaschaMoellering/a971fb73626f41ad80f4
85. Resources
• Lambda User Role
– https://gist.github.com/SaschaMoellering/b14540b144263e5fea4b