Microservices in a Streaming World

Microservices in a
Streaming World
Modern distributed applications and infrastructure

About me
Hans Jespersen
hans@confluent.io
https://www.linkedin.com/in/hansjespersen
https://github.com/hjespers
U of Waterloo – Punch Cards, COBOL, Assembler, RJE, Screen Scraping
AT&T – Unix, Client/Server, Tuxedo & Transactions, Pre-Web Internet
Sun – Solaris, ONC RPC, CORBA
TIBCO – Yahoo!Quotes, Pub/Sub, Multicast, JMS, ESB, SOA, WS-*
Solace – Messaging HW, MQTT ,AMQP, REST
Confluent – Kafka, Event Stream Processing
Personally - active in IoT, open source, MQTT

Not going to talk about...
Meaningless, vendor marketing,
definition of Micro Services

Let’s parse this out together…

https://youtu.be/_RgUxUTuxH4 – or Google “Kafka Summit Martin Fowler”

I am going to talk about...
Patterns in Modern Distributed
Architecture and Application
Infrastructure

You’ve probably heard of Microservices.

Are they SOA all over again?
SOA Microservices

We used to build monolithic applications.

These applications came in a variety of different forms.

But the teams in these companies were grouped into business areas. Silos.

So the applications would also form into silos along their business lines.

This led to a lot of duplication across different applications, in different silos.

The duplication made the IT function expensive to run.

What if they pulled out the common pieces into reusable “services”?
Service

Less code in each silo seemed like a good idea.
Service

Then, they realized they could build whole companies
based on these reusable services.

This seemed like a very good idea!

The trick was to rearrange the people to match the services.

So instead of sitting in silos…

The teams would match the architecture that they wanted to end up with.

SoA
This pattern was termed a
‘Service Oriented Architecture.’

Refactoring companies turned out to be slow and difficult.
Services

So people took the same pattern designed for big companies…

…and applied it to break up monolithic applications...

…knowing that with this architecture, the companies would be able to grow…

This was termed Microservices:
Change through small,
well-defined services
that are easy to reuse.

…and monoliths were broken up.

Evolving incrementally towards contemporary service estates.

But things were not entirely rosy yet.

While silos seemed expensive and wasteful, they had one big advantage...

Each application was free to handle change independently of
the applications around it.

Every application was, in some sense, an island.

Microservices are built with HTTP, REST, or some other protocol
made from requests and replies.
Request
Reply

This works well when ecosystems are small…
Buying
Widgets

But gets harder as they grow more complex and more interconnected.

Services are tightly coupled. No islands here!

…or even just runs slowly…
Buy Widgets

The fall-out could be much larger.

Others end up feeling that pain.
Buy Widgets

Yet, at company scale, the majority of processes run in the background anyway.
They are asynchronous to one another.
Online
Billing Inventory Fulfillment Fraud
Offline

So it makes sense to DECOUPLE services from one another.
Billing Inventory Fulfillment Fraud
Offline
Online
Decouple

Apache Kafka™ helps with this as it provides a data backbone for your services.
Billing Inventory Fulfillment Finance Fraud
HTTP etc
Offline
Online

This connects services together.

It also connects their DATA together.
All Your Data

The three tenets of messaging… embedded into a layer of permanence.
Decoupling Notification Data Transfer
Permanence

So every service gets the data it needs.
All Your Data

Unlike typical service frameworks, it also DECOUPLES services from one another,
so they can evolve independently.

This makes it easier to move away from legacy architectures,
to evolve away from the past…
All Your Data
LEGACY

…towards a better-factored future, whatever that may look like.
All Your Data
LEGACYNew Services
Analytics

So wherever your business ends up…
Cloud
Another Device
Another
Geography

Apache Kafka provides the Service Backbone built to handle today’s
data-centric world...
Big Data Ready

In a way that can adapt to your company's future, wherever you might take it.

Services built on the POWER and IMMEDIACY of an Event Streaming Platform.
Event Streaming Platform

So what is this Kafka thing you speak of so highly?

Traditional Messaging Functionality
Decoupling of Producers and Consumers

Message Exchange Patterns (MEP)
Topic = Publish/Subscribe - N of N delivery Queues = Point-to-Point - 1 of N delivery

Message Exchange Patterns (MEP)
Request/Reply
Content-Based Router

Message Delivery Semantics
At-Most-Once
“Best effort”
“Reliable”
QoS 0
At-Least-Once
“Guaranteed”
“Certified”
QoS 1
Exactly Once
“Once-and-Only-Once”
“Transactional”
QoS 2

Wide Spectrum of Messaging Offerings
Ultra- low Latency (often no broker in the middle)
High Volume (Persistent or Non-Persistent)
Highly Available (Clustered and Fault Tolerant)
Embedded Messaging (inside apps)
Cross Datacenter / Organizational / B2B
Enterprise Message Bus
Messaging-as-a-Service
Web / IoT Messaging
Instant Messaging

“publish-subscribe messaging rethought as a distributed commit log”

Kafka is a Mashup
Mashup of some well proven concepts into something even greater and easier to use:
EAI + ETL
Messaging Middleware + Big Data
Batch + Real-time
Data Movement + Data Processing
Log Data Streams + Structured Database Tables

+ Distributed clustered storage
Kafka is a blend of messaging, stream processing, ETL and
modern database designs built around a distributed log
+ Streaming platform
Pub/Sub
Messaging
ETL
Connectors
Spark
Flink
Beam
IBM MQ
TIBCO
RabbitMQ
Mulesoft
Talend
Informatica
Kafka is much more than messaging
+ Exactly Once
+ Designed for the Cloud
+ Inter DC replication
+ Schema evolution
Stream
Processing
Confluent Confidential

What’s different about Kafka? Topics are also Queues
Consumers can share one copy of the data
• Independent consumers share the same log
• Inter-dependent consumers share the same log
• No need for Topic/Queue bridging or multiple
copies of the data
Message processing is greatly simplified
- There is no “head’ of the queue
- Writes are sequential, distributed, and
parallel

What’s different about Kafka? Messages are not deleted when
consumed
Messages in the commit logs are persistent and immutable
Slow Consumers are (very) decoupled from Fast Producers
Batch and real-time are unified
Message Replay, Replication, and Auditing are built-in (for free)
All production messaging deployment need some form of these
Message Retention is not a waste of disk space
You need to size for offline/disconnected consumers anyway
Distributed State can always be recreated from a common commit log
Makes distributed HA apps much easier to build

What’s different about Kafka? Topic Partitions and Keyed Messages
- Topics/Queues are not the smallest unit of
scalability
- Topics partitions are distributed across
brokers for parallel in-order consumption
- This is very different from a cluster of
traditional message brokers
- [graphic of topic partitions with parallel
Producers, Brokers, and Consumers]
- Sometime you can just use more keys
instead of more topics
- Eg. don’t create a new topic for every user,
or IoT device, create unique keys
- This is proven to scale to many millions of
connected users, cars and IoT devices
- [graphic to show Keyed messages get
distributed across topic partitions]

From an event stream / transaction log we can derive all of the following
database centric features:
- Replication
- Secondary Indexing
- Caching
- Materialized Views
What’s different about Kafka? Duality of streams and databases
Duality of a message streams and database tables is a key design point
=

(Good) Microservices avoid shared mutable state
Shared, mutable state

Old World: REST Based Microservices Interconnect
GUI
UI Service Order
s
Returns
Pay Fulfilment Stock
Each Microservice has to maintain their own stateful
nature by using their own databases
1. Difficult to Enforce Same REST API standards
across many languages and micro-services.
2. Rest APIs Inherently Slow: Limited to Thousands
calls/sec.
3. Inter Service Dependencies are Messy.
4. Each Service Needs to Maintain State.
5. Difficult to enforce consistent security standards.
6. Logging is distributed between services.
7. Version compatibility between services is difficult.

Streaming Microservices with Kafka
GUI
UI
Service
Orders
Service
Returns
Service
Fulfilment
Service
Payment
Service
Stock
Service
Database Sources Now Centralized on the Kafka Bus for all microservices
1. Service inter-communication standard enforce by Kafka Schema Registry.
2. Millions of messages per second on cheap hardware.
3. No Inter-Service Dependency: just depend on Kafka.
4. Each service can be stateless: Kafka maintains state.
5. Security can be enforced by ACLs from Kafka.
6. Logs can be aggregated into Kafka.
7. Version compatibility can be enforced by Scheme Registry.
8. Kafka is inherently HA, horizontal scalable: still no central point of failure.

What’s different about Kafka? Ecosystem and Adoption
The Kafka ecosystem is flourishing and developer adoption continues to grow
• Confluent Platform additions (REST Proxy, Schema Registry, KSQL etc.)
• Third Party Connectors ( Confluent Hub)
• Open Source contributions from individuals, corporations, vendors, consulting organizations
• Inside and outside of Big Data/Stream Processing

Adoption of Event Streaming
60%Fortune 100 Companies
Using Apache Kafka

Event Streaming at the Heart of the Enterprise

Microservices in a Streaming World

Recommended

Recommended

More Related Content

What's hot

What's hot (18)

Similar to Microservices in a Streaming World

Similar to Microservices in a Streaming World (20)

Recently uploaded

Recently uploaded (20)

Microservices in a Streaming World