Timeless design in a cloud-native world

Ageless foundations, gently updated and applied to modern service design

Uwe Friedrichsen – codecentric AG – 2016-2018

Uwe Friedrichsen

IT traveller.
Dot Connector.
Cartographer of uncharted territory.
Keeper of timeless wisdom.
CTO and Fellow at codecentric.

https://www.slideshare.net/ufried
https://medium.com/@ufried
@ufried

“Cloud native computing uses an open source software stack
to deploy applications as microservices,
packaging each part into its own container,
and dynamically orchestrating those containers
to optimize resource utilization.”

Source: Cloud Native Computing Foundation Homepage (https://www.cncf.io/)

Consequences of cloud-native computing

Cloud-native computing
→ Services packaged in containers
→ Different processes at runtime
→ Remote communication between services
→ Distributed system

Distributed systems are not limited to cloud-native computing

(Almost) every system is a distributed system

-- Chas Emerick

http://www.infoq.com/presentations/problems-distributed-systems

The software you develop and maintain is most likely
part of a (big) distributed system landscape

Consequences of distributed systems

Everything fails, all the time.
-- Werner Vogels

Failures in distributed systems ...

•  Crash failure
•  Omission failure
•  Timing failure
•  Response failure
•  Byzantine failure

... lead to a variety of effects

•  Lost messages
•  Incomplete messages
•  Duplicate messages
•  Distorted messages
•  Out-of-order message arrival
•  Partial, out-of-sync local memory
•  ...

These effects are based on the non-determinism
introduced by remote communication
due to distributed failure modes

Understand that remote communication points are
predetermined breaking points of your application

Accept that the effects will hit you at the application level
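
What handling two of these effects at the application level can look like, as a minimal sketch: a receiver that tolerates duplicate messages and out-of-order arrival. It assumes every message carries a sequence number; the class name `Receiver` and the sample payloads are hypothetical. Lost messages are not covered here and would additionally need retransmission.

```python
from dataclasses import dataclass, field


@dataclass
class Receiver:
    """Applies each message exactly once and in sequence order."""
    next_seq: int = 0                             # next sequence number we expect
    pending: dict = field(default_factory=dict)   # early arrivals, keyed by sequence number
    applied: list = field(default_factory=list)   # payloads in the order they were applied

    def on_message(self, seq: int, payload: str) -> None:
        if seq < self.next_seq or seq in self.pending:
            return  # duplicate: already applied or already buffered
        self.pending[seq] = payload
        # Apply everything that is now contiguous with what was applied before.
        while self.next_seq in self.pending:
            self.applied.append(self.pending.pop(self.next_seq))
            self.next_seq += 1


receiver = Receiver()
# Message 1 arrives before message 0, and message 0 arrives twice.
for seq, payload in [(1, "pay"), (0, "read order"), (0, "read order"), (2, "mark paid")]:
    receiver.on_message(seq, payload)
print(receiver.applied)
```

The point of the sketch: none of this logic is provided by the infrastructure, it lives in application code.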

Typical measures

•  High-availability (HA) hardware/software
•  Only applicable for very small installations
•  Usually not available in cloud environments
•  Delegate failure handling to infrastructure level
•  Partial relief, will not solve all problems
•  Implement resilient software design patterns
•  Very important, still will not fix a bad design
•  Minimize number of remote communication points
•  Minimize problem surface by design

Reducing remote communication points

•  Reduce the number of distributed runtime artifacts
•  i.e., create coarser-grained services
•  Reduce the need to communicate across runtime artifact boundaries
•  i.e., get the functional design right

Remember?

https://www.martinfowler.com/articles/distributed-objects-microservices.html

If you can get away with a monolith, just do it!

You are still allowed to structure it well ... ;)

Still, your functional domain usually will be too big
to put it all in a single monolith

Thus, you will need to distribute your system landscape to a certain degree

Rule of thumb

Always think (at least) twice before distributing functionality
Only do it if you really need independent deployments
Addendum
You may also want to distribute your functionality if you have
very disparate NFRs for different parts of your functionality
(e.g., security *, availability, reliability, scalability **, portability,
resource utilization)
* forgotten most of the time
** less often needed than most people assume

Case study

(Very simple) eCommerce shop
•  Implements the core functionalities
•  Search & show
•  Add to shopping cart
•  Checkout
•  Shipment
•  Payments only touched as black box
•  No recommendations, etc.

The typical design approach ...

a.k.a. “the counter-example”

Typical design approach

Focus on avoiding redundancy and maximizing reuse

1.  Start with a comprehensive domain (actually: E/R) model

[E/R diagram: three entities (E), connected by relationships with “*” multiplicities]

Customer
•  name
•  address
•  payment methods

Order
•  customer
•  payment information
•  shipping information
•  order items (product, quantity)

Product
•  name
•  description
•  image(s)
•  price
•  items in stock
•  packaging information (size, weight, special care)



2.  Wrap entities with services

[Diagram: one service (S) per entity]
•  CustomerService wraps Customer
•  OrderService wraps Order
•  ProductService wraps Product



3.  Spread functionality over services

[Diagram: CustomerService, OrderService and ProductService on top of Customer, Order and Product; ProductService now carries the use case “Search/show”]



4.  Add “process services” for “complex use cases”
•  i.e., use cases that touch more than one data service

[Diagram: CheckoutService (Checkout) and ShipmentService (Shipment) added as process services on top of CustomerService, OrderService and ProductService (Search/show)]



5.  Add missing data maintenance use cases

[Diagram: services and their use cases after step 5]
•  CustomerService (Customer): Customer self care
•  OrderService (Order)
•  ProductService (Product): Search/show, Product catalog maintenance
•  CheckoutService: Checkout
•  ShipmentService: Shipment

This (familiar) design looks innocuous at first sight

But how good is it in terms of remote communication?

[Sequence diagram: checkout]
Participants: CheckoutService, OrderService, ProductService, CustomerService, Payment Provider

1.  <proceed to checkout>
2.  read order
3.  read price (loop over order items)
4.  calculate price
5.  read payment methods
6.  <show price and ask for payment method>
7.  <proceed to payment>
8.  pay
9.  mark paid
10.  <report back completion>

[Sequence diagram: shipment]
Participants: ShipmentService, OrderService, ProductService, CustomerService, Delivery Provider

1.  <initiate shipment>
2.  read order
3.  read product and packaging information (loop over order items)
4.  read delivery address
5.  <show shipment information>
6.  <parcel(s) packed – initiate delivery>
7.  inform delivery provider
8.  mark dispatched
9.  update items in stock (loop over order items)

Findings

•  Core business use cases are failure-prone and slow
•  Data maintenance use cases are robust and fast

Congratulations!

You designed a system for a company
whose core business purpose is
to maintain data, not to make money!

Properties of the design
•  Focus on avoiding redundancy and maximizing reuse
•  Based on traditional OO design practices
•  Results in high coupling between services
•  Results in moderate cohesion inside services
•  Okay for CRUD applications
•  But then better use a generator, scaffolding framework, ...
•  Okay-ish for single-process applications
•  Tends to affect maintainability negatively
•  Not okay for distributed services
•  Big failure surface, bad response times
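
To put a rough number on the failure surface, here is a small sketch that counts the remote calls of the checkout flow in the entity-service design and estimates the chance that all of them succeed. The call list follows the checkout sequence shown earlier (with “read price” executed once per order item). The per-call success rate of 99.9% and the assumption of independent failures are illustrative only, not measured values.

```python
def checkout_remote_calls(order_items: int) -> int:
    """Remote calls of the entity-service checkout for a given number of order items."""
    read_order = 1
    read_prices = order_items        # one "read price" per order item
    read_payment_methods = 1
    pay = 1
    mark_paid = 1
    return read_order + read_prices + read_payment_methods + pay + mark_paid


def success_probability(calls: int, per_call_success: float) -> float:
    """Chance that all calls succeed, assuming independent failures."""
    return per_call_success ** calls


for items in (1, 5, 20):
    calls = checkout_remote_calls(items)
    print(items, calls, round(success_probability(calls, 0.999), 4))
```

The probability shrinks with every additional call, and the response time grows with it, because the calls in this flow depend on each other.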

Let’s do a bit of research ...

Structured design

by W. P. Stevens, G. J. Myers and L. L. Constantine

[Ste 1974]

“The fewer and simpler the connections between modules, the easier it is to
understand each module without reference to other modules.

Minimizing connections between modules also minimizes the paths
along which changes and errors can propagate into other parts of the
system, thus eliminating disastrous ‘ripple’ effects, where changes in one
part cause errors in another, necessitating additional changes elsewhere,
giving rise to new errors, etc.”

[Ste 1974]

“Coupling is the measure of the strength of association
established by a connection from one module to another.”

[Ste 1974]

Contributing factors to coupling, from low to high coupling:

•  Interface complexity: simple, obvious → complicated, obscure
•  Type of connection: to module by name (depending on interface) → to internal elements (depending on implementation)
•  Type of communication: data (control flow handled by environment) → control (explicit passing of control) → hybrid (manipulation of internal control flow by parameters)

[Ste 1974]

Realize that this paper was written at a very different time
and in a very different context than we face today

While the core concepts are timeless and still valid,
we usually need to rethink the concrete instructions

Contributing factors to coupling, from low to high coupling, with functional independence added:

•  Interface complexity: simple, obvious → complicated, obscure
•  Type of connection: to module by name (depending on interface) → to internal elements (depending on implementation)
•  Type of communication: data (control flow handled by environment) → control (explicit passing of control) → hybrid (manipulation of internal control flow by parameters)
•  Functional independence *: independent (does not need other service to work) → partly dependent (graceful degradation of service) → fully dependent (does not work without other service)

* Ability of a service to complete its task without the other service being present
“Coupling is reduced when the relationships among elements not in the
same module are minimized.

There are two ways of achieving this – minimizing the relationships among
modules and maximizing relationships among elements in the same
module. In practice, both ways are used. [...]

Binding is the measure of the cohesiveness of a module. The objective
here is to reduce coupling by striving for high binding.”

[Ste 1974]

On the criteria to be
used in decomposing
systems into modules

by David L. Parnas

[Par 1972]

“The effectiveness of a ‘modularization’ is dependent upon the criteria used in
dividing the system into modules.”

“The second decomposition was made using "information hiding" as a
criterion. [...] Every module in the second decomposition is characterized by
its knowledge of a design decision which it hides from all others. Its
interface or definition was chosen to reveal as little as possible about its
inner workings.”

“There are a number of design decisions which are questionable and likely to
change under many circumstances. [...] By looking at these changes we can
see the differences between the two modularizations.”

[Par 1972]
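
A minimal sketch of information hiding in this sense: a module that hides a single design decision, here how shopping cart lines are stored. The class and its methods are hypothetical and not taken from the paper.

```python
class ShoppingCart:
    """Hides one design decision: how cart lines are stored internally."""

    def __init__(self) -> None:
        self._quantities = {}   # hidden decision: dict keyed by product ID

    def add(self, product_id: str, quantity: int = 1) -> None:
        self._quantities[product_id] = self._quantities.get(product_id, 0) + quantity

    def total_items(self) -> int:
        return sum(self._quantities.values())

    def lines(self) -> list:
        """Return (product_id, quantity) pairs sorted by product ID."""
        return sorted(self._quantities.items())


cart = ShoppingCart()
cart.add("teapot")
cart.add("cup", 4)
cart.add("teapot", 1)
print(cart.lines(), cart.total_items())
```

Callers only see `add`, `total_items` and `lines`. Switching the internal storage to a list or a database table would be a local change.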

Separation of concerns
•  One concept/decision per module

Information hiding
•  Reveal as little as possible about internal implementation

Together, these lead to:

Better changeability
•  Changes are kept local

Independent teams
•  Teams can work independently on different modules more easily

Easier to comprehend
•  Modules can be understood on their own more easily

Research findings
•  High cohesion, low coupling leads the right way
•  Separation of Concerns and Information hiding
support implementing them
•  Concrete paper instructions should not be followed blindly
•  Different context (single process, very limited hardware)
•  Would lead to nano or pico services → lots of remote calls
•  You need to rethink instructions in the current context
•  Required for all CS papers from a different time & context
•  Leads to concept of Functional Independence in this context
•  Reduces risk of “vertical decomposition” (i.e., layered design)

Core functional decomposition approaches

•  Vertical decomposition (layer design, composition): functionality linked by <uses> relations
•  Horizontal decomposition (pillar design, segregation)

In practice, you typically use a combination of both approaches

Vertical decomposition

•  Based on “uses” relation
•  Typical drivers are reuse and avoidance of redundancy
•  Creates strong coupling (high functional dependence)
•  Often useful pattern inside a process boundary
•  Due to deterministic communication behavior
•  Problematic across process boundaries
→ Should be avoided in service design

Horizontal decomposition

•  Based on functional segregation
•  Typical drivers are autonomy and independence
•  Creates low coupling (high functional independence)
•  Useful pattern across process boundaries
•  Can also be useful inside a process boundary
•  Less accidental "ripple" effects due to changes
→ Should be preferred in service design

Watch out!

•  Vertical decomposition is our default design approach
•  We’ve learned it in our CS education (divide and conquer, ...)
•  It’s emphasized in our daily work (DRY, reusability, ...)
•  Even our IDEs support it (“Extract method”)
•  It's everywhere! It's predominant!
•  It takes energy not to apply vertical decomposition
•  Most people never learned horizontal decomposition

How to learn horizontal decomposition?

Domain-Driven Design

by Eric Evans

[Eva 2004]

DDD to the rescue?

•  Naive application of building block patterns leads
to the counter-example design we have seen before
•  Not useful in our context due to high coupling
•  “Service” pattern leads to process service working on entities
•  Anti-pattern in our context due to high coupling
•  “Conceptual contours” supports high cohesion
•  Yet, tends to be too fine grained for our context
•  “Bounded contexts” supports low coupling
•  Yet, tends to be too coarse grained for our context
→ Mixed emotions: Good, but not the expected panacea

For good service design, look at the behavior first, not the data

Case study

(Very simple) eCommerce shop
•  Implements the core functionalities
•  Search & show
•  Checkout
•  Shipment
•  Customer self-care
•  Product catalog maintenance
•  Payments only touched as black box
•  No recommendations, etc.

Core reasoning

To reduce the number of remote calls needed for a given functionality,
we need to spread the functionality between the services in a way
that a single use case/user interaction less often needs to cross service
boundaries.

Therefore, we try to organize services around use cases/user interactions.

[Use case diagram: eCommerce shop]
•  Use cases: Search & show, Add to shopping cart, Checkout, Shipment, Customer self-care, Product catalog maintenance
•  Actors: Customer, Back office employee, Warehouse employee
Three different actors
•  Indicator for cohesion boundaries
•  (At least) three different UIs
•  Could be completely different architectures
•  Depending on user needs, usage patterns and other NFRs
•  As an architect this gives you additional options

Options per actor-specific part of the system:
•  Could be a mobile-first FE with service-oriented backend
•  Could be a special warehouse device FE with a monolithic backend
•  Could be a rich desktop app
•  Could be a desktop browser first FE with a service-oriented backend

Behavior-based design approach

Focus on minimum cross-service communication
inside a use case/user interaction

1.  Each use case/user interaction is a service candidate

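[Diagram: one service candidate (S) per use case]
•  SearchService: Search/show
•  ShoppingCartService: Add to shopping cart
•  CheckoutService: Checkout
•  ShipmentService: Shipment
•  CustomerMDService *: Customer self-care
•  ProductCatalogService: Product catalog maintenance

* MD = Master Data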



2.  Possibly split big use cases in multiple services
•  Only if really needed (e.g., multiple teams, disparate NFRs)
•  Look for functional clusters with low coupling between them

Splitting up use cases into multiple services is not needed in this example



3.  Try to group several use cases in a single service
•  Strive for a sweet spot in terms of an overall trade-off
•  Look for service candidates that operate on the same data

[Diagram: service candidates and the data they operate on]
•  SearchService: product catalog
•  ProductCatalogService: product catalog
•  ShoppingCartService: shopping cart
•  CheckoutService: buying order
•  ShipmentService: inventory data / shipping order
•  CustomerMDService: customer master data

Service candidates working on the same data: SearchService and ProductCatalogService (product catalog)
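
The search for candidates that operate on the same data can be sketched mechanically. The mapping below is an assumption derived from the service names and data labels in the diagram; the grouping only produces candidates for a closer look, it does not make the architectural decision.

```python
from collections import defaultdict

# Service candidate -> data it operates on (assumed mapping, see lead-in).
candidate_data = {
    "SearchService": "product catalog",
    "ProductCatalogService": "product catalog",
    "ShoppingCartService": "shopping cart",
    "CheckoutService": "buying order",
    "ShipmentService": "inventory data / shipping order",
    "CustomerMDService": "customer master data",
}

by_data = defaultdict(list)
for service, data in candidate_data.items():
    by_data[data].append(service)

# Candidates sharing data are worth a closer look; merging remains a trade-off.
merge_candidates = {data: services for data, services in by_data.items() if len(services) > 1}
print(merge_candidates)
```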

Architectural reasoning

•  Same data ...
•  ... but different actors

•  Option to work on a single product catalog database
here outweighs different actors using a single service

→  Unite in one service
(unless you decide to use a different architectural style
for the back office employee application)

[Diagram: Search/show and product catalog maintenance united in one service working on the product catalog; ShoppingCartService, CheckoutService, ShipmentService and CustomerMDService unchanged]

Service candidates working on the same type of data: ShoppingCartService (shopping cart) and CheckoutService (buying order), since the shopping cart is a preliminary order

Architectural reasoning

•  Some (sequential) cohesion and could work on same data ...
•  ... but unification is still not imperative

•  Need to ponder other aspects and balance trade-offs
•  Different representations for shopping cart and order needed?
•  UI part of the service?
•  How does payment interfere (not considered in the example)?

→  Here we assume that it is best to unite the services


[Diagram: ShoppingCartService and CheckoutService united in OrderCreationService, working on shopping cart / buying order; the product catalog service (Search/show, maintenance), ShipmentService (Shipment) and CustomerMDService unchanged]

Additional reasoning

•  Buying order vs. shipping order
•  Fewer commonalities than shopping cart and buying order
•  Shipping order is only an “ephemeral” entity
•  Different actors using them
→ Keep them separated (we need a signaling mechanism then)
•  Who updates items in stock?
•  No longer part of product catalog maintenance
•  Warehouse employee responsible (more reasonable anyway)
→ Add additional use case “Fill up inventory”

[Diagram: resulting services]
•  CustomerMDService
•  OrderCreationService: Checkout (data: shopping cart / buying order)
•  WarehouseService: Shipment, Fill up inventory
•  Product catalog service: Search/show, maintenance (data: product catalog)

Nice, but is this design any better?

Again: How good is it in terms of remote communication?

[Sequence diagram: checkout]
Participants: CheckoutService, Payment Provider

1.  <proceed to checkout>
2.  calculate price
3.  <show price and ask for payment method>
4.  <proceed to payment>
5.  pay
6.  <report back completion>

[Sequence diagram: shipment]
Participants: WarehouseService, Delivery Provider

1.  <initiate shipment>
2.  <show shipment information>
3.  <parcel(s) packed – initiate delivery>
4.  inform delivery provider
5.  update items in stock

Findings

•  All use cases are robust and fast
Side note: It is not always as nice and simple as in this example

1st law of architectural work:

Every decision has its price.
No decision is for free.

(Translation: No decision only has upsides. Every decision also has downsides.)

2nd law of architectural work:

A decision can only be evaluated
with respect to its context.

(Translation: Decisions are not invariably “good” or “bad”, but only in a given context.)

Trade-offs of the approach

•  Biggest concern: What about the data?
•  Data replication and reconciliation
•  Entity distribution (no single source of truth for an entity)
•  Question cannot be answered in general
•  Here we will evaluate it with respect to the given example
•  Plus some general considerations (but no general evaluation)

[E/R diagram: three entities (E), connected by relationships with “*” multiplicities]

Customer
•  name
•  address

Order
•  customer
•  order items (product, quantity)

Product
•  name
•  description
•  image(s)
•  price
This diagram is misleading!

[Same E/R diagram, with annotations on the entities and attributes]
•  Only used as copy template
•  Only relevant for search/show
•  Only relevant for checkout
•  Just an ID for business related referencing purposes
•  Only relevant for checkout (including invoice address)
•  Only relevant for shipment
•  Only relevant for shipment (including delivery address)
•  Different for checkout and shipment (only IDs and quantities needed)
•  Immutable after completion (all data copied into order)
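
The “copy template” idea can be sketched like this: at checkout, the order copies the product data it needs and keeps only the ID as a reference. The field names and the sample catalog entry are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OrderItem:
    product_id: str     # just an ID for business-related referencing
    name: str           # copied from the catalog at checkout time
    unit_price: float   # copied, so later price changes do not affect the order
    quantity: int


def snapshot_item(catalog_entry: dict, quantity: int) -> OrderItem:
    """Copy the catalog data an order needs instead of referencing it."""
    return OrderItem(catalog_entry["id"], catalog_entry["name"], catalog_entry["price"], quantity)


catalog_entry = {"id": "P-1", "name": "Teapot", "price": 19.99}
item = snapshot_item(catalog_entry, 2)
catalog_entry["price"] = 24.99       # the catalog changes after the order was placed
print(item.unit_price)               # the order keeps the price it was placed with
```

Because the order item is frozen and self-contained, no later read from the product catalog is needed to process or display the order.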

[Diagram: the resulting services (CustomerMDService, checkout with shopping cart / buying order, WarehouseService, product catalog with Search/show and maintenance), with numbered annotations]

1.  Needs to copy some product (and customer) data into the order (could be handled by the frontend)
2.  Needs to signal data for shipment order (signaling mechanism required)
3.  Putting these use cases in a single service avoids the need for data replication
4.  Putting these use cases in a single service avoids the need for data signaling

Findings

•  All use cases are robust and fast
•  Minimal need to transfer data between services
•  Solvable via frontend and standard data transfer solution
(batch file, transfer table, message queue, ...)
•  No data replication and reconciliation solution needed
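
A minimal sketch of such a data transfer, assuming an in-process `queue.Queue` as a stand-in for a real message queue or transfer table. The function names and the order layout are hypothetical. Only the data relevant for shipment is signaled.

```python
import queue

shipping_signals = queue.Queue()   # stand-in for a message queue or transfer table


def complete_order(order: dict) -> None:
    """Order side: after payment, signal only what shipment needs."""
    shipping_signals.put({
        "order_id": order["order_id"],
        "delivery_address": order["delivery_address"],
        "items": [(item["product_id"], item["quantity"]) for item in order["items"]],
    })


def next_shipping_order() -> dict:
    """Warehouse side: pick up the next signaled shipping order."""
    return shipping_signals.get_nowait()


complete_order({
    "order_id": "O-7",
    "delivery_address": "1 Example Street",
    "payment": "paid",
    "items": [{"product_id": "P-1", "name": "Teapot", "unit_price": 19.99, "quantity": 2}],
})
shipping_order = next_shipping_order()
print(shipping_order)
```

Payment details and prices stay on the order side. The warehouse side never calls back into the order service to do its work.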

Side note: It is not always as nice and simple as in this example

Still, it is not always that nice and easy

Translation: There are situations where two or more copies of the same data need to be kept in sync

[Diagram: the resulting services (CustomerMDService, checkout with shopping cart / buying order, WarehouseService, product catalog with Search/show and maintenance), with numbered annotations]

1.  Might want to allow update of payment methods in the context of customer self care and checkout (requires two-way synchronization of master data after change)
2.  Might want to allow adding items to shopping cart only if items are in stock (requires one-way synchronization of transactional data after change)
It might even hit us
in our example

How can we keep the data in sync?

Options to keep data in sync

•  Shared database ✖
•  Compromises original reasoning to use services
•  Distributed transactions (2-phase commit) ✖
•  Tight coupling compromises service independence
•  Compromises availability and scalability
•  Eventual consistency ✔
•  Sufficient for basically all functional requirements
•  Supports low coupling and high availability
•  Downside: Much harder programming model than ACID TX

Options for eventual consistency
•  Batch pull
•  Consumer pulls data batch when ready to process data
•  Very robust approach (also suitable for legacy integration)
•  Data sync delay may be longer than acceptable
•  Batch bootstrapping & delta push
•  Initial state sync via batch pull, then push of delta updates
•  Often combined with event sourcing, CQRS, reactive, ...
•  Fast, robust (if done right) and still quite lightweight
•  Distributed log
•  Offers advantages of previous approach in one technology
•  Kafka currently is the best-known implementation
•  Still have a plan how to recover if the tool breaks
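
The second option, batch bootstrapping and delta push, can be sketched as a read-only replica of stock levels. The sketch assumes that every update carries a per-product version number, which is what makes late or duplicate deltas harmless. The class `StockReplica` and the sample data are hypothetical.

```python
class StockReplica:
    """Read-only copy of stock levels, kept eventually consistent."""

    def __init__(self) -> None:
        self.levels = {}     # product ID -> items in stock
        self.versions = {}   # product ID -> version of the last applied update

    def bootstrap(self, batch: list) -> None:
        """Initial state sync: load a full batch of (product_id, version, level)."""
        for product_id, version, level in batch:
            self.apply_delta(product_id, version, level)

    def apply_delta(self, product_id: str, version: int, level: int) -> None:
        """Apply a pushed update; stale or duplicate updates are ignored."""
        if version <= self.versions.get(product_id, -1):
            return
        self.versions[product_id] = version
        self.levels[product_id] = level


replica = StockReplica()
replica.bootstrap([("P-1", 3, 10), ("P-2", 1, 0)])
replica.apply_delta("P-1", 5, 8)   # newer update
replica.apply_delta("P-1", 4, 9)   # arrives late: older than what we have
replica.apply_delta("P-2", 2, 6)
print(replica.levels)
```

Between two updates the replica may be out of date. That window is the price of eventual consistency and has to be acceptable for the use case.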

And the single source of truth issue?

Pondering single source of truth

•  Usually task for analytical data processing
•  Orthogonal, well-understood issue
•  Many solutions available
•  Sometimes needed in transactional systems (e.g., CRM)
•  Question if it is really a need or just a habit
•  Strive for eventual consistency
•  Go for event streams or distributed logs for fast updates

Wrap-up
•  Think (at least) twice before distributing functionality
•  Strive for low coupling, support with high cohesion
•  Prefer horizontal decomposition in service design
•  Favor functional independence over reuse
•  The magic is in the behavior, not the data
•  Employ use cases to find service boundaries
•  Prefer eventual consistency for data synchronization
•  Value the timeless wisdom
•  But update the instructions to the given current context
