Platform Engineering 101: Empowering Developers

Empowering developers to deliver - as-a-service
Skyworkz - https://skyworkz.nl
Sander Knape - https://sanderknape.com

Hello, I’m Sander Knape
Cloud Engineer @ Skyworkz

Let’s build an app
● Application that connects to a database
● Works on my machine™
→ Let’s get it into the Cloud!

Cloud provider / runtime environment

There’s quite a bit going on
● Cloud provider / runtime environment
● Infrastructure as Code
● CI / CD
● Artifact management
● Security scanning
● Configuration management
● Secret management
● Monitoring, Logging, Metrics
● Alerts
● Database migrations
● Database anonymization
● Cost insights
● Security
● Application

Software development is more than just
writing [Go, Java, NodeJS, C#, …]

Definition time: “Platform Engineering”
The composition and integration of a set of processes, tools and automation
(components) to build a coherent platform with the goal of empowering developers
to be able to easily build, maintain and operate their business logic.

Why Platform Engineering
“[T]he reality is that state of the art cloud native technology is still too hard to use if
every product engineering team has to individually solve common problems
around networking, observability, deployment, provisioning, caching, data storage,
etc.”
https://medium.com/@mattklein123/the-human-scalability-of-devops-e36c37d3db6a

Specific knowledge is required

Platform Engineering == Software Engineering

Who has the responsibility?
1 or 2 platform engineers
1 platform team
multiple platform teams

Multiple approaches
● “Just hand me your source code on a USB stick. I’ll put it on the server and
make sure that it keeps working”
Or…
● “Create a pull request <here> to automatically get a Git repository, CI/CD,
filled with a Hello World application, that deploys all the way to production”

Challenges we’ll talk about
1. Protecting the platform for organizational scalability
2. Building an opinionated platform
3. Specifying contracts
4. Collaborating with your users

Protecting the platform for
organizational scalability

Protect the platform
https://www.independent.ie/sport/soccer/premier-league/the-platform-we-created-is-more-important-than-one-person-mauricio-pochettino-coy-on-future-37652030.html

The question to ask for each new feature
How much support will I need to provide after I have delivered this feature?
Less (or equal) is the only acceptable answer

Reducing toil
“Toil is the kind of work tied to running a production service that tends to be
manual, repetitive, automatable, tactical, devoid of enduring value, and that
scales linearly as a service grows.”
https://landing.google.com/sre/sre-book/chapters/eliminating-toil/

Examples of toil
● Manually creating Git repositories and granting the correct people access
● Manually creating CI/CD pipelines based on another project
● Running load tests from a laptop with a security token that you cannot share
● Putting secrets into your secret management solution

Examples of toil: old-school operations
● Manually:
○ Flushing caches
○ Rebooting servers
○ Renewing SSL certificates
○ ...

Self-service through automation

Self-service instead of toil
● Manually creating Git repositories and granting the correct people access
● Manually creating CI/CD pipelines based on another project
● Running load tests from a laptop with a security token that you cannot share
● Putting secrets into your secret management solution
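
The first two items above lend themselves to a self-service request that automation turns into concrete steps. The following is a minimal sketch of that idea, not the speaker's implementation: the `ServiceRequest` schema, the `plan` helper, and the example names are all hypothetical, and the steps are plain strings where a real platform would call its Git host and CI/CD APIs.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ServiceRequest:
    """What a developer submits, e.g. as a file in a pull request (hypothetical schema)."""
    name: str
    team: str
    maintainers: List[str] = field(default_factory=list)


def plan(request: ServiceRequest) -> List[str]:
    """Translate a request into the steps automation would carry out."""
    if not request.maintainers:
        raise ValueError("at least one maintainer is required")
    repo = f"{request.team}/{request.name}"
    steps = [f"create repository {repo}"]
    steps += [f"grant {user} write access to {repo}" for user in request.maintainers]
    steps.append(f"create CI/CD pipeline for {repo} from the default template")
    return steps


# Example values are made up for illustration.
request = ServiceRequest(name="order-service", team="backoffice", maintainers=["alice", "bob"])
for step in plan(request):
    print(step)
```

Because the request is data, it can be reviewed in a pull request and replayed, which is what removes the manual, repetitive part of the work.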

Scalable support
Developer: Hello, can you help me with “something” for feature X?
Platform: Did you check the docs for feature X?
Developer: Yes, but it doesn’t mention anything about “something”
Platform: I see, give me a second
Developer: Sure
Platform: I edited the docs to include “something”, does this answer
your question? https://github.com/org/docs/pull/1337
Developer: Totally, thanks!

How to protect yourselves
● Automate. Think “as a service”.
● Prefer managed solutions over self-hosted ones. Every minute you spend on
maintaining a self-hosted solution is time not spent on improving the platform’s
usability
● Prefer documentation over one-off answers
● Make the platform as simple as possible

Building an opinionated platform

Abstraction levels
“Serverless”
“IaaS”
Level of abstraction
“Bricks”
Platform
Managed externally
Platform features

Abstraction levels
“Serverless”
“IaaS”
Level of abstraction
“Bricks”
Platform
Managed externally
Platform features
Splunk Cloud
AWS CloudWatch
AWS Secrets Manager
Google Cloud Datastore

Abstraction levels
“Serverless”
“IaaS”
Level of abstraction
“Bricks”
Platform
Managed externally
Platform features
HashiCorp Vault
Splunk Self-Hosted
Prometheus/Grafana
MySQL on EC2
GitLab on EC2

Abstraction levels
“Serverless”
“IaaS”
Level of abstraction
“Bricks”
Platform
Managed externally
Platform features
CloudFormation

Abstraction levels
“Serverless”
“IaaS”
Level of abstraction
“Bricks”
Platform
Managed externally
Platform features
Developer responsibility

Self-service
● Lambda languages vs. Lambda runtimes
● Hosted, shared CI/CD solution vs. BYO CI/CD solution
● Default Docker / EC2 images vs. build your own
● Default network with every application vs. create your own VPC/Subnet

Elasticsearch settings
$> curl -s 'localhost:9200/_cluster/settings?include_defaults=true' \
     | jq '.defaults' \
     | grep '"[a-z_.-]*": ["[]' \
     | wc -l
361

Business value
What is the business value of each development team maintaining their own:
● CI/CD server
● Monitoring solution
● Network
● ...

Getting buy-in
Team Manifesto
We build everything with developers. We’ll always select a pioneering team that
has interest in a new service/feature, and work with them to make sure we pick
and configure the right tool and solve the root problem.

How to sell an opinionated platform
● Simple to use, high-level abstractions; developers shouldn’t want to use
anything else
● Make everyone aware that you are constantly seeking the right balance
● Build the features together with developers

Your contract is your interface
“Serverless”
“IaaS”
Level of abstraction
“Bricks”
Platform
Managed externally
Platform features
Interface

Perfect world
Platform Application

Actual world
Grey area

Implemented world
Contract

Implemented world
Component A
Component B
Contract

Embrace dependencies
● Order Service (owner: backoffice team)
● Product Service (owner: catalogue team)
● Artifact Mgmt (owner: platform team)

Documentation
“Serverless”
“IaaS”
Level of abstraction
“Bricks”
Platform
Documented by the external party
Documented by the platform team
Interface

AWS: Shared Responsibility Model

Shared Responsibility
Platform:
● Git repository automation
● CI/CD autoscaling
● Network ACLs
● Kubernetes cluster
● Logging / Metrics agent on EC2, listening on specific port
Development:
● Creating Git repositories
● CI/CD configuration
● Security groups
● Application in Kubernetes
● Shipping logs/metrics to specific port

Continuously test your responsibility
● Are my agents autoscaling?
● Are my NACLs correct?
● Is my Kubernetes cluster stable?
● Are the logging/metrics agents operational?
→ Make this transparent!
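
One way to make these questions transparent is to run them as automated checks and publish the result where every team can see it. The sketch below only illustrates the shape of such a runner. The check names mirror the slide, but the check functions are stand-ins (assumptions); real ones would query the CI system, the cloud provider, or the cluster.

```python
from typing import Callable, Dict


def run_checks(checks: Dict[str, Callable[[], bool]]) -> Dict[str, str]:
    """Run each check and map its name to 'OK', 'FAILING', or 'ERROR'."""
    report = {}
    for name, check in checks.items():
        try:
            report[name] = "OK" if check() else "FAILING"
        except Exception:
            # A check that cannot run is itself worth publishing.
            report[name] = "ERROR"
    return report


def broken_probe() -> bool:
    """Stand-in for a probe that cannot reach its target."""
    raise RuntimeError("probe could not reach the cluster")


# Stand-in checks with hard-coded outcomes, for illustration only.
checks = {
    "CI/CD agents autoscaling": lambda: True,
    "Network ACLs correct": lambda: True,
    "Kubernetes cluster stable": broken_probe,
    "Logging/metrics agents operational": lambda: False,
}

report = run_checks(checks)
for name, status in report.items():
    print(f"{status:8} {name}")
```

Publishing the report, rather than only alerting the platform team, lets developers answer "is it me or the platform?" without opening a support request.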

Shift left
Idea → Development → Staging → Production
Early feedback → Late feedback

Codify your contract
Implementation:
  execution:
  - concurrency: 5
    hold-for: 20m
    ramp-up: 5m
  scenarios:
    requests:
    - method: GET
      url: https://example.com
  modules:
    blazemeter:
      projectName: Team Name
      testName: Test Name
Contract:
  modules:
    blazemeter:
      projectName: .*
      testName: .*

Best Really good

execution:
- concurrency: 5
  hold-for: 20m
  ramp-up: 5m
scenarios:
  requests:
  - method: GET
    url: https://example.com
modules:
  blazemeter:
    projectName: Team Name
Must specify testName: https://url/to/docs
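
One reading of the two configuration slides above: the contract is a set of required keys, each with a pattern, and a configuration that omits one is rejected early with a pointer to the docs. Below is a minimal sketch of such a check. The `CONTRACT` layout and the `lookup` and `validate` helpers are assumptions made for illustration, the configuration is shown as an already-parsed dictionary, and the docs URL is the placeholder from the slide.

```python
import re

# Hypothetical contract: dotted path -> regex the value must fully match.
CONTRACT = {
    "modules.blazemeter.projectName": r".*",
    "modules.blazemeter.testName": r".*",
}
DOCS_URL = "https://url/to/docs"  # placeholder taken from the slide


def lookup(config, dotted_path):
    """Return the value at dotted_path, or None if any key is missing."""
    node = config
    for key in dotted_path.split("."):
        if not isinstance(node, dict) or key not in node:
            return None
        node = node[key]
    return node


def validate(config):
    """Return a list of contract violations (empty if the config is valid)."""
    errors = []
    for path, pattern in CONTRACT.items():
        field_name = path.rsplit(".", 1)[-1]
        value = lookup(config, path)
        if value is None:
            errors.append(f"Must specify {field_name}: {DOCS_URL}")
        elif not re.fullmatch(pattern, str(value)):
            errors.append(f"{field_name} does not match {pattern!r}: {DOCS_URL}")
    return errors


incomplete = {"modules": {"blazemeter": {"projectName": "Team Name"}}}
complete = {"modules": {"blazemeter": {"projectName": "Team Name",
                                       "testName": "Test Name"}}}

print(validate(incomplete))
print(validate(complete))
```

Running a check like this in the pipeline, before the load test starts, is one way to give the early feedback that shifting left asks for.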

Codify your microservices
$> git clone example/template/skeleton

Default microservice
● Unit testing
● Integration testing
● Log example
● Metric example
● Secret example
● CI/CD to production
● ...
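
To make the log, metric, and secret examples concrete, here is a small sketch of what a skeleton service might ship with. Everything in it is an assumption for illustration: the service name, the in-process `Counter` standing in for a metrics client, and the convention that secrets arrive as environment variables.

```python
import logging
import os
from collections import Counter

logging.basicConfig(level=logging.INFO, format="%(levelname)s %(name)s %(message)s")
log = logging.getLogger("hello-service")

# In-process stand-in for a real metrics client (assumption for this sketch).
metrics = Counter()


def get_secret(name: str) -> str:
    """Read a secret injected as an environment variable (assumed convention)."""
    value = os.environ.get(name)
    if value is None:
        raise KeyError(f"secret {name} is not set")
    return value


def handle_request(user: str) -> str:
    """Business logic placeholder that emits one metric and one log line."""
    metrics["requests_total"] += 1
    log.info("handling request for %s", user)
    return f"Hello, {user}!"


# Set a dummy value so the sketch runs on its own; never hard-code real secrets.
os.environ.setdefault("DB_PASSWORD", "example-only")
print(handle_request("world"))
print(metrics["requests_total"])
print(bool(get_secret("DB_PASSWORD")))
```

A skeleton like this gives every new service a working example of each platform feature to copy from.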

How to define your contracts
● Be aware of the contract and make the company aware
● Minimize the gray area
● Codify your contract
○ Shift left
○ Examples / blueprints
● Make sure the company understands the interface to your team

Getting buy-in
Team Manifesto
We build everything with developers. We always select a pioneering team that has
interest in a new service/feature, and work with them to make sure we pick and
configure the right tool and solve the root problem.

Imagine… a power outage
You’re at home, watching TV, and BAM: TV is out, lights are out. What do you do?
● Check the other rooms to see the scope of the outage
● Check with the neighbours if it’s just your home or more
● Check with the power company

How to collaborate with your users
● Teach everyone how the platform works, in a scalable way
○ Documentation
○ Workshops
● Enable collaboration between development teams
● Make it clear how you work

Summarizing
1. You need a platform - and someone should own it
2. Always think about scale - both organizational and technical
3. Opinions are OK
4. Specify contracts, but be careful about what you support
5. Collaborate

Thank you!
Questions?
Skyworkz - https://skyworkz.nl
Sander Knape - https://sanderknape.com - @SanderKnape
