0-60: Tesla's Streaming Data Platform (Jesse Yates, Tesla) Kafka Summit SF 2019


Tesla ingests trillions of events every day from hundreds of unique data sources through our streaming data platform. Find out how we developed a set of high-throughput, non-blocking primitives that allow us to transform and ingest data into a variety of data stores with minimal development time. Additionally, we will discuss how these primitives allowed us to completely migrate the streaming platform in just a few months. Finally, we will talk about how we scale team size sub-linearly to data volumes, while continuing to onboard new use cases.


  1. 0-60: Tesla's Streaming Data Platform. Jesse Yates, Staff Engineer. Kafka Summit SF, September 30th, 2019
  2. Who am I? • Staff Engineer @ Tesla, Big Data • "Big Data" & Cloud specialist • Apache HBase Committer • Apache Phoenix PMC • Occasional blogger • Triathlete
  3. Agenda • Challenges • Design Overview • Build a data flow • Operations
  4. Data Challenges. Usual suspects: • Volume • Velocity • Variety. IoT challenges: • Bursty • Low-latency • Payload explosion
  5. 100s of Powerpacks, 10s of pods/pack, 1000s of signals/sec/pod
  6. Payload Explosion
  7. A little bit of history: trillions/day
  8. Designing an ETL System
  9. Design Requirements • "Just works" • Flexible batching • One-stream, one-app • Scale with multiple degrees of freedom
  10. Kafka Channels • Backpressure, buffering, non-blocking, fault tolerant • Mostly configurable, extendable as needed • Highly composable • Limit functionality to increase operability
  11. Channel Building Blocks: Source, Filter, Batch, Broadcast, Database, FileSystem. Example channels: KafkaToFileSystem, KafkaToDB, KafkaToCanonical, KafkaToKafka
  12. Composable Components: Mirror, Raw to Canonical, Storage, all operating on (K:V) payloads
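The composability idea above can be sketched as pipeline stages that each either transform or drop a record, chained into a channel. This is a minimal illustration of the concept, not Tesla's actual API; the class and method names here are hypothetical.

```java
import java.util.Optional;
import java.util.function.Function;
import java.util.function.Predicate;

// A channel stage maps a record to Optional: present = pass along,
// empty = dropped (e.g. by a filter). Channels compose by chaining stages.
public class Channel<T> {
    private final Function<T, Optional<T>> stage;

    private Channel(Function<T, Optional<T>> stage) { this.stage = stage; }

    // Filter stage: drop records that fail the predicate.
    public static <T> Channel<T> filter(Predicate<T> p) {
        return new Channel<>(t -> p.test(t) ? Optional.of(t) : Optional.<T>empty());
    }

    // Transform stage: rewrite each record.
    public static <T> Channel<T> map(Function<T, T> f) {
        return new Channel<>(t -> Optional.of(f.apply(t)));
    }

    // Compose: the output of this channel feeds the next one.
    public Channel<T> then(Channel<T> next) {
        return new Channel<>(t -> stage.apply(t).flatMap(next.stage));
    }

    public Optional<T> process(T record) { return stage.apply(record); }
}
```

Because every stage has the same shape, building a new flow is mostly configuration: pick blocks and chain them, which is what keeps development time low.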
  13. Let's build a data flow! Not really, because, you know, live demos
  14. A Simple Use Case • Gzipped, custom event format • Collected in an edge Kafka cluster • Land in central DataLake for analysts • … • Profit!
  15. Mirror from Edge Kafka • It's just another Channel application • No surprise bugs or operations • Regex mapping (e.g. edge.cool_data → cool_data, edge.legacy_data) • Sampling
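The regex mapping and sampling from the slide might look like the following sketch: strip an `edge.` prefix to get the destination topic, and keep a deterministic fraction of records by key. The class name and the exact hashing scheme are illustrative assumptions, not the real mirror's code.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class MirrorMapper {
    // Source topics like "edge.cool_data" map to "cool_data".
    private static final Pattern EDGE = Pattern.compile("^edge\\.(.+)$");

    // Unmatched topics are simply not mirrored.
    public static Optional<String> mapTopic(String sourceTopic) {
        Matcher m = EDGE.matcher(sourceTopic);
        return m.matches() ? Optional.of(m.group(1)) : Optional.empty();
    }

    // Deterministic key-hash sampling: keeps roughly `rate` of traffic,
    // and a given key is always either kept or dropped.
    public static boolean sample(String key, double rate) {
        int bucket = Math.floorMod(key.hashCode(), 100);
        return bucket < rate * 100;
    }
}
```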
  16. Our Flow: 10% Mirror
  17. Raw to Canonical • Many-to-many • Decoders: gzip, b64 • Built-in parsers: JSON, CSV • Custom Parser • Flexible for unplanned uses • Highly parallelizable • Pipeline: Kafka Source, Decode, Parse, Produce, Commit
  18. Parser API: public parse(byte[]) :: Iterator<Map<String, Object>> • Exception during call -> skip the record • Exception during iteration -> halt the stream • Sometimes means early materialization
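A custom parser with the slide's shape, `parse(byte[]) -> Iterator<Map<String, Object>>`, could look like this toy CSV parser. It is an illustrative sketch, not one of the built-in parsers; note how a malformed row throws during iteration, which per the contract above halts the stream rather than skipping the record.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.Map;

public class CsvParser {
    private final String[] columns;

    public CsvParser(String... columns) { this.columns = columns; }

    // One payload may yield many records, hence the Iterator.
    public Iterator<Map<String, Object>> parse(byte[] payload) {
        String body = new String(payload, StandardCharsets.UTF_8);
        return Arrays.stream(body.split("\n"))
                .map(line -> {
                    String[] cells = line.split(",", -1);
                    if (cells.length != columns.length) {
                        // Thrown lazily during iteration -> halts the stream.
                        throw new IllegalStateException("bad row: " + line);
                    }
                    Map<String, Object> row = new LinkedHashMap<>();
                    for (int i = 0; i < columns.length; i++) {
                        row.put(columns[i], cells[i]);
                    }
                    return row;
                })
                .iterator();
    }
}
```

The laziness is the subtle part: a parser that must validate the whole payload up front ("early materialization") trades memory for the ability to fail inside `parse()`, where the record can still be skipped.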
  19. Using Our Custom Parser
  20. Our Flow: 10% Mirror → Custom Parser to Avro
  21. KafkaToFileSystem • Parquet format • System-time partitioned • Many-to-one • Native batching • e.g. /root/cool.db/sys_date=2019-09-30/channel.cool_data/somedata.parquet.1
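The system-time partitioned layout on the slide can be captured in a small path-building helper. This is a hypothetical illustration of the convention `/root/<db>/sys_date=<date>/channel.<topic>/<file>`, not the actual KafkaToFileSystem code.

```java
import java.time.LocalDate;

public class PartitionPath {
    // sys_date partitioning uses ingest (system) time, not event time,
    // so late-arriving data still lands in a predictable directory.
    public static String forFile(String root, String db, String topic,
                                 LocalDate sysDate, String fileName) {
        return String.format("%s/%s/sys_date=%s/channel.%s/%s",
                root, db, sysDate, topic, fileName);
    }
}
```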
  22. Our Flow: 10% Mirror → Custom Parser to Avro → FS (HDFS)
  23. Monitoring & Operations
  24. Kafka Monitoring • Kafka is very simple to monitor and observe • One dashboard can tell you everything at a glance • But people don't think in offsets and counts: use % SLOs and time-based lag monitoring
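Time-based lag, as opposed to offset counts, can be sketched as: how far behind "now" is the newest record this consumer has committed, and does that exceed a time SLO? The helper below is an illustrative assumption about the approach, not the freshness tracker from kafka-helmsman.

```java
import java.time.Duration;
import java.time.Instant;

public class FreshnessLag {
    // Lag is measured in wall-clock terms; a future-stamped record
    // clamps to zero rather than going negative.
    public static Duration lag(Instant newestCommittedRecordTime, Instant now) {
        Duration d = Duration.between(newestCommittedRecordTime, now);
        return d.isNegative() ? Duration.ZERO : d;
    }

    // Alerting compares lag to a human-meaningful SLO ("no more than
    // 5 minutes behind"), not to an offset count.
    public static boolean violatesSlo(Duration lag, Duration slo) {
        return lag.compareTo(slo) > 0;
    }
}
```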
  25. Operations • Many open source tools • Kafka Monitor https://github.com/linkedin/kafka-monitor • Burrow https://github.com/linkedin/Burrow • Cruise Control https://github.com/linkedin/cruise-control • Our own tools https://github.com/teslamotors/kafka-helmsman • Freshness tracker • Topic Enforcer • Rolling Restart
  26. Kubernetes • Dynamic scalability • Incidents or usual growth • Handle daily peaks • Load smearing across streams • Not free: infra is non-trivial
  27. What about when things go sideways? • A rack fails • Your database chokes • The network is having a bad day • And your users need their data RIGHT NOW!
  28. Channels Backfill • "Freshest" data can be ingested immediately • Looks just like a regular channel • Just select a range in the past & deploy
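The "select a range in the past" idea can be sketched as a simple time window the backfill channel applies to record timestamps: consume from an earlier position and keep only records inside the requested range, while the regular channel keeps serving the freshest data. This helper is a hypothetical illustration of that filter, not the real backfill implementation.

```java
import java.time.Instant;

public class BackfillRange {
    private final Instant start; // inclusive
    private final Instant end;   // exclusive

    public BackfillRange(Instant start, Instant end) {
        this.start = start;
        this.end = end;
    }

    // A backfill channel drops anything outside its window, so it can be
    // deployed alongside the live channel without double-ingesting data.
    public boolean contains(Instant recordTime) {
        return !recordTime.isBefore(start) && recordTime.isBefore(end);
    }
}
```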
  29. Channels Backfill
  30. Summary • Lots of kinds of data + IoT challenges • Simplicity for operations at scale • Backpressure, non-blocking, high-throughput • Flexible, configuration-based
  31. A note on global warming
  32. Accelerate the world's transition to sustainable energy. We are hiring! Jesse Yates, @jesse_yates, jyates@tesla.com
  33. Backup Slides
  34. Future Directions • S3 storage as first-class citizen • Managing hundreds of flows with multiple steps • Internal library • Self-service flows exposed to users • Open source?
