SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
14. January 2019, Apache Kafka Meetup, Hamburg
Stefan Frehse, Software Architect, Digitalization
Ingesting IoT data in Food Processing
Machines for Food Processing
Example: fish machines
2
Machines for Food Processing
3
11.01.20194
BAADER – Digital opportunities
A data driven food value chain
BAADER approach
Data-driven company by Apache Kafka
Summary
Agenda
BAADER Digitalization
OPPORTUNITIES OF FUTURE VALUE ADD
CURRENT VALUE CREATION
OPPORTUNITIES OF FUTURE VALUE
ADD
Machine information
runtime, speed, energy consumption
Product Data
Size, Weight, Quality
Business Process Data
Supplier, Order, Customer, Shipment
INFORMATION CHAIN
BAADER – Digital Opportunities
Expanding own value added through business process data capturing and using
BAADER Digitalization5
OPPORTUNITY OF FUTURE VALUE ADD OPPORTUNITY OF FUTURE VALUE ADD
11.01.20196
Capturing data from farm to fork
A data-driven Food Value Chain
6
Farm inputs Primary sector Transport Processing Retail Consumers
Farmers/input suppliers
• Optimize yield and operational management
• Anticipate future developments
• Provide product and process guarantees
Offtakers
• Enable market and consumer-driven production
• Shorten and optimize production chains
Value Chain
• Predict market dynamics to anticipate supply/demand and negotiate better prices
• Meet demand by consumers for more transparency, e.g. provenance, production methods
Reduce food waste and increase production
BAADER Digitalization
Recycling
11.01.20197 BAADER Digitalization7
ONE digital platform
as basis for all
future solutions in
the food processing
industry
Our vision
BAADER Approach
* Artificial Intelligence
11.01.20198
Visibility, consensus, performance for a data-driven company
Collecting data along the food value chain
8
Shared, replicated, permissioned
Farm record Transport
record
Weather
record
Processing
record
Packaging
record
Retailers
record
Auditors
record
End user
record
BAADER Digitalization
Single source of truth for a
Data-driven company
11.01.20199
Gathering data for food safety
Traceability
Temperature
Humidity
Air exchange
Transport time
Stunner settings
Time hanging
Transparency
Animal welfare indicators in poultry processing
Very brief overview
Vision: A data-driven company
10
Arbitrary
records
Apache Kafka
Transformation Aggregation
Central data log
Remote Service
Machine R&D
Predictive Maintenance
API
Dashboards / Reports
Collecting Farming and Transportation Records
Gathered from the farming process
● Farming: Chickens grow up and are catched after a
few weeks
● Critical parameters:
○ Temperature
○ Humidity
○ CO2
● GPS positions of the truck to forecast the ETA
○ Optimal preparation for reception
● Load information: number of birds, weight, sex
● => Animal welfare has a strong focus
11
+
Collecting Machine Data
Gathered from the slaughterhouse (factory)
● Installed OPC UA server (M2M protocol) provides
machine data
○ power consumption, voltage, line speed, air &
water temperature, bone size, fillet size,
quality, weight
● local IoT-gateway subscribes data from OPC UA
and forward the data to AWS IoT Core as they
occur
12
Processing record
Example: Collecting data from camera systems
11.01.201913
Camera before
chilling
Camera after
chilling
Upstreaming
steps+
Downstreaming
steps+
10:00 pm 1:00 pm Timeline
BAADER Digitalization
~190 birds/min
Evisceration Line CutUp Line
~190 birds/min
Product quality: Chicken
Collecting data from a camera system
11.01.201914 BAADER Digitalization
● Camera takes 2-4 pictures of
each chicken
● Images are analyzed in
real-time at the edge
● Quality and potential defects
are provided
● Sorting based on the quality
for downstreaming steps
(cutup, whole bird, …)
Data ingestion into Apache Kafka
Keeping the quality
11.01.2019BAADER Digitalization15
Apache Kafka
Cluster
MQTTAdaptor
AWSIoTCore(MQTT)
Elasticsearch / Kibana
MongoDB
AWS S3
KafkaConnector
Demo
Streaming SQL Processing
GPS
Machinedata
Weather
MQTT over TLS
Data source
KSQL and Chicken Quality
Creating a stream
CREATE STREAM quality (
thingId VARCHAR,
value STRUCT<quality STRUCT<code INT>>)
WITH (KAFKA_TOPIC='camera', VALUE_FORMAT='json');
● Creates a KSQL stream of the quality from the camera.
16
KSQL and Chicken Quality
Creating a windowed table for efficiency monitoring
CREATE TABLE empty_shackles_per_5_minutes AS
SELECT thingId, COUNT(*) FROM quality
WINDOW TUMBLING (SIZE 5 MINUTES)
WHERE value->quality->code = 4
GROUP by thingId;
● Create a table over a 5 minute window for empty shackles (code = 4)
● Application / Alerting: Notify factory manager about potential efficiency drop
17
Kafka and Transportation Data
Computing truck arrivals
● Topic that contains the GPS positions of a truck
18
{
"thingId": "4d5cc1d6-50ca-4bf9-bb6c-ddfaa9043d6a",
"thingType": "baader::dt::vehicle::daf::vehicle_geo_position",
"thingVersion": "0",
"value": {
"Latitude":"5X.6155128",
"Longitude":"-X.9469323",
“LastUpdateDataTimeInUtc":"2018-09-14T11:51:44.000Z"
}
"timestamp": 1536955501460
}
ETA Computation
Using KSQL for computing ETA
● KSQL: Added own stateless functions for SQL stream processing
● UDFs: User-defined functions in Java and just putting the compiled jar into the
confluent-platform environment
● Implemented a UDF for computing the ETA using Google Distance API
● Derived a new stream that calls the distance UDF
● The ETA is presented in a dashboard
19
Drive new businesses
Writing streaming applications
● Building a central data lake for machine data (power, voltage, water
consumptions) -> next machine generation
● Providing a transparent view of the entire food value chain
●
● Using easy-to-use streaming SQL (e.g., KSQL, LSQL) to create derived streams
○ Computing windowed reached SLA for the machines
○ Efficiency by computing number of empty shackles
○ Alerts when violating thresholds
20
Value of Apache Kafka
Context Digitalization
● Building streaming applications in the context of IoT
● Event-driven architectures
● Easy experimentation with low cost
● Single source of truth for a data-driven company
● … and the usual suspects
● Fault-tolerance
● Strict ordering
● High-Availability
● Scalability
21
Confluent Cloud Enterprise
● Fully managed Kafka Cluster for dev. and prod.
● Just configure: throughput and retention time
● Excellent support in terms of response and quality
● Secured environment using VPC peering on AWS
● Open-Source contributor to Apache Kafka and its entire ecosystem
22
Values
Today’s numbers
At the very early stage
● 40-50 msg/s
● Total: 140 data points
● 100 GB gathered from a single factory within a few months
● There is much more, but...
23
We are hiring:
https://stackoverflow.com/jobs/companies/baader
Backup
Kibana dashboard (15 min) of the camera system
25

Weitere ähnliche Inhalte

Was ist angesagt?

Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
confluent
 
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
Kai Wähner
 
Event Streaming Architecture for Industry 4.0 - Abdelkrim Hadjidj & Jan Kuni...
Event Streaming Architecture for Industry 4.0 -  Abdelkrim Hadjidj & Jan Kuni...Event Streaming Architecture for Industry 4.0 -  Abdelkrim Hadjidj & Jan Kuni...
Event Streaming Architecture for Industry 4.0 - Abdelkrim Hadjidj & Jan Kuni...
Flink Forward
 
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
confluent
 

Was ist angesagt? (20)

Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
Technical Deep Dive: Using Apache Kafka to Optimize Real-Time Analytics in Fi...
 
Best Practices for Streaming IoT Data with MQTT and Apache Kafka®
Best Practices for Streaming IoT Data with MQTT and Apache Kafka®Best Practices for Streaming IoT Data with MQTT and Apache Kafka®
Best Practices for Streaming IoT Data with MQTT and Apache Kafka®
 
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniertFast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
 
Building a Streaming Pipeline on Kubernetes Using Kafka Connect, KSQLDB & Apa...
Building a Streaming Pipeline on Kubernetes Using Kafka Connect, KSQLDB & Apa...Building a Streaming Pipeline on Kubernetes Using Kafka Connect, KSQLDB & Apa...
Building a Streaming Pipeline on Kubernetes Using Kafka Connect, KSQLDB & Apa...
 
Elastically Scaling Kafka Using Confluent
Elastically Scaling Kafka Using ConfluentElastically Scaling Kafka Using Confluent
Elastically Scaling Kafka Using Confluent
 
Application modernization patterns with apache kafka, debezium, and kubernete...
Application modernization patterns with apache kafka, debezium, and kubernete...Application modernization patterns with apache kafka, debezium, and kubernete...
Application modernization patterns with apache kafka, debezium, and kubernete...
 
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
Bridge to Cloud: Using Apache Kafka to Migrate to AWSBridge to Cloud: Using Apache Kafka to Migrate to AWS
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
 
Fully-Managed, Multi-Tenant Kafka Clusters: Tips, Tricks, and Tools (Christop...
Fully-Managed, Multi-Tenant Kafka Clusters: Tips, Tricks, and Tools (Christop...Fully-Managed, Multi-Tenant Kafka Clusters: Tips, Tricks, and Tools (Christop...
Fully-Managed, Multi-Tenant Kafka Clusters: Tips, Tricks, and Tools (Christop...
 
Bridge Your Kafka Streams to Azure Webinar
Bridge Your Kafka Streams to Azure WebinarBridge Your Kafka Streams to Azure Webinar
Bridge Your Kafka Streams to Azure Webinar
 
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
 
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...
Building Streaming Data Pipelines with Google Cloud Dataflow and Confluent Cl...
 
One Kubernetes to rule them all (ZEUS 2019 Keynote)
One Kubernetes to rule them all (ZEUS 2019 Keynote)One Kubernetes to rule them all (ZEUS 2019 Keynote)
One Kubernetes to rule them all (ZEUS 2019 Keynote)
 
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
The Rise Of Event Streaming – Why Apache Kafka Changes EverythingThe Rise Of Event Streaming – Why Apache Kafka Changes Everything
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
 
Event Streaming Architecture for Industry 4.0 - Abdelkrim Hadjidj & Jan Kuni...
Event Streaming Architecture for Industry 4.0 -  Abdelkrim Hadjidj & Jan Kuni...Event Streaming Architecture for Industry 4.0 -  Abdelkrim Hadjidj & Jan Kuni...
Event Streaming Architecture for Industry 4.0 - Abdelkrim Hadjidj & Jan Kuni...
 
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
Streamsheets and Apache Kafka – Interactively build real-time Dashboards and ...
 
Confluent Cloud for Apache Kafka® | Google Cloud Next ’19
Confluent Cloud for Apache Kafka® | Google Cloud Next ’19Confluent Cloud for Apache Kafka® | Google Cloud Next ’19
Confluent Cloud for Apache Kafka® | Google Cloud Next ’19
 
ksqlDB Workshop
ksqlDB WorkshopksqlDB Workshop
ksqlDB Workshop
 
What's new in confluent platform 5.4 online talk
What's new in confluent platform 5.4 online talkWhat's new in confluent platform 5.4 online talk
What's new in confluent platform 5.4 online talk
 
Real time data processing and model inferncing platform with Kafka streams (N...
Real time data processing and model inferncing platform with Kafka streams (N...Real time data processing and model inferncing platform with Kafka streams (N...
Real time data processing and model inferncing platform with Kafka streams (N...
 
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
 

Ähnlich wie Ingesting IoT data in Food Processing

Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...
Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...
Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...
confluent
 
How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...
How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...
How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...
confluent
 
Machine Learning with Apache Spark
Machine Learning with Apache SparkMachine Learning with Apache Spark
Machine Learning with Apache Spark
IBM Cloud Data Services
 
A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...
A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...
A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...
Ryousei Takano
 

Ähnlich wie Ingesting IoT data in Food Processing (20)

Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...
Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...
Routing IoT Data with KSQL and Kafka Connect (Stefan Frehse, Baader Group) Fr...
 
Adjust Workshop - PUSHING AND PULLING YOUR DATA
Adjust Workshop - PUSHING AND PULLING YOUR DATA Adjust Workshop - PUSHING AND PULLING YOUR DATA
Adjust Workshop - PUSHING AND PULLING YOUR DATA
 
IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...
IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...
IoT Event Processing and Analytics with InfluxDB in Google Cloud | Christoph ...
 
Getting started with azure event hubs and stream analytics services
Getting started with azure event hubs and stream analytics servicesGetting started with azure event hubs and stream analytics services
Getting started with azure event hubs and stream analytics services
 
EDA Meets Data Engineering – What's the Big Deal?
EDA Meets Data Engineering – What's the Big Deal?EDA Meets Data Engineering – What's the Big Deal?
EDA Meets Data Engineering – What's the Big Deal?
 
Christoph Bussler [Google Cloud] | IoT Event Processing and Analytics with In...
Christoph Bussler [Google Cloud] | IoT Event Processing and Analytics with In...Christoph Bussler [Google Cloud] | IoT Event Processing and Analytics with In...
Christoph Bussler [Google Cloud] | IoT Event Processing and Analytics with In...
 
How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...
How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...
How Confluent Cloud Helps for Sustainable Food Processing (Sabrina Eßer, Baad...
 
Serving the Real-Time Data Needs of an Airport with Kafka Streams and KSQL
Serving the Real-Time Data Needs of an Airport with Kafka Streams and KSQLServing the Real-Time Data Needs of an Airport with Kafka Streams and KSQL
Serving the Real-Time Data Needs of an Airport with Kafka Streams and KSQL
 
Cloudera - IoT & Smart Cities
Cloudera - IoT & Smart CitiesCloudera - IoT & Smart Cities
Cloudera - IoT & Smart Cities
 
Cf summit-2016-monitoring-cf-sensu-graphite
Cf summit-2016-monitoring-cf-sensu-graphiteCf summit-2016-monitoring-cf-sensu-graphite
Cf summit-2016-monitoring-cf-sensu-graphite
 
Serhii Kholodniuk: What you need to know, before migrating data platform to G...
Serhii Kholodniuk: What you need to know, before migrating data platform to G...Serhii Kholodniuk: What you need to know, before migrating data platform to G...
Serhii Kholodniuk: What you need to know, before migrating data platform to G...
 
Machine Learning with Apache Spark
Machine Learning with Apache SparkMachine Learning with Apache Spark
Machine Learning with Apache Spark
 
OSDC 2019 | Democratizing Data at Go-JEK by Maulik Soneji
OSDC 2019 | Democratizing Data at Go-JEK by Maulik SonejiOSDC 2019 | Democratizing Data at Go-JEK by Maulik Soneji
OSDC 2019 | Democratizing Data at Go-JEK by Maulik Soneji
 
Distributed Solar Systems: Revolutionizing Time-Series Predictive Analytics, ...
Distributed Solar Systems: Revolutionizing Time-Series Predictive Analytics, ...Distributed Solar Systems: Revolutionizing Time-Series Predictive Analytics, ...
Distributed Solar Systems: Revolutionizing Time-Series Predictive Analytics, ...
 
A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...
A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...
A Scalable and Distributed Electrical Power Monitoring System Utilizing Cloud...
 
Samsara Fleet Monitoring/Reefer Temperature Solution
Samsara Fleet Monitoring/Reefer Temperature SolutionSamsara Fleet Monitoring/Reefer Temperature Solution
Samsara Fleet Monitoring/Reefer Temperature Solution
 
Industry 4.0 for beginners
Industry 4.0 for beginnersIndustry 4.0 for beginners
Industry 4.0 for beginners
 
SOLUSI INDUSTRIAL IOT CONTEC
SOLUSI INDUSTRIAL IOT CONTECSOLUSI INDUSTRIAL IOT CONTEC
SOLUSI INDUSTRIAL IOT CONTEC
 
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
 
IoT Connected Brewery
IoT Connected BreweryIoT Connected Brewery
IoT Connected Brewery
 

Mehr von confluent

Mehr von confluent (20)

Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Santander Stream Processing with Apache Flink
Santander Stream Processing with Apache FlinkSantander Stream Processing with Apache Flink
Santander Stream Processing with Apache Flink
 
Unlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insightsUnlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insights
 
Workshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con FlinkWorkshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con Flink
 
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
 
AWS Immersion Day Mapfre - Confluent
AWS Immersion Day Mapfre   -   ConfluentAWS Immersion Day Mapfre   -   Confluent
AWS Immersion Day Mapfre - Confluent
 
Eventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalkEventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalk
 
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent CloudQ&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
 
Citi TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep DiveCiti TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep Dive
 
Build real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with ConfluentBuild real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with Confluent
 
Q&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service MeshQ&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service Mesh
 
Citi Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka MicroservicesCiti Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka Microservices
 
Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3
 
Citi Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging ModernizationCiti Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging Modernization
 
Citi Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time dataCiti Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time data
 
Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2
 
Data In Motion Paris 2023
Data In Motion Paris 2023Data In Motion Paris 2023
Data In Motion Paris 2023
 
Confluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with SynthesisConfluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with Synthesis
 
The Future of Application Development - API Days - Melbourne 2023
The Future of Application Development - API Days - Melbourne 2023The Future of Application Development - API Days - Melbourne 2023
The Future of Application Development - API Days - Melbourne 2023
 
The Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data StreamsThe Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data Streams
 

Kürzlich hochgeladen

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Kürzlich hochgeladen (20)

DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 

Ingesting IoT data in Food Processing

  • 1. 14. January 2019, Apache Kafka Meetup, Hamburg Stefan Frehse, Software Architect, Digitalization Ingesting IoT data in Food Processing
  • 2. Machines for Food Processing Example: fish machines 2
  • 3. Machines for Food Processing 3
  • 4. 11.01.20194 BAADER – Digital opportunities A data driven food value chain BAADER approach Data-driven company by Apache Kafka Summary Agenda BAADER Digitalization
  • 5. OPPORTUNITIES OF FUTURE VALUE ADD CURRENT VALUE CREATION OPPORTUNITIES OF FUTURE VALUE ADD Machine information runtime, speed, energy consumption Product Data Size, Weight, Quality Business Process Data Supplier, Order, Customer, Shipment INFORMATION CHAIN BAADER – Digital Opportunities Expanding own value added through business process data capturing and using BAADER Digitalization5 OPPORTUNITY OF FUTURE VALUE ADD OPPORTUNITY OF FUTURE VALUE ADD
  • 6. 11.01.20196 Capturing data from farm to fork A data-driven Food Value Chain 6 Farm inputs Primary sector Transport Processing Retail Consumers Farmers/input suppliers • Optimize yield and operational management • Anticipate future developments • Provide product and process guarantees Offtakers • Enable market and consumer-driven production • Shorten and optimize production chains Value Chain • Predict market dynamics to anticipate supply/demand and negotiate better prices • Meet demand by consumers for more transparency, e.g. provenance, production methods Reduce food waste and increase production BAADER Digitalization Recycling
  • 7. 11.01.20197 BAADER Digitalization7 ONE digital platform as basis for all future solutions in the food processing industry Our vision BAADER Approach * Artificial Intelligence
  • 8. 11.01.20198 Visibility, consensus, performance for a data-driven company Collecting data along the food value chain 8 Shared, replicated, permissioned Farm record Transport record Weather record Processing record Packaging record Retailers record Auditors record End user record BAADER Digitalization Single source of truth for a Data-driven company
  • 9. 11.01.20199 Gathering data for food safety Traceability Temperature Humidity Air exchange Transport time Stunner settings Time hanging Transparency Animal welfare indicators in poultry processing
  • 10. Very brief overview Vision: A data-driven company 10 Arbitrary records Apache Kafka Transformation Aggregation Central data log Remote Service Machine R&D Predictive Maintenance API Dashboards / Reports
  • 11. Collecting Farming and Transportation Records Gathered from the farming process ● Farming: Chickens grow up and are catched after a few weeks ● Critical parameters: ○ Temperature ○ Humidity ○ CO2 ● GPS positions of the truck to forecast the ETA ○ Optimal preparation for reception ● Load information: number of birds, weight, sex ● => Animal welfare has a strong focus 11 +
  • 12. Collecting Machine Data Gathered from the slaughterhouse (factory) ● Installed OPC UA server (M2M protocol) provides machine data ○ power consumption, voltage, line speed, air & water temperature, bone size, fillet size, quality, weight ● local IoT-gateway subscribes data from OPC UA and forward the data to AWS IoT Core as they occur 12
  • 13. Processing record Example: Collecting data from camera systems 11.01.201913 Camera before chilling Camera after chilling Upstreaming steps+ Downstreaming steps+ 10:00 pm 1:00 pm Timeline BAADER Digitalization ~190 birds/min Evisceration Line CutUp Line ~190 birds/min
  • 14. Product quality: Chicken Collecting data from a camera system 11.01.201914 BAADER Digitalization ● Camera takes 2-4 pictures of each chicken ● Images are analyzed in real-time at the edge ● Quality and potential defects are provided ● Sorting based on the quality for downstreaming steps (cutup, whole bird, …)
  • 15. Data ingestion into Apache Kafka Keeping the quality 11.01.2019BAADER Digitalization15 Apache Kafka Cluster MQTTAdaptor AWSIoTCore(MQTT) Elasticsearch / Kibana MongoDB AWS S3 KafkaConnector Demo Streaming SQL Processing GPS Machinedata Weather MQTT over TLS Data source
  • 16. KSQL and Chicken Quality Creating a stream CREATE STREAM quality ( thingId VARCHAR, value STRUCT<quality STRUCT<code INT>>) WITH (KAFKA_TOPIC='camera', VALUE_FORMAT='json'); ● Creates a KSQL stream of the quality from the camera. 16
  • 17. KSQL and Chicken Quality Creating a windowed table for efficiency monitoring CREATE TABLE empty_shackles_per_5_minutes AS SELECT thingId, COUNT(*) FROM quality WINDOW TUMBLING (SIZE 5 MINUTES) WHERE value->quality->code = 4 GROUP by thingId; ● Create a table over a 5 minute window for empty shackles (code = 4) ● Application / Alerting: Notify factory manager about potential efficiency drop 17
  • 18. Kafka and Transportation Data Computing truck arrivals ● Topic that contains the GPS positions of a truck 18 { "thingId": "4d5cc1d6-50ca-4bf9-bb6c-ddfaa9043d6a", "thingType": "baader::dt::vehicle::daf::vehicle_geo_position", "thingVersion": "0", "value": { "Latitude":"5X.6155128", "Longitude":"-X.9469323", “LastUpdateDataTimeInUtc":"2018-09-14T11:51:44.000Z" } "timestamp": 1536955501460 }
  • 19. ETA Computation Using KSQL for computing ETA ● KSQL: Added own stateless functions for SQL stream processing ● UDFs: User-defined functions in Java and just putting the compiled jar into the confluent-platform environment ● Implemented a UDF for computing the ETA using Google Distance API ● Derived a new stream that calls the distance UDF ● The ETA is presented in a dashboard 19
  • 20. Drive new businesses Writing streaming applications ● Building a central data lake for machine data (power, voltage, water consumptions) -> next machine generation ● Providing a transparent view of the entire food value chain ● ● Using easy-to-use streaming SQL (e.g., KSQL, LSQL) to create derived streams ○ Computing windowed reached SLA for the machines ○ Efficiency by computing number of empty shackles ○ Alerts when violating thresholds 20
  • 21. Value of Apache Kafka Context Digitalization ● Building streaming applications in the context of IoT ● Event-driven architectures ● Easy experimentation with low cost ● Single source of truth for a data-driven company ● … and the usual suspects ● Fault-tolerance ● Strict ordering ● High-Availability ● Scalability 21
  • 22. Confluent Cloud Enterprise ● Fully managed Kafka Cluster for dev. and prod. ● Just configure: throughput and retention time ● Excellent support in terms of response and quality ● Secured environment using VPC peering on AWS ● Open-Source contributor to Apache Kafka and its entire ecosystem 22 Values
  • 23. Today’s numbers At the very early stage ● 40-50 msg/s ● Total: 140 data points ● 100 GB gathered from a single factory within a few months ● There is much more, but... 23
  • 25. Backup Kibana dashboard (15 min) of the camera system 25