SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Downloaden Sie, um offline zu lesen
ASRC FEDERAL
Kafka and the Satellite Operations Domain
Artificial Intelligence, Machine Learning, and Streaming Support for
Satellite Data Operations
Eric Velte
Chief Technology Officer
ASRC Federal
Agenda 2
Overview of the Satellite Command and Data Acquisition (SCDAS)
domain
Opportunities for AI & ML and tools like Confluent and Kafka
Mission Operator Assistant Phase 1 – establishment of a Minimum
Viable Product (MVP)
What we learned from our initial prototype
Phase 2, deployment and future enhancements
ASRC Federal Proprietary
3
At a Glance
Our Federal Markets
• Space
• Defense
• Intelligence Community
• National Security
• Health
• Civilian
~8,000
Employees
44
States,
districts,
territories we
operate in
32
Subsidiary
companies
supporting
federal
government
2003
Founded
40%
Employees in
engineering,
development,
and analytics
46%
Employees
with Secret or
Top-Secret
clearances
Our Problem Space 4
More than 2,500 operating
satellites in space
Commanding, controlling,
and acquiring information is
a full-time job!
IT and Software have
evolved over time
ASRC Federal supports the engineering and day-to-day
operation of satellite constellations for NASA, NOAA, and others
Challenges – The Operations Landscape 5
• Traditional satellite data
ingestion, storage and
processing are very complex,
very human-intensive, somewhat
slow and very expensive
System undergoes
constant technical
refactoring and re-
imagining with modern
tools and techniques
NOAA Wallops CDAS,
Wallops Island, VA
Public Product Viewer –
GOES
Satellite Ground
Architecture – GOES-R
101101101110011010100101
010010101001001001001111
The Legacy Implementation 6
Problem detection and troubleshooting
can take days to understand and solve
due to multiple challenges
<…
while (1)
grep “Error” goesr17/log.data>
end
…>
Improve Situational Awareness with MOA
Mission Operations Assistant
LACK OF VISIBILITY PUTS MISSIONS AT RISK
• Difficult for Operations to identify &
Analyze time-sensitive logfiles
• Lack of centralized log file structure,
arcane manual analytical procedures
• No easy way for the user to identify
when the message was seen last
GOAL OF MOA: EXTEND HUMAN CAPABILITIES THROUGH AI/ML
• Reduce the number of Tier-1 false positive error messages or indicators
• Minimize escalation to Tier-2 (OSPO/EMOSS), Tier-3 (Maint. & Sust.) and Tier-4 (factory)
• Leverage ML to minimize log-file noise, improve troubleshooting efficiently & identify impact across
assets
Loss of
situational
awareness
Time
consuming
manual
effort
Log file
overload
Missing
Critical
Alerts
Artificial Intelligence in the Satellite Domain
Bringing Efficiency to Satellite
Mission Operations
8
NOAA’s Satellite Operations
Facility is a staffing-
intensive operation
Satellite mission challenges
are hard to detect,
troubleshoot and repair
Introduction of automation
reduces troubleshooting
time from days to hours.
ASRC Federal introduced automation to assist
operator in detecting operations and control issues
Pixel Striping Anomaly
Human in the Loop –
Operator Alerting
9
Reaction wheels wear out with use.
Team was able to determine reaction
wheel slippage and develop an
operating procedure that minimized
wear-and-tear, potentially
prolonging spacecraft life
AI & ML are also useful tools for the
prediction of failures
The MOA Minimum Viable Product (MVP) 10
Log & TT&C Data
MOA Front End
Satellite-Specific Insights
MOA Features
SEARCH ENGINE
ALARM VIEW
IMPACT MAP
• Enhanced search capabilities
• Build centralized knowledge base by allowing
user to add notes to event type
• Show graphical timeline of occurrences for
improved visualizations
• Instrumented to capture user interaction for ML training
• Build centralized knowledge base by allowing user
to add notes to event type
• Show graphical timeline of occurrences for improved
visualizations
• ML enabled view to minimize extraneous noise on display
• Leverage ML to predict impacts prior to
completion of product generation
• Maintains a history of impacts for easy
reference
Prototype screenshots
Tools in our Development and Operations Space
Our AI/ML programs
Cloud native AI / ML solutions such as AWS
SageMaker, MS Cognitive Services, and Google
Analytics are all big enablers of mission
success. We also develop our own domain-
specific & custom AI/ML frameworks
Continuous Integration and Continuous
Development help us design, deliver,
deploy, monitor and maintain at the speed
of need!
*Bounded Computational AI & Cognitive Computing
Satellite Data Log Management
From Ingest to Insight – Phase 1 (Completed 2Q2021)
13
1. Ingest of log information from
server-based log files
1
2
3
4
2. Correlation of data across
multiple sources using Kafka
3. Storage of data into Elastic and
Postgres containers
4. Asynchronous access and
update of data from the front
end
• Enables us to use event driven streaming of satellite logs
• Enables us to easily scale Logstash, backend, and other microservice instances
• Redundancy, stability, fault-tolerant for client services
• Allows multiple consumers for each topic
Satellite Data Log Management
Microservices Communications and Segmentation – Phase 2
14
• Driving towards a real-time system for
anomaly detection and problem resolution
• Current tools are tightly coupled and rely
on proprietary communications middleware
like TCP and direct connections between
components
Using Kafka:
• Promotes information sharing to broader
communities
• No need to determine who they are (policy)
or where the users are located
• Scalability improvements for message
surges
Phase 2: Expand Kafka into the microservices
environment as a total system messaging bus
Satellite Data Pipeline
Partnered and working with Confluent to expand
Kafka into the microservices environment
Evaluating Confluent
components:
• Confluent for Kubernetes for scalable, elastic
deployment and management
• RBAC for role management and access to
specific topics
• Kafka Connect to quick and reliable
connectivity to external data sources
• Schema Registry and Schema Validation for
satellite model management and evolution
• Kafka Streams and ksqlDB for real-time data
enrichment, aggregation, and filtering
• ksqlDB UDFs for anomaly detection
algorithms
MOA SOLUTION
• Automatic real-time aggregation of multi-
system log files
• Enhanced search capabilities
• Correlate impact analysis
• Identify historic patterns & predict analytics to
avoid future mission critical outage
• Extend human capabilities through the
use of AI/ML
VALUE TO CUSTOMER
• Risk mitigation and cost effectiveness.
• Extend life of critical satellite constellations
• Increased situational awareness satellite
h&s
• Consistent results across user base
• Proactive preemptive maintenance
• Centralized source of best practice
knowledge base
• Designed to be used for any mission w/
standardized logging systems
16
MOA satisfies unmet needs identified at 2017 GSAW :
“We think that the set of events/log message enhancements will provide powerful capabilities
for the mission user regardless of Agency or type of mission.”
“Placing an emphasis on non-telemetry analysis opens up a new area of data mining, analytics
and tool development [in satellite operations]–we think the users will help identify even more
functions.”
https://ntrs.nasa.gov/citations/20170002304
NOAA’s End-State Vision
Common Cloud Framework
17
Many types of satellite data are a
public commodity
• Weather
• Oceanographic
• Imagery
Traditional data ingestion, storage
and processing are cost-effective and
performed as-a-service
Cloud-based solutions enhance data
proximity & enhance user experience
Kafka is a key to the framework of
the future
Security and role-based access are
keys
Streaming solutions like Kafka and
Confluent tooling simplify and speed up
data flows from ingest to dissemination
Moving Forward with
ASRC Federal
18
Alaskan company with more than
8,000 employees
• Engineering
• Software
• Digital Modernization
• Professional Services
• Infrastructure
More than 1,500 employees working
with NASA, NOAA and others
20+ year history solving complex
technology and software problems
for the federal government
Summary 19
ASRC Federal is partnered with NASA and NOAA to modernize the
ground station architecture in accordance with agency goals and
roadmaps
Cloud migration and modern tooling are both keys to that success,
including better dependability and an anticipated cost reduction
We successfully used Kafka to implement the MOA message-passing
infrastructure in a phase 1 MVP; seeking to expand its use in follow-
on development
Quyanaq!
Thank you!
Eric Velte
Chief Technology Officer
e: evelte@asrcfederal.com

Weitere ähnliche Inhalte

Was ist angesagt?

How Pulsar Enables Netdata to Offer Unlimited Infrastructure Monitoring for F...
How Pulsar Enables Netdata to Offer Unlimited Infrastructure Monitoring for F...How Pulsar Enables Netdata to Offer Unlimited Infrastructure Monitoring for F...
How Pulsar Enables Netdata to Offer Unlimited Infrastructure Monitoring for F...
StreamNative
 

Was ist angesagt? (20)

Apache Kafka
Apache KafkaApache Kafka
Apache Kafka
 
A Kafka journey and why migrate to Confluent Cloud?
A Kafka journey and why migrate to Confluent Cloud?A Kafka journey and why migrate to Confluent Cloud?
A Kafka journey and why migrate to Confluent Cloud?
 
Set your Data in Motion with Confluent & Apache Kafka Tech Talk Series LME
Set your Data in Motion with Confluent & Apache Kafka Tech Talk Series LMESet your Data in Motion with Confluent & Apache Kafka Tech Talk Series LME
Set your Data in Motion with Confluent & Apache Kafka Tech Talk Series LME
 
From Message to Cluster: A Realworld Introduction to Kafka Capacity Planning
From Message to Cluster: A Realworld Introduction to Kafka Capacity PlanningFrom Message to Cluster: A Realworld Introduction to Kafka Capacity Planning
From Message to Cluster: A Realworld Introduction to Kafka Capacity Planning
 
Kafka in Space | Kafka Summit London 2022
 Kafka in Space | Kafka Summit London 2022 Kafka in Space | Kafka Summit London 2022
Kafka in Space | Kafka Summit London 2022
 
Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...
Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...
Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...
 
Hadoop Summit Tokyo Apache NiFi Crash Course
Hadoop Summit Tokyo Apache NiFi Crash CourseHadoop Summit Tokyo Apache NiFi Crash Course
Hadoop Summit Tokyo Apache NiFi Crash Course
 
CDC patterns in Apache Kafka®
CDC patterns in Apache Kafka®CDC patterns in Apache Kafka®
CDC patterns in Apache Kafka®
 
Kafka Streams: What it is, and how to use it?
Kafka Streams: What it is, and how to use it?Kafka Streams: What it is, and how to use it?
Kafka Streams: What it is, and how to use it?
 
Real-time Analytics with Apache Flink and Druid
Real-time Analytics with Apache Flink and DruidReal-time Analytics with Apache Flink and Druid
Real-time Analytics with Apache Flink and Druid
 
Kafka + Uber- The World’s Realtime Transit Infrastructure, Aaron Schildkrout
Kafka + Uber- The World’s Realtime Transit Infrastructure, Aaron SchildkroutKafka + Uber- The World’s Realtime Transit Infrastructure, Aaron Schildkrout
Kafka + Uber- The World’s Realtime Transit Infrastructure, Aaron Schildkrout
 
Reliability Guarantees for Apache Kafka
Reliability Guarantees for Apache KafkaReliability Guarantees for Apache Kafka
Reliability Guarantees for Apache Kafka
 
Exploring the power of OpenTelemetry on Kubernetes
Exploring the power of OpenTelemetry on KubernetesExploring the power of OpenTelemetry on Kubernetes
Exploring the power of OpenTelemetry on Kubernetes
 
Apache Kafka® and API Management
Apache Kafka® and API ManagementApache Kafka® and API Management
Apache Kafka® and API Management
 
Fundamentals of Apache Kafka
Fundamentals of Apache KafkaFundamentals of Apache Kafka
Fundamentals of Apache Kafka
 
Everything you ever needed to know about Kafka on Kubernetes but were afraid ...
Everything you ever needed to know about Kafka on Kubernetes but were afraid ...Everything you ever needed to know about Kafka on Kubernetes but were afraid ...
Everything you ever needed to know about Kafka on Kubernetes but were afraid ...
 
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)
 
Cilium + Istio with Gloo Mesh
Cilium + Istio with Gloo MeshCilium + Istio with Gloo Mesh
Cilium + Istio with Gloo Mesh
 
Deploying Confluent Platform for Production
Deploying Confluent Platform for ProductionDeploying Confluent Platform for Production
Deploying Confluent Platform for Production
 
How Pulsar Enables Netdata to Offer Unlimited Infrastructure Monitoring for F...
How Pulsar Enables Netdata to Offer Unlimited Infrastructure Monitoring for F...How Pulsar Enables Netdata to Offer Unlimited Infrastructure Monitoring for F...
How Pulsar Enables Netdata to Offer Unlimited Infrastructure Monitoring for F...
 

Ähnlich wie Kafka Migration for Satellite Event Streaming Data | Eric Velte, ASRC Federal

Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...
Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...
Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...
AIRCC Publishing Corporation
 
CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...
CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...
CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...
ijcsit
 
Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...
Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...
Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...
AIRCC Publishing Corporation
 
Unit i introduction to grid computing
Unit i   introduction to grid computingUnit i   introduction to grid computing
Unit i introduction to grid computing
sudha kar
 
Ogce Workflow Suite
Ogce Workflow SuiteOgce Workflow Suite
Ogce Workflow Suite
smarru
 
CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...
CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...
CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...
Farley Lai
 
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
Tal Lavian Ph.D.
 

Ähnlich wie Kafka Migration for Satellite Event Streaming Data | Eric Velte, ASRC Federal (20)

Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...
Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...
Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...
 
CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...
CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...
CYBER INFRASTRUCTURE AS A SERVICE TO EMPOWER MULTIDISCIPLINARY, DATA-DRIVEN S...
 
3 rd International Conference on Signal Processing, VLSI Design & Communicati...
3 rd International Conference on Signal Processing, VLSI Design & Communicati...3 rd International Conference on Signal Processing, VLSI Design & Communicati...
3 rd International Conference on Signal Processing, VLSI Design & Communicati...
 
Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...
Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...
Cyber Infrastructure as a Service to Empower Multidisciplinary, Data-Driven S...
 
MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data PlatformMapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data Platform
 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
 
DGterzo
DGterzoDGterzo
DGterzo
 
Velocity San Jose 2017: Traffic shifts: Avoiding disasters at scale
Velocity San Jose 2017: Traffic shifts: Avoiding disasters at scaleVelocity San Jose 2017: Traffic shifts: Avoiding disasters at scale
Velocity San Jose 2017: Traffic shifts: Avoiding disasters at scale
 
grid mining
grid mininggrid mining
grid mining
 
Unit i introduction to grid computing
Unit i   introduction to grid computingUnit i   introduction to grid computing
Unit i introduction to grid computing
 
Ogce Workflow Suite
Ogce Workflow SuiteOgce Workflow Suite
Ogce Workflow Suite
 
Splunk App for Stream for Enhanced Operational Intelligence from Wire Data
Splunk App for Stream for Enhanced Operational Intelligence from Wire DataSplunk App for Stream for Enhanced Operational Intelligence from Wire Data
Splunk App for Stream for Enhanced Operational Intelligence from Wire Data
 
Paper444012-4014
Paper444012-4014Paper444012-4014
Paper444012-4014
 
A Reconfigurable Component-Based Problem Solving Environment
A Reconfigurable Component-Based Problem Solving EnvironmentA Reconfigurable Component-Based Problem Solving Environment
A Reconfigurable Component-Based Problem Solving Environment
 
IRJET-Framework for Dynamic Resource Allocation and Efficient Scheduling Stra...
IRJET-Framework for Dynamic Resource Allocation and Efficient Scheduling Stra...IRJET-Framework for Dynamic Resource Allocation and Efficient Scheduling Stra...
IRJET-Framework for Dynamic Resource Allocation and Efficient Scheduling Stra...
 
CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...
CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...
CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...
 
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
A Platform for Large-Scale Grid Data Service on Dynamic High-Performance Netw...
 
Cluster and Grid Computing
Cluster and Grid ComputingCluster and Grid Computing
Cluster and Grid Computing
 
Iscram 2008 presentation
Iscram 2008 presentationIscram 2008 presentation
Iscram 2008 presentation
 
Cloud Computing: Architecture, IT Security and Operational Perspectives
Cloud Computing: Architecture, IT Security and Operational PerspectivesCloud Computing: Architecture, IT Security and Operational Perspectives
Cloud Computing: Architecture, IT Security and Operational Perspectives
 

Mehr von HostedbyConfluent

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
HostedbyConfluent
 
Evolution of NRT Data Ingestion Pipeline at Trendyol
Evolution of NRT Data Ingestion Pipeline at TrendyolEvolution of NRT Data Ingestion Pipeline at Trendyol
Evolution of NRT Data Ingestion Pipeline at Trendyol
HostedbyConfluent
 

Mehr von HostedbyConfluent (20)

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Renaming a Kafka Topic | Kafka Summit London
Renaming a Kafka Topic | Kafka Summit LondonRenaming a Kafka Topic | Kafka Summit London
Renaming a Kafka Topic | Kafka Summit London
 
Evolution of NRT Data Ingestion Pipeline at Trendyol
Evolution of NRT Data Ingestion Pipeline at TrendyolEvolution of NRT Data Ingestion Pipeline at Trendyol
Evolution of NRT Data Ingestion Pipeline at Trendyol
 
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
Ensuring Kafka Service Resilience: A Dive into Health-Checking TechniquesEnsuring Kafka Service Resilience: A Dive into Health-Checking Techniques
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
 
Exactly-once Stream Processing with Arroyo and Kafka
Exactly-once Stream Processing with Arroyo and KafkaExactly-once Stream Processing with Arroyo and Kafka
Exactly-once Stream Processing with Arroyo and Kafka
 
Fish Plays Pokemon | Kafka Summit London
Fish Plays Pokemon | Kafka Summit LondonFish Plays Pokemon | Kafka Summit London
Fish Plays Pokemon | Kafka Summit London
 
Tiered Storage 101 | Kafla Summit London
Tiered Storage 101 | Kafla Summit LondonTiered Storage 101 | Kafla Summit London
Tiered Storage 101 | Kafla Summit London
 
Building a Self-Service Stream Processing Portal: How And Why
Building a Self-Service Stream Processing Portal: How And WhyBuilding a Self-Service Stream Processing Portal: How And Why
Building a Self-Service Stream Processing Portal: How And Why
 
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
 
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
 
Navigating Private Network Connectivity Options for Kafka Clusters
Navigating Private Network Connectivity Options for Kafka ClustersNavigating Private Network Connectivity Options for Kafka Clusters
Navigating Private Network Connectivity Options for Kafka Clusters
 
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
Apache Flink: Building a Company-wide Self-service Streaming Data PlatformApache Flink: Building a Company-wide Self-service Streaming Data Platform
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
 
Explaining How Real-Time GenAI Works in a Noisy Pub
Explaining How Real-Time GenAI Works in a Noisy PubExplaining How Real-Time GenAI Works in a Noisy Pub
Explaining How Real-Time GenAI Works in a Noisy Pub
 
TL;DR Kafka Metrics | Kafka Summit London
TL;DR Kafka Metrics | Kafka Summit LondonTL;DR Kafka Metrics | Kafka Summit London
TL;DR Kafka Metrics | Kafka Summit London
 
A Window Into Your Kafka Streams Tasks | KSL
A Window Into Your Kafka Streams Tasks | KSLA Window Into Your Kafka Streams Tasks | KSL
A Window Into Your Kafka Streams Tasks | KSL
 
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
Mastering Kafka Producer Configs: A Guide to Optimizing PerformanceMastering Kafka Producer Configs: A Guide to Optimizing Performance
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
 
Data Contracts Management: Schema Registry and Beyond
Data Contracts Management: Schema Registry and BeyondData Contracts Management: Schema Registry and Beyond
Data Contracts Management: Schema Registry and Beyond
 
Code-First Approach: Crafting Efficient Flink Apps
Code-First Approach: Crafting Efficient Flink AppsCode-First Approach: Crafting Efficient Flink Apps
Code-First Approach: Crafting Efficient Flink Apps
 
Debezium vs. the World: An Overview of the CDC Ecosystem
Debezium vs. the World: An Overview of the CDC EcosystemDebezium vs. the World: An Overview of the CDC Ecosystem
Debezium vs. the World: An Overview of the CDC Ecosystem
 
Beyond Tiered Storage: Serverless Kafka with No Local Disks
Beyond Tiered Storage: Serverless Kafka with No Local DisksBeyond Tiered Storage: Serverless Kafka with No Local Disks
Beyond Tiered Storage: Serverless Kafka with No Local Disks
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Kürzlich hochgeladen (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 

Kafka Migration for Satellite Event Streaming Data | Eric Velte, ASRC Federal

  • 1. ASRC FEDERAL Kafka and the Satellite Operations Domain Artificial Intelligence, Machine Learning, and Streaming Support for Satellite Data Operations Eric Velte Chief Technology Officer ASRC Federal
  • 2. Agenda 2 Overview of the Satellite Command and Data Acquisition (SCDAS) domain Opportunities for AI & ML and tools like Confluent and Kafka Mission Operator Assistant Phase 1 – establishment of a Minimum Viable Product (MVP) What we learned from our initial prototype Phase 2, deployment and future enhancements
  • 3. ASRC Federal Proprietary 3 At a Glance Our Federal Markets • Space • Defense • Intelligence Community • National Security • Health • Civilian ~8,000 Employees 44 States, districts, territories we operate in 32 Subsidiary companies supporting federal government 2003 Founded 40% Employees in engineering, development, and analytics 46% Employees with Secret or Top-Secret clearances
  • 4. Our Problem Space 4 More than 2,500 operating satellites in space Commanding, controlling, and acquiring information is a full-time job! IT and Software have evolved over time ASRC Federal supports the engineering and day-to-day operation of satellite constellations for NASA, NOAA, and others
  • 5. Challenges – The Operations Landscape 5 • Traditional satellite data ingestion, storage and processing are very complex, very human-intensive, somewhat slow and very expensive System undergoes constant technical refactoring and re- imagining with modern tools and techniques NOAA Wallops CDAS, Wallops Island, VA Public Product Viewer – GOES Satellite Ground Architecture – GOES-R
  • 6. 101101101110011010100101 010010101001001001001111 The Legacy Implementation 6 Problem detection and troubleshooting can take days to understand and solve due to multiple challenges <… while (1) grep “Error” goesr17/log.data> end …>
  • 7. Improve Situational Awareness with MOA Mission Operations Assistant LACK OF VISIBILITY PUTS MISSIONS AT RISK • Difficult for Operations to identify & Analyze time-sensitive logfiles • Lack of centralized log file structure, arcane manual analytical procedures • No easy way for the user to identify when the message was seen last GOAL OF MOA: EXTEND HUMAN CAPABILITIES THROUGH AI/ML • Reduce the number of Tier-1 false positive error messages or indicators • Minimize escalation to Tier-2 (OSPO/EMOSS), Tier-3 (Maint. & Sust.) and Tier-4 (factory) • Leverage ML to minimize log-file noise, improve troubleshooting efficiently & identify impact across assets Loss of situational awareness Time consuming manual effort Log file overload Missing Critical Alerts
  • 8. Artificial Intelligence in the Satellite Domain Bringing Efficiency to Satellite Mission Operations 8 NOAA’s Satellite Operations Facility is a staffing- intensive operation Satellite mission challenges are hard to detect, troubleshoot and repair Introduction of automation reduces troubleshooting time from days to hours. ASRC Federal introduced automation to assist operator in detecting operations and control issues Pixel Striping Anomaly
  • 9. Human in the Loop – Operator Alerting 9 Reaction wheels wear out with use. Team was able to determine reaction wheel slippage and develop an operating procedure that minimized wear-and-tear, potentially prolonging spacecraft life AI & ML are also useful tools for the prediction of failures
  • 10. The MOA Minimum Viable Product (MVP) 10 Log & TT&C Data MOA Front End Satellite-Specific Insights
  • 11. MOA Features SEARCH ENGINE ALARM VIEW IMPACT MAP • Enhanced search capabilities • Build centralized knowledge base by allowing user to add notes to event type • Show graphical timeline of occurrences for improved visualizations • Instrumented to capture user interaction for ML training • Build centralized knowledge base by allowing user to add notes to event type • Show graphical timeline of occurrences for improved visualizations • ML enabled view to minimize extraneous noise on display • Leverage ML to predict impacts prior to completion of product generation • Maintains a history of impacts for easy reference Prototype screenshots
  • 12. Tools in our Development and Operations Space Our AI/ML programs Cloud native AI / ML solutions such as AWS SageMaker, MS Cognitive Services, and Google Analytics are all big enablers of mission success. We also develop our own domain- specific & custom AI/ML frameworks Continuous Integration and Continuous Development help us design, deliver, deploy, monitor and maintain at the speed of need! *Bounded Computational AI & Cognitive Computing
  • 13. Satellite Data Log Management From Ingest to Insight – Phase 1 (Completed 2Q2021) 13 1. Ingest of log information from server-based log files 1 2 3 4 2. Correlation of data across multiple sources using Kafka 3. Storage of data into Elastic and Postgres containers 4. Asynchronous access and update of data from the front end • Enables us to use event driven streaming of satellite logs • Enables us to easily scale Logstash, backend, and other microservice instances • Redundancy, stability, fault-tolerant for client services • Allows multiple consumers for each topic
  • 14. Satellite Data Log Management Microservices Communications and Segmentation – Phase 2 14 • Driving towards a real-time system for anomaly detection and problem resolution • Current tools are tightly coupled and rely on proprietary communications middleware like TCP and direct connections between components Using Kafka: • Promotes information sharing to broader communities • No need to determine who they are (policy) or where the users are located • Scalability improvements for message surges Phase 2: Expand Kafka into the microservices environment as a total system messaging bus
  • 15. Satellite Data Pipeline Partnered and working with Confluent to expand Kafka into the microservices environment Evaluating Confluent components: • Confluent for Kubernetes for scalable, elastic deployment and management • RBAC for role management and access to specific topics • Kafka Connect to quick and reliable connectivity to external data sources • Schema Registry and Schema Validation for satellite model management and evolution • Kafka Streams and ksqlDB for real-time data enrichment, aggregation, and filtering • ksqlDB UDFs for anomaly detection algorithms
  • 16. MOA SOLUTION • Automatic real-time aggregation of multi- system log files • Enhanced search capabilities • Correlate impact analysis • Identify historic patterns & predict analytics to avoid future mission critical outage • Extend human capabilities through the use of AI/ML VALUE TO CUSTOMER • Risk mitigation and cost effectiveness. • Extend life of critical satellite constellations • Increased situational awareness satellite h&s • Consistent results across user base • Proactive preemptive maintenance • Centralized source of best practice knowledge base • Designed to be used for any mission w/ standardized logging systems 16 MOA satisfies unmet needs identified at 2017 GSAW : “We think that the set of events/log message enhancements will provide powerful capabilities for the mission user regardless of Agency or type of mission.” “Placing an emphasis on non-telemetry analysis opens up a new area of data mining, analytics and tool development [in satellite operations]–we think the users will help identify even more functions.” https://ntrs.nasa.gov/citations/20170002304
  • 17. NOAA’s End-State Vision Common Cloud Framework 17 Many types of satellite data are a public commodity • Weather • Oceanographic • Imagery Traditional data ingestion, storage and processing are cost-effective and performed as-a-service Cloud-based solutions enhance data proximity & enhance user experience Kafka is a key to the framework of the future Security and role-based access are keys Streaming solutions like Kafka and Confluent tooling simplify and speed up data flows from ingest to dissemination
  • 18. Moving Forward with ASRC Federal 18 Alaskan company with more than 8,000 employees • Engineering • Software • Digital Modernization • Professional Services • Infrastructure More than 1,500 employees working with NASA, NOAA and others 20+ year history solving complex technology and software problems for the federal government
  • 19. Summary 19 ASRC Federal is partnered with NASA and NOAA to modernize the ground station architecture in accordance with agency goals and roadmaps Cloud migration and modern tooling are both keys to that success, including better dependability and an anticipated cost reduction We successfully used Kafka to implement the MOA message-passing infrastructure in a phase 1 MVP; seeking to expand its use in follow- on development
  • 20. Quyanaq! Thank you! Eric Velte Chief Technology Officer e: evelte@asrcfederal.com