SlideShare a Scribd company logo
1 of 24
Think Innovate Deliver
MIT CDOIQ
SUMMIT
20 August 2020
Ensuring Flight Safety
F-16 unloaded F-16 with many mounted “stores”: missiles, bombs,
fuel tanks, sensors, etc.
New store configurations lead to many possible problems, including:
Unsuccessful separation:
https://www.youtube.com/watch?v=fPTnmZ_HPAs&t=68s
Aeroelastic flutter: https://www.youtube.com/watch?v=qpJBvQXQC2M&t=44s
AF SEEK EAGLE Office
• Mission:
We deliver war-winning capability by efficiently evaluating the integration of the state-of-the-art weapons
on current and future generation aircraft, providing accurate combat weapon delivery software, while
serving as responsible stewards of our nation's resources.​
• Vision:
Be the most agile, trusted, and responsive provider of innovative and cost-effective war-winning weapons
integration and mission planning solutions in the DoD.
REMEMBERING THE PAST…
● Separations
● Ballistics
AFSEO Disciplines
● Fit and function
● Stability & Control
● Aircraft & Store
Loads
● EMC & EMI
● Flutter
● Safe
Escape
AFSEO Disciplines: Data
● Separations
● Ballistics
● Safe Escape
○ Custom
software
package
○ Physics
modeling
● EMC/EMI
○ Maxwell Solvers
Software
○ Ground Tests
● Flutter
○ Finite Element
Modeling
○ Flight tests
● Fit and function
● Stability & Control
● Aircraft & Store
Loads
● EMC & EMI
● Flutter
● Safe
Escape
● Fit and
function
○ CAD
○ Laser
Scans
● Stability & Control
○ Wind tunnel tests
○ Simulations
○ Flight tests
● Aircraft Store &
Loads
○ Wind tunnel tests
○ Flight tests
● Separations
○ Static ejection tests
○ CFD
○ Wind tunnel tests
○ Flight tests
● Ballistics
○ Dynamic Modeling
Creating a Clearance Recommendation
● Safe Escape
○ Custom
software
package
○ Physics
modeling
● Fit and
function
○ CAD
○ Laser
Scans
Unified “Store Limitations”
● Stability & Control
○ Wind tunnel tests
○ Simulations
○ Flight tests
● Aircraft Store &
Loads
○ Wind tunnel tests
○ Flight tests
● EMC/EMI
○ Maxwell Solvers
Software
○ Ground Tests
● Flutter
○ Finite Element
Modeling
○ Flight tests
● Separations
○ Static ejection tests
○ CFD
○ Wind tunnel tests
○ Flight tests
● Ballistics
○ Dynamic Modeling
Aligned with Air Force Chief Data Office
“Data is the future of our force,” Crider said. “Unlocking and unleashing the
power of our data is going to keep the Air Force at the forefront of
technological advancement. We must take advantage of today’s technology
so we can learn faster than our adversaries and ensure the maximum
effectiveness of our force.”
http://www.af.mil/News/Article-Display/Article/1448828/af-chief-data-officer-data-is-the-future-of-the-force/
• Organizing and storing data
• Finding data
• Making it accessible
• Linking it
• Making it trustworthy
• Providing an environment to access,
view, filter it
• Understanding it
Data accessibility
Support
appropriate
heterogeneity
Data governance
Automation w/
machine learning
Digit-
ization
“Exposing
engineering data as
a strategic asset”
AF CDO Core Goals AFSEO CDO
Immediate
Priorities
AFSEO Data Challenges
• Technology and processes
management must go together
• Choosing the right technology for
today and the future
• Supporting multigenerational office
• 200 TB of data from 50+ years
• Majority of data is unstructured in
sundry formats
- includes a large video and
simulations library
- Final products are memos and
reports in .doc and .pdf
• Infrastructure modernization ongoing
Background Issues
Data Lake Design
• On premise, but cloud ready
• Open Architecture - “Best in
Breed”
• Scalable - for the future
• Configurable & adaptable
• Interoperabile with internal legacy
AFSEO tools
• Secure, using existing role based
access control (at the file level)
• Correct results without massive
investment of time or money
App Layer
“Data unification powered by
machine learning”
Storage
“Scale out network-attached
storage platform”
Microservices
“Software platform for data
engineering at scale”
Design Principles
The data lake is an extensible workspace for data processing and application creation
Key Technologies
Data Lake Products
Numerical predictions
AUTOMATED OUTPUTS PRODUCTIVITY TOOLS
Document catalog powered by
clean, consistent metadata
Recommendation with
sources cited
Interactive antecedent browser
NEW
SEEK
EAGLE
REQUEST
Metadata extraction
INPUT AUTOMATED ANALYSIS
Discipline-specific logic
Machine learning models
for each discipline
Data Lake Architecture
Linux VM
Hue
Cloudera
Microservices
Dell/EMC Isilon
HDFS
CIFS
Windows File Browser
Linux VM
Cloudera
Microservices
DB
App 1:
File Catalog &
Metadata Tagging
DB
App
API
(future)
AD, LDAP and
Kerberos
integrated
Cloudera & Isilon,
and by extension
Tamr
Connected to other
AFSEO DBs and
apps
Solr Spark
...
Cloudera
Manager
Banana
HBase Solr Spark...
App 2:
Recommendation
Engine
App 3:
Deep
Predictor
Tamr Entity
Resolution
Data Lake Architecture
Linux VM
Hue
Cloudera
Microservices
Dell/EMC Isilon
HDFS
CIFS
Windows File Browser
Linux VM
Cloudera
Microservices
DB
App 1:
File Catalog &
Metadata Tagging
DB
App
API
(future)
AD, LDAP and
Kerberos
integrated
Cloudera & Isilon,
and by extension
Tamr
Connected to other
AFSEO DBs and
apps
Solr Spark
...
Cloudera
Manager
Banana
HBase Solr Spark...
App 2:
Recommendation
Engine
App 3:
Deep
Predictor
Tamr Entity
Resolution
An aside
Lucene
Elastic-
Search
Solr
Kibana Banana
E(L)K
Stack
Viz
Search
Solr
equiv.
IB
Intra-cluster
Communication Layer
Client/Application Layer
Clients
Clients
Clients
Ethernet Layer
1GbE
10GbE
CIFSNFS
FTPHTTP
SMB
SWIFT
HDFS NDMP
OneFS Operating
Environment
Single
FS/Volume
Isilon NAS Architecture
• A Data Lake offers multi-protocol access with
tiering to workloads with different performance
requirements.
• All file sharing protocols capabilities allow
access from all other protocols
• Multiprotocol access controlled through AD,
LDAP, NIS and other providers
PowerScale Supported Protocols
• SMB – Microsoft OS
• NFS – Linux / Unix
• HTTP
• REST
• SWIFT
• S3 – Object Storage – Cloud Option
• HDFS – Hadoop
• NDMP – Backup & Recovery
• FTP
Flexibility makes AI an integral part of IT
© Copyright 2017 Dell Inc.16
Flash
SATA/SAS
Isilon
Complete “best of breed” solution
Flexible Compute
Compute &
Converged
Hyper-Converged
w/ NW
Hyper-
Converged
Existing Server
Infrastructure
Servers Embedded
w/ GPU-Farm*
Optional GPU Acceleration
Machine Learning
Custom
Deep Learning
DL4JCustom
Python
BigDL
*ex. Nvidia DGX-1
READY SOLUTION
Deep Learning with Nvidia
Massively Parallel IO
Flexible deployment options
App 1: Searchable File Catalog
Outcomes
● Improve productivity by making 50 years
of test data and SEEK EAGLE analyses
easily searchable
● Surface key references and connections
buried in documents
● Tagging >131 TB of data with 4.3 billion
descriptive labels in 30+ tag types
(aircraft, stores, author, file type, etc.)
● Analyst uses browser to search and filter
by tag
Prior State
● Research wastes engineer time
Digging through old files takes hours or
days (“limited data accessibility”)
● Siloed data
If one department needs help from another,
they have to ask a human (“limited data
sharing”)
● Disorganization leads to repeated work
Finding the right test from 10 years ago is
so difficult, engineers repeat the
experiment (“limited data usage”)
App 1: Searchable File Catalog
Maintain
this
system
alongside
the users’
folders
One click opens a file locally
Metadata
tags are
compre-
hensive &
ambitious
Amazon-
like search
& filter mix
● 80% of requests should be fully
automated: engineer is just editor of
automatically produced product
● If the judgement is “too close to call”,
shows relevant historical document as a
place to start
● Encoded some of the veterans’ instincts
in the logic of the app
● Makes it easier for the program
management office to plan
● Engineers from each discipline touch
every request
● 90% of requests will be “by analogy”
● Even planning is hard
Requirement
Loads EMI Stability/
Control
Flutter Separ
ations
Mission
planning
Prior State
Recommendation
Outcomes
App 2: Recommendation Engine
App 2: Recommendation Engine
NEW SEEK
EAGLE
REQUEST
REFERENCE DATA
ACES Standard
Rationale
STAMP
DATA PROCESSING
Entity Resolution, NLP,
Transformations
SCORING
Tolerance Check
Configuration Comparator
HISTORICAL DATA
Approved
configs
Eng. dataTech
Order
Certify by
Analogy?
Produce
Publishable
Documents✅
🆇 Human review
and/or testing
needed
Flight Limits
Engineering Rationale
with sources cited
Historical data browser,
sorted by similarity
App 2: Recommendation Engine
Deep Dive: Entity Resolution with ML
Demo
● Time intensive
Every request results in 10,000s of
“download configurations”
● Manual
Physics-based simulations used heavily,
but non-trivial interpretation still needed
● Testing data is sparse
30+ years of flight tests, but most configs
aren’t tested due to high cost
● Expert-driven
Good judgment requires years of training
App 3: ML Predictions (Flutter)
● Tamr predicts amount of oscillation as
accurately as possible at every
possible flight condition
● The app automatically generates a
flight envelope for each new SEEK
EAGLE Request.
● Tamr shows most similar previous
flight tests and confidence intervals for
human check
● The Tamr output allows for fewer and
more targeted flight tests.
Prior State Outcomes
App 3: ML Predictions (Flutter)
Data lake used to predict aerodynamic flutter from first principles
Automatically produced “flight envelope” for
new configuration
Data Robot used to automate ML tuning process
Outcomes
Result Impact
AFSEO processes more configurations/year,
giving the pilot more options for a mission
Increased ROI and effective utilization of Air
Force aircraft
AFSEO cycle time is reduced Innovations reach the front lines sooner
Better data usage → fewer test flights, fewer
experiments in the wind tunnel, etc.
Cost saving for AFSEO
Data lake not only reveals and organizes
data, it converts data → insights
SEEK EAGLE Office is more productive
Data is easier to find;
Greybeard “instincts” have been codified
Reduce the onboarding time for new
engineers

More Related Content

What's hot

Software Defined Storage - Open Framework and Intel® Architecture Technologies
Software Defined Storage - Open Framework and Intel® Architecture TechnologiesSoftware Defined Storage - Open Framework and Intel® Architecture Technologies
Software Defined Storage - Open Framework and Intel® Architecture TechnologiesOdinot Stanislas
 
Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Intel IT Open Cloud - What's under the Hood and How do we Drive it?Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Intel IT Open Cloud - What's under the Hood and How do we Drive it?Odinot Stanislas
 
NRB - BE MAINFRAME DAY 2017 - Compuware Dev Ops
NRB - BE MAINFRAME DAY 2017 - Compuware Dev OpsNRB - BE MAINFRAME DAY 2017 - Compuware Dev Ops
NRB - BE MAINFRAME DAY 2017 - Compuware Dev OpsNRB
 
Implementing DevOps – How it came to the fore, its key elements and example d...
Implementing DevOps – How it came to the fore, its key elements and example d...Implementing DevOps – How it came to the fore, its key elements and example d...
Implementing DevOps – How it came to the fore, its key elements and example d...Barton George
 
Authoring and Hosting Applications on YARN using Slider
Authoring and Hosting Applications on YARN using SliderAuthoring and Hosting Applications on YARN using Slider
Authoring and Hosting Applications on YARN using SliderDataWorks Summit
 
Cloud native integration
Cloud native integrationCloud native integration
Cloud native integrationKim Clark
 
Cloud foundry architecture and deep dive
Cloud foundry architecture and deep diveCloud foundry architecture and deep dive
Cloud foundry architecture and deep diveAnimesh Singh
 
Making your PostgreSQL Database Highly Available
Making your PostgreSQL Database Highly AvailableMaking your PostgreSQL Database Highly Available
Making your PostgreSQL Database Highly AvailableEDB
 
Mainframe Modernization with AWS: Patterns and Best Practices
Mainframe Modernization with AWS: Patterns and Best PracticesMainframe Modernization with AWS: Patterns and Best Practices
Mainframe Modernization with AWS: Patterns and Best PracticesAmazon Web Services
 
Agile Mumbai 2020 Conference | Value of DevOps - Journey from Automation to N...
Agile Mumbai 2020 Conference | Value of DevOps - Journey from Automation to N...Agile Mumbai 2020 Conference | Value of DevOps - Journey from Automation to N...
Agile Mumbai 2020 Conference | Value of DevOps - Journey from Automation to N...AgileNetwork
 
NRB - LUXEMBOURG MAINFRAME DAY 2017 - Data Spark and the Data Federation
NRB - LUXEMBOURG MAINFRAME DAY 2017 - Data Spark and the Data FederationNRB - LUXEMBOURG MAINFRAME DAY 2017 - Data Spark and the Data Federation
NRB - LUXEMBOURG MAINFRAME DAY 2017 - Data Spark and the Data FederationNRB
 
New Integration Options with Postgres Enterprise Manager 8.0
New Integration Options with Postgres Enterprise Manager 8.0New Integration Options with Postgres Enterprise Manager 8.0
New Integration Options with Postgres Enterprise Manager 8.0EDB
 
Enabling Cloud Capabilities Through an Enterprise PaaS (Cloud Foundry Summit ...
Enabling Cloud Capabilities Through an Enterprise PaaS (Cloud Foundry Summit ...Enabling Cloud Capabilities Through an Enterprise PaaS (Cloud Foundry Summit ...
Enabling Cloud Capabilities Through an Enterprise PaaS (Cloud Foundry Summit ...VMware Tanzu
 
How to Migrate Applications Off a Mainframe
How to Migrate Applications Off a MainframeHow to Migrate Applications Off a Mainframe
How to Migrate Applications Off a MainframeVMware Tanzu
 
Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...DataWorks Summit
 
Best practices for application migration to public clouds interop presentation
Best practices for application migration to public clouds interop presentationBest practices for application migration to public clouds interop presentation
Best practices for application migration to public clouds interop presentationesebeus
 
IBM Bluemix hands on
IBM Bluemix hands onIBM Bluemix hands on
IBM Bluemix hands onFelipe Freire
 
Troubleshooting App Health and Performance with PCF Metrics 1.2
Troubleshooting App Health and Performance with PCF Metrics 1.2Troubleshooting App Health and Performance with PCF Metrics 1.2
Troubleshooting App Health and Performance with PCF Metrics 1.2VMware Tanzu
 
The Business Case behind Cloud Computing - The risks and rewards
The Business Case behind Cloud Computing - The risks and rewardsThe Business Case behind Cloud Computing - The risks and rewards
The Business Case behind Cloud Computing - The risks and rewardsOptimation
 

What's hot (20)

Software Defined Storage - Open Framework and Intel® Architecture Technologies
Software Defined Storage - Open Framework and Intel® Architecture TechnologiesSoftware Defined Storage - Open Framework and Intel® Architecture Technologies
Software Defined Storage - Open Framework and Intel® Architecture Technologies
 
Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Intel IT Open Cloud - What's under the Hood and How do we Drive it?Intel IT Open Cloud - What's under the Hood and How do we Drive it?
Intel IT Open Cloud - What's under the Hood and How do we Drive it?
 
Delphix 4.0
Delphix 4.0Delphix 4.0
Delphix 4.0
 
NRB - BE MAINFRAME DAY 2017 - Compuware Dev Ops
NRB - BE MAINFRAME DAY 2017 - Compuware Dev OpsNRB - BE MAINFRAME DAY 2017 - Compuware Dev Ops
NRB - BE MAINFRAME DAY 2017 - Compuware Dev Ops
 
Implementing DevOps – How it came to the fore, its key elements and example d...
Implementing DevOps – How it came to the fore, its key elements and example d...Implementing DevOps – How it came to the fore, its key elements and example d...
Implementing DevOps – How it came to the fore, its key elements and example d...
 
Authoring and Hosting Applications on YARN using Slider
Authoring and Hosting Applications on YARN using SliderAuthoring and Hosting Applications on YARN using Slider
Authoring and Hosting Applications on YARN using Slider
 
Cloud native integration
Cloud native integrationCloud native integration
Cloud native integration
 
Cloud foundry architecture and deep dive
Cloud foundry architecture and deep diveCloud foundry architecture and deep dive
Cloud foundry architecture and deep dive
 
Making your PostgreSQL Database Highly Available
Making your PostgreSQL Database Highly AvailableMaking your PostgreSQL Database Highly Available
Making your PostgreSQL Database Highly Available
 
Mainframe Modernization with AWS: Patterns and Best Practices
Mainframe Modernization with AWS: Patterns and Best PracticesMainframe Modernization with AWS: Patterns and Best Practices
Mainframe Modernization with AWS: Patterns and Best Practices
 
Agile Mumbai 2020 Conference | Value of DevOps - Journey from Automation to N...
Agile Mumbai 2020 Conference | Value of DevOps - Journey from Automation to N...Agile Mumbai 2020 Conference | Value of DevOps - Journey from Automation to N...
Agile Mumbai 2020 Conference | Value of DevOps - Journey from Automation to N...
 
NRB - LUXEMBOURG MAINFRAME DAY 2017 - Data Spark and the Data Federation
NRB - LUXEMBOURG MAINFRAME DAY 2017 - Data Spark and the Data FederationNRB - LUXEMBOURG MAINFRAME DAY 2017 - Data Spark and the Data Federation
NRB - LUXEMBOURG MAINFRAME DAY 2017 - Data Spark and the Data Federation
 
New Integration Options with Postgres Enterprise Manager 8.0
New Integration Options with Postgres Enterprise Manager 8.0New Integration Options with Postgres Enterprise Manager 8.0
New Integration Options with Postgres Enterprise Manager 8.0
 
Enabling Cloud Capabilities Through an Enterprise PaaS (Cloud Foundry Summit ...
Enabling Cloud Capabilities Through an Enterprise PaaS (Cloud Foundry Summit ...Enabling Cloud Capabilities Through an Enterprise PaaS (Cloud Foundry Summit ...
Enabling Cloud Capabilities Through an Enterprise PaaS (Cloud Foundry Summit ...
 
How to Migrate Applications Off a Mainframe
How to Migrate Applications Off a MainframeHow to Migrate Applications Off a Mainframe
How to Migrate Applications Off a Mainframe
 
Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...
 
Best practices for application migration to public clouds interop presentation
Best practices for application migration to public clouds interop presentationBest practices for application migration to public clouds interop presentation
Best practices for application migration to public clouds interop presentation
 
IBM Bluemix hands on
IBM Bluemix hands onIBM Bluemix hands on
IBM Bluemix hands on
 
Troubleshooting App Health and Performance with PCF Metrics 1.2
Troubleshooting App Health and Performance with PCF Metrics 1.2Troubleshooting App Health and Performance with PCF Metrics 1.2
Troubleshooting App Health and Performance with PCF Metrics 1.2
 
The Business Case behind Cloud Computing - The risks and rewards
The Business Case behind Cloud Computing - The risks and rewardsThe Business Case behind Cloud Computing - The risks and rewards
The Business Case behind Cloud Computing - The risks and rewards
 

Similar to Data as a Strategic Asset

Confluent Partner Tech Talk with Reply
Confluent Partner Tech Talk with ReplyConfluent Partner Tech Talk with Reply
Confluent Partner Tech Talk with Replyconfluent
 
Mastering Cloud Data Cost Control: A FinOps Approach
Mastering Cloud Data Cost Control: A FinOps ApproachMastering Cloud Data Cost Control: A FinOps Approach
Mastering Cloud Data Cost Control: A FinOps ApproachDenodo
 
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps ApproachLunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps ApproachDenodo
 
Application Modernisation with PKS
Application Modernisation with PKSApplication Modernisation with PKS
Application Modernisation with PKSPhil Reay
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLNordic APIs
 
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureRed Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureIntel® Software
 
Faster, more Secure Application Modernization and Replatforming with PKS - Ku...
Faster, more Secure Application Modernization and Replatforming with PKS - Ku...Faster, more Secure Application Modernization and Replatforming with PKS - Ku...
Faster, more Secure Application Modernization and Replatforming with PKS - Ku...VMware Tanzu
 
Continus sql with sql stream builder
Continus sql with sql stream builderContinus sql with sql stream builder
Continus sql with sql stream builderTimothy Spann
 
Plan with confidence: Route to a successful Do178c multicore certification
Plan with confidence: Route to a successful Do178c multicore certificationPlan with confidence: Route to a successful Do178c multicore certification
Plan with confidence: Route to a successful Do178c multicore certificationMassimo Talia
 
Trusted Reliability & Performance with the AppExchange Platform
Trusted Reliability & Performance with the AppExchange PlatformTrusted Reliability & Performance with the AppExchange Platform
Trusted Reliability & Performance with the AppExchange Platformdreamforce2006
 
Monitoring IAAS & PAAS Solutions
Monitoring IAAS & PAAS SolutionsMonitoring IAAS & PAAS Solutions
Monitoring IAAS & PAAS SolutionsColloquium
 
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers Red_Hat_Storage
 
DevOps LA Meetup Intro to Habitat
DevOps LA Meetup Intro to HabitatDevOps LA Meetup Intro to Habitat
DevOps LA Meetup Intro to HabitatJessica DeVita
 
FlexPod-Performance-Fall2014-slideshare
FlexPod-Performance-Fall2014-slideshareFlexPod-Performance-Fall2014-slideshare
FlexPod-Performance-Fall2014-slideshareMichael Harding
 
Confluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with SynthesisConfluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with Synthesisconfluent
 
Morphis Technologies Overview
Morphis Technologies OverviewMorphis Technologies Overview
Morphis Technologies Overviewjrhartley62
 
Accelerating Cloud Services - Intel
Accelerating Cloud Services - IntelAccelerating Cloud Services - Intel
Accelerating Cloud Services - IntelAmazon Web Services
 
Breaking the Monolith
Breaking the MonolithBreaking the Monolith
Breaking the MonolithVMware Tanzu
 
Webinar: How and Why to Containerize Your Legacy Applications
Webinar: How and Why to Containerize Your Legacy ApplicationsWebinar: How and Why to Containerize Your Legacy Applications
Webinar: How and Why to Containerize Your Legacy ApplicationsStorage Switzerland
 
Deploying All-Flash Cloud Infrastructure without Breaking the Bank
Deploying All-Flash Cloud Infrastructure without Breaking the BankDeploying All-Flash Cloud Infrastructure without Breaking the Bank
Deploying All-Flash Cloud Infrastructure without Breaking the BankWestern Digital
 

Similar to Data as a Strategic Asset (20)

Confluent Partner Tech Talk with Reply
Confluent Partner Tech Talk with ReplyConfluent Partner Tech Talk with Reply
Confluent Partner Tech Talk with Reply
 
Mastering Cloud Data Cost Control: A FinOps Approach
Mastering Cloud Data Cost Control: A FinOps ApproachMastering Cloud Data Cost Control: A FinOps Approach
Mastering Cloud Data Cost Control: A FinOps Approach
 
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps ApproachLunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
 
Application Modernisation with PKS
Application Modernisation with PKSApplication Modernisation with PKS
Application Modernisation with PKS
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of ML
 
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureRed Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
 
Faster, more Secure Application Modernization and Replatforming with PKS - Ku...
Faster, more Secure Application Modernization and Replatforming with PKS - Ku...Faster, more Secure Application Modernization and Replatforming with PKS - Ku...
Faster, more Secure Application Modernization and Replatforming with PKS - Ku...
 
Continus sql with sql stream builder
Continus sql with sql stream builderContinus sql with sql stream builder
Continus sql with sql stream builder
 
Plan with confidence: Route to a successful Do178c multicore certification
Plan with confidence: Route to a successful Do178c multicore certificationPlan with confidence: Route to a successful Do178c multicore certification
Plan with confidence: Route to a successful Do178c multicore certification
 
Trusted Reliability & Performance with the AppExchange Platform
Trusted Reliability & Performance with the AppExchange PlatformTrusted Reliability & Performance with the AppExchange Platform
Trusted Reliability & Performance with the AppExchange Platform
 
Monitoring IAAS & PAAS Solutions
Monitoring IAAS & PAAS SolutionsMonitoring IAAS & PAAS Solutions
Monitoring IAAS & PAAS Solutions
 
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
 
DevOps LA Meetup Intro to Habitat
DevOps LA Meetup Intro to HabitatDevOps LA Meetup Intro to Habitat
DevOps LA Meetup Intro to Habitat
 
FlexPod-Performance-Fall2014-slideshare
FlexPod-Performance-Fall2014-slideshareFlexPod-Performance-Fall2014-slideshare
FlexPod-Performance-Fall2014-slideshare
 
Confluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with SynthesisConfluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with Synthesis
 
Morphis Technologies Overview
Morphis Technologies OverviewMorphis Technologies Overview
Morphis Technologies Overview
 
Accelerating Cloud Services - Intel
Accelerating Cloud Services - IntelAccelerating Cloud Services - Intel
Accelerating Cloud Services - Intel
 
Breaking the Monolith
Breaking the MonolithBreaking the Monolith
Breaking the Monolith
 
Webinar: How and Why to Containerize Your Legacy Applications
Webinar: How and Why to Containerize Your Legacy ApplicationsWebinar: How and Why to Containerize Your Legacy Applications
Webinar: How and Why to Containerize Your Legacy Applications
 
Deploying All-Flash Cloud Infrastructure without Breaking the Bank
Deploying All-Flash Cloud Infrastructure without Breaking the BankDeploying All-Flash Cloud Infrastructure without Breaking the Bank
Deploying All-Flash Cloud Infrastructure without Breaking the Bank
 

More from TamrMarketing

Data Mastering at Scale with Michael Stonebraker
Data Mastering at Scale with Michael StonebrakerData Mastering at Scale with Michael Stonebraker
Data Mastering at Scale with Michael StonebrakerTamrMarketing
 
Optimize supply chains using machine learning superpowers webinar deck
Optimize supply chains using machine learning superpowers webinar deckOptimize supply chains using machine learning superpowers webinar deck
Optimize supply chains using machine learning superpowers webinar deckTamrMarketing
 
7 Steps for Boosting R&D Outcomes
7 Steps for Boosting R&D Outcomes7 Steps for Boosting R&D Outcomes
7 Steps for Boosting R&D OutcomesTamrMarketing
 
How Santander UK Accelerates Digital Initiatives by Mastering Customer Data
How Santander UK Accelerates Digital Initiatives by Mastering Customer DataHow Santander UK Accelerates Digital Initiatives by Mastering Customer Data
How Santander UK Accelerates Digital Initiatives by Mastering Customer DataTamrMarketing
 
DataOps @ Scale: A Modern Framework for Data Management in the Public Sector
DataOps @ Scale: A Modern Framework for Data Management in the Public SectorDataOps @ Scale: A Modern Framework for Data Management in the Public Sector
DataOps @ Scale: A Modern Framework for Data Management in the Public SectorTamrMarketing
 
Sailing Toward Global Data Alignment with Carnival Corporation
 Sailing Toward Global Data Alignment with Carnival Corporation Sailing Toward Global Data Alignment with Carnival Corporation
Sailing Toward Global Data Alignment with Carnival CorporationTamrMarketing
 
Agile Leadership: Guiding DataOps Teams Through Rapid Change and Uncertainty
Agile Leadership: Guiding DataOps Teams Through Rapid Change and UncertaintyAgile Leadership: Guiding DataOps Teams Through Rapid Change and Uncertainty
Agile Leadership: Guiding DataOps Teams Through Rapid Change and UncertaintyTamrMarketing
 
How to Implement a Spend Analytics Program Using Machine Learning
 How to Implement a Spend Analytics Program Using Machine Learning How to Implement a Spend Analytics Program Using Machine Learning
How to Implement a Spend Analytics Program Using Machine LearningTamrMarketing
 
Michael Stonebraker: Big Data, Disruption, and the 800 Pound Gorilla in the ...
Michael Stonebraker:  Big Data, Disruption, and the 800 Pound Gorilla in the ...Michael Stonebraker:  Big Data, Disruption, and the 800 Pound Gorilla in the ...
Michael Stonebraker: Big Data, Disruption, and the 800 Pound Gorilla in the ...TamrMarketing
 
3 Strategies to drive more data driven outcomes in financial services
3 Strategies to drive more data driven outcomes in financial services3 Strategies to drive more data driven outcomes in financial services
3 Strategies to drive more data driven outcomes in financial servicesTamrMarketing
 

More from TamrMarketing (10)

Data Mastering at Scale with Michael Stonebraker
Data Mastering at Scale with Michael StonebrakerData Mastering at Scale with Michael Stonebraker
Data Mastering at Scale with Michael Stonebraker
 
Optimize supply chains using machine learning superpowers webinar deck
Optimize supply chains using machine learning superpowers webinar deckOptimize supply chains using machine learning superpowers webinar deck
Optimize supply chains using machine learning superpowers webinar deck
 
7 Steps for Boosting R&D Outcomes
7 Steps for Boosting R&D Outcomes7 Steps for Boosting R&D Outcomes
7 Steps for Boosting R&D Outcomes
 
How Santander UK Accelerates Digital Initiatives by Mastering Customer Data
How Santander UK Accelerates Digital Initiatives by Mastering Customer DataHow Santander UK Accelerates Digital Initiatives by Mastering Customer Data
How Santander UK Accelerates Digital Initiatives by Mastering Customer Data
 
DataOps @ Scale: A Modern Framework for Data Management in the Public Sector
DataOps @ Scale: A Modern Framework for Data Management in the Public SectorDataOps @ Scale: A Modern Framework for Data Management in the Public Sector
DataOps @ Scale: A Modern Framework for Data Management in the Public Sector
 
Sailing Toward Global Data Alignment with Carnival Corporation
 Sailing Toward Global Data Alignment with Carnival Corporation Sailing Toward Global Data Alignment with Carnival Corporation
Sailing Toward Global Data Alignment with Carnival Corporation
 
Agile Leadership: Guiding DataOps Teams Through Rapid Change and Uncertainty
Agile Leadership: Guiding DataOps Teams Through Rapid Change and UncertaintyAgile Leadership: Guiding DataOps Teams Through Rapid Change and Uncertainty
Agile Leadership: Guiding DataOps Teams Through Rapid Change and Uncertainty
 
How to Implement a Spend Analytics Program Using Machine Learning
 How to Implement a Spend Analytics Program Using Machine Learning How to Implement a Spend Analytics Program Using Machine Learning
How to Implement a Spend Analytics Program Using Machine Learning
 
Michael Stonebraker: Big Data, Disruption, and the 800 Pound Gorilla in the ...
Michael Stonebraker:  Big Data, Disruption, and the 800 Pound Gorilla in the ...Michael Stonebraker:  Big Data, Disruption, and the 800 Pound Gorilla in the ...
Michael Stonebraker: Big Data, Disruption, and the 800 Pound Gorilla in the ...
 
3 Strategies to drive more data driven outcomes in financial services
3 Strategies to drive more data driven outcomes in financial services3 Strategies to drive more data driven outcomes in financial services
3 Strategies to drive more data driven outcomes in financial services
 

Recently uploaded

WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 

Recently uploaded (20)

WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 

Data as a Strategic Asset

  • 1. Think Innovate Deliver MIT CDOIQ SUMMIT 20 August 2020
  • 2. Ensuring Flight Safety F-16 unloaded F-16 with many mounted “stores”: missiles, bombs, fuel tanks, sensors, etc. New store configurations lead to many possible problems, including: Unsuccessful separation: https://www.youtube.com/watch?v=fPTnmZ_HPAs&t=68s Aeroelastic flutter: https://www.youtube.com/watch?v=qpJBvQXQC2M&t=44s
  • 3. AF SEEK EAGLE Office • Mission: We deliver war-winning capability by efficiently evaluating the integration of the state-of-the-art weapons on current and future generation aircraft, providing accurate combat weapon delivery software, while serving as responsible stewards of our nation's resources.​ • Vision: Be the most agile, trusted, and responsive provider of innovative and cost-effective war-winning weapons integration and mission planning solutions in the DoD.
  • 5. ● Separations ● Ballistics AFSEO Disciplines ● Fit and function ● Stability & Control ● Aircraft & Store Loads ● EMC & EMI ● Flutter ● Safe Escape
  • 6. AFSEO Disciplines: Data ● Separations ● Ballistics ● Safe Escape ○ Custom software package ○ Physics modeling ● EMC/EMI ○ Maxwell Solvers Software ○ Ground Tests ● Flutter ○ Finite Element Modeling ○ Flight tests ● Fit and function ● Stability & Control ● Aircraft & Store Loads ● EMC & EMI ● Flutter ● Safe Escape ● Fit and function ○ CAD ○ Laser Scans ● Stability & Control ○ Wind tunnel tests ○ Simulations ○ Flight tests ● Aircraft Store & Loads ○ Wind tunnel tests ○ Flight tests ● Separations ○ Static ejection tests ○ CFD ○ Wind tunnel tests ○ Flight tests ● Ballistics ○ Dynamic Modeling
  • 7. Creating a Clearance Recommendation ● Safe Escape ○ Custom software package ○ Physics modeling ● Fit and function ○ CAD ○ Laser Scans Unified “Store Limitations” ● Stability & Control ○ Wind tunnel tests ○ Simulations ○ Flight tests ● Aircraft Store & Loads ○ Wind tunnel tests ○ Flight tests ● EMC/EMI ○ Maxwell Solvers Software ○ Ground Tests ● Flutter ○ Finite Element Modeling ○ Flight tests ● Separations ○ Static ejection tests ○ CFD ○ Wind tunnel tests ○ Flight tests ● Ballistics ○ Dynamic Modeling
  • 8. Aligned with Air Force Chief Data Office “Data is the future of our force,” Crider said. “Unlocking and unleashing the power of our data is going to keep the Air Force at the forefront of technological advancement. We must take advantage of today’s technology so we can learn faster than our adversaries and ensure the maximum effectiveness of our force.” http://www.af.mil/News/Article-Display/Article/1448828/af-chief-data-officer-data-is-the-future-of-the-force/ • Organizing and storing data • Finding data • Making it accessible • Linking it • Making it trustworthy • Providing an environment to access, view, filter it • Understanding it Data accessibility Support appropriate heterogeneity Data governance Automation w/ machine learning Digit- ization “Exposing engineering data as a strategic asset” AF CDO Core Goals AFSEO CDO Immediate Priorities
  • 9. AFSEO Data Challenges • Technology and processes management must go together • Choosing the right technology for today and the future • Supporting multigenerational office • 200 TB of data from 50+ years • Majority of data is unstructured in sundry formats - includes a large video and simulations library - Final products are memos and reports in .doc and .pdf • Infrastructure modernization ongoing Background Issues
  • 10. Data Lake Design • On premise, but cloud ready • Open Architecture - “Best in Breed” • Scalable - for the future • Configurable & adaptable • Interoperabile with internal legacy AFSEO tools • Secure, using existing role based access control (at the file level) • Correct results without massive investment of time or money App Layer “Data unification powered by machine learning” Storage “Scale out network-attached storage platform” Microservices “Software platform for data engineering at scale” Design Principles The data lake is an extensible workspace for data processing and application creation Key Technologies
  • 11. Data Lake Products Numerical predictions AUTOMATED OUTPUTS PRODUCTIVITY TOOLS Document catalog powered by clean, consistent metadata Recommendation with sources cited Interactive antecedent browser NEW SEEK EAGLE REQUEST Metadata extraction INPUT AUTOMATED ANALYSIS Discipline-specific logic Machine learning models for each discipline
  • 12. Data Lake Architecture Linux VM Hue Cloudera Microservices Dell/EMC Isilon HDFS CIFS Windows File Browser Linux VM Cloudera Microservices DB App 1: File Catalog & Metadata Tagging DB App API (future) AD, LDAP and Kerberos integrated Cloudera & Isilon, and by extension Tamr Connected to other AFSEO DBs and apps Solr Spark ... Cloudera Manager Banana HBase Solr Spark... App 2: Recommendation Engine App 3: Deep Predictor Tamr Entity Resolution
  • 13. Data Lake Architecture Linux VM Hue Cloudera Microservices Dell/EMC Isilon HDFS CIFS Windows File Browser Linux VM Cloudera Microservices DB App 1: File Catalog & Metadata Tagging DB App API (future) AD, LDAP and Kerberos integrated Cloudera & Isilon, and by extension Tamr Connected to other AFSEO DBs and apps Solr Spark ... Cloudera Manager Banana HBase Solr Spark... App 2: Recommendation Engine App 3: Deep Predictor Tamr Entity Resolution An aside Lucene Elastic- Search Solr Kibana Banana E(L)K Stack Viz Search Solr equiv.
  • 14. IB Intra-cluster Communication Layer Client/Application Layer Clients Clients Clients Ethernet Layer 1GbE 10GbE CIFSNFS FTPHTTP SMB SWIFT HDFS NDMP OneFS Operating Environment Single FS/Volume Isilon NAS Architecture
  • 15. • A Data Lake offers multi-protocol access with tiering to workloads with different performance requirements. • All file sharing protocols capabilities allow access from all other protocols • Multiprotocol access controlled through AD, LDAP, NIS and other providers PowerScale Supported Protocols • SMB – Microsoft OS • NFS – Linux / Unix • HTTP • REST • SWIFT • S3 – Object Storage – Cloud Option • HDFS – Hadoop • NDMP – Backup & Recovery • FTP Flexibility makes AI an integral part of IT
  • 16. © Copyright 2017 Dell Inc.16 Flash SATA/SAS Isilon Complete “best of breed” solution Flexible Compute Compute & Converged Hyper-Converged w/ NW Hyper- Converged Existing Server Infrastructure Servers Embedded w/ GPU-Farm* Optional GPU Acceleration Machine Learning Custom Deep Learning DL4JCustom Python BigDL *ex. Nvidia DGX-1 READY SOLUTION Deep Learning with Nvidia Massively Parallel IO Flexible deployment options
  • 17. App 1: Searchable File Catalog Outcomes ● Improve productivity by making 50 years of test data and SEEK EAGLE analyses easily searchable ● Surface key references and connections buried in documents ● Tagging >131 TB of data with 4.3 billion descriptive labels in 30+ tag types (aircraft, stores, author, file type, etc.) ● Analyst uses browser to search and filter by tag Prior State ● Research wastes engineer time Digging through old files takes hours or days (“limited data accessibility”) ● Siloed data If one department needs help from another, they have to ask a human (“limited data sharing”) ● Disorganization leads to repeated work Finding the right test from 10 years ago is so difficult, engineers repeat the experiment (“limited data usage”)
  • 18. App 1: Searchable File Catalog Maintain this system alongside the users’ folders One click opens a file locally Metadata tags are compre- hensive & ambitious Amazon- like search & filter mix
  • 19. ● 80% of requests should be fully automated: engineer is just editor of automatically produced product ● If the judgement is “too close to call”, shows relevant historical document as a place to start ● Encoded some of the veterans’ instincts in the logic of the app ● Makes it easier for the program management office to plan ● Engineers from each discipline touch every request ● 90% of requests will be “by analogy” ● Even planning is hard Requirement Loads EMI Stability/ Control Flutter Separ ations Mission planning Prior State Recommendation Outcomes App 2: Recommendation Engine
  • 20. App 2: Recommendation Engine NEW SEEK EAGLE REQUEST REFERENCE DATA ACES Standard Rationale STAMP DATA PROCESSING Entity Resolution, NLP, Transformations SCORING Tolerance Check Configuration Comparator HISTORICAL DATA Approved configs Eng. dataTech Order Certify by Analogy? Produce Publishable Documents✅ 🆇 Human review and/or testing needed Flight Limits Engineering Rationale with sources cited Historical data browser, sorted by similarity
  • 21. App 2: Recommendation Engine Deep Dive: Entity Resolution with ML Demo
  • 22. ● Time intensive Every request results in 10,000s of “download configurations” ● Manual Physics-based simulations used heavily, but non-trivial interpretation still needed ● Testing data is sparse 30+ years of flight tests, but most configs aren’t tested due to high cost ● Expert-driven Good judgment requires years of training App 3: ML Predictions (Flutter) ● Tamr predicts amount of oscillation as accurately as possible at every possible flight condition ● The app automatically generates a flight envelope for each new SEEK EAGLE Request. ● Tamr shows most similar previous flight tests and confidence intervals for human check ● The Tamr output allows for fewer and more targeted flight tests. Prior State Outcomes
  • 23. App 3: ML Predictions (Flutter) Data lake used to predict aerodynamic flutter from first principles Automatically produced “flight envelope” for new configuration Data Robot used to automate ML tuning process
  • 24. Outcomes Result Impact AFSEO processes more configurations/year, giving the pilot more options for a mission Increased ROI and effective utilization of Air Force aircraft AFSEO cycle time is reduced Innovations reach the front lines sooner Better data usage → fewer test flights, fewer experiments in the wind tunnel, etc. Cost saving for AFSEO Data lake not only reveals and organizes data, it converts data → insights SEEK EAGLE Office is more productive Data is easier to find; Greybeard “instincts” have been codified Reduce the onboarding time for new engineers