9. 9
CONNECTED VEHICLE DATA
Connected Vehicle data comes directly from vehicles. It is our best
opportunity to understand how our customers use our current
products, and it helps us create new ones.
Embedded Modem
SYNC Applink
Plug-in Device
Data Logger
10. 10
BIG DATA DRIVE
• Many Ford employees have volunteered to allow the company to collect driving data
from in-vehicle sensors
– Uses OpenXC, Ford’s open-source plug-in device
• Learning how people actually use their vehicles through Big Data Analytics
– Improve products and customer service
– Develop mobility solutions
[Diagram labels: Car, OBD2, OpenXC CAN Translator, Phone/Tablet, Ford Hadoop, Public Cloud, Analysis Toolbox]
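As a rough illustration of the kind of data this slide describes, the sketch below groups plug-in device messages by signal name. It assumes newline-delimited JSON records with `name`, `value` and `timestamp` fields, in the style of OpenXC's published message format. The sample values and the `group_by_signal` helper are hypothetical and are not taken from the deck.

```python
import json
from collections import defaultdict

# Hypothetical sample trace: one JSON message per line (values invented for illustration).
SAMPLE_TRACE = """\
{"name": "vehicle_speed", "value": 42.0, "timestamp": 1436000000.0}
{"name": "engine_speed", "value": 1800, "timestamp": 1436000000.1}
{"name": "vehicle_speed", "value": 44.0, "timestamp": 1436000001.0}
"""


def group_by_signal(trace_text):
    """Return {signal name: [(timestamp, value), ...]} from a newline-delimited JSON trace."""
    signals = defaultdict(list)
    for line in trace_text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        msg = json.loads(line)
        signals[msg["name"]].append((msg["timestamp"], msg["value"]))
    return dict(signals)


signals = group_by_signal(SAMPLE_TRACE)
speeds = [value for _, value in signals["vehicle_speed"]]
mean_speed = sum(speeds) / len(speeds)
print(f"vehicle_speed samples: {len(speeds)}, mean: {mean_speed}")
```

Grouping by signal first keeps later analysis steps (averages, trip summaries) independent of the order in which messages arrive.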
11. 11
MODEM IN LINCOLN MKC AND MKZ
• The modem is part of the larger Technology Package available on many 2017
models.
• Data from 2017 model year embedded modems includes ambient
temperature
• Individual Lincoln vehicles reporting environmental data already
outnumber the weather stations in the U.S.
– 5000 connected vehicles compared to 2550 weather stations
– 6 million data points between Jan and Jul 2015
• Analysis across different time scales
– By minute/hour across the day
– By day across months
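The two time scales above can be sketched as a simple bucketing step. This is a minimal example, assuming each reading is a timestamp plus an ambient temperature. The readings, the units (degrees Celsius) and the `mean_by` helper are invented for illustration and are not data from the deck.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean

# Hypothetical readings: (ISO timestamp, ambient temperature in deg C).
READINGS = [
    ("2015-01-15T08:05:00", -3.0),
    ("2015-01-15T08:40:00", -1.0),
    ("2015-01-15T14:10:00", 2.0),
    ("2015-07-04T14:30:00", 30.0),
]


def mean_by(readings, key_fn):
    """Average temperature per bucket, where key_fn maps a datetime to a bucket key."""
    buckets = defaultdict(list)
    for ts, temp in readings:
        buckets[key_fn(datetime.fromisoformat(ts))].append(temp)
    return {key: mean(temps) for key, temps in sorted(buckets.items())}


# By hour across the day (all dates pooled together).
by_hour = mean_by(READINGS, lambda t: t.hour)
# By day across months.
by_day = mean_by(READINGS, lambda t: t.date().isoformat())

print(by_hour)
print(by_day)
```

Passing the bucket key as a function means the same helper covers minute, hour, day or month views without changing the aggregation logic.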
12. 12
One vehicle creates up to 25 gigabytes of data per hour
or
All Ford vehicles on the road create 2500 petabytes each day
Facebook users share nearly 3.6 million pieces of content each day
Apple users download nearly 50,000 apps (every minute)
Amazon generates over $80,000 in online sales each minute
Creates unprecedented transactional data volumes: 25+ petabytes* a day
GDI&A – Big Data Comparison
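To see how the two vehicle figures on this slide relate, the sketch below works out how many vehicle-hours of driving per day would be needed for the fleet total, if every vehicle produced data at the stated peak rate. It assumes decimal units (1 PB = 1,000,000 GB). The slide gives no fleet size or driving hours, so this only shows what the two numbers jointly imply, not a validated figure.

```python
# Figures quoted on the slide.
GB_PER_VEHICLE_HOUR = 25   # "up to 25 Gigabytes of data per hour"
FLEET_PB_PER_DAY = 2500    # "2500 Petabytes each day"

# Assumption: decimal units, 1 PB = 1,000,000 GB.
GB_PER_PB = 1_000_000

fleet_gb_per_day = FLEET_PB_PER_DAY * GB_PER_PB
# Vehicle-hours per day implied if every vehicle ran at the peak rate.
implied_vehicle_hours_per_day = fleet_gb_per_day // GB_PER_VEHICLE_HOUR

# Ratio of the fleet figure to the "25+ petabytes a day" comparison figure.
ratio_to_25_pb = FLEET_PB_PER_DAY // 25

print(f"{implied_vehicle_hours_per_day:,} vehicle-hours per day")
print(f"{ratio_to_25_pb}x the 25 PB/day comparison figure")
```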
15. 15
GDI&A DATA OPERATIONS – WHAT DO WE DO?
Data Knowledge to Ignite Analytics
Identify Internal and External Data Sources
Manage Data as an Asset
Enterprise Level Data Standards and Governance
Reports and Dashboards
16. 16
BIG DATA & ANALYTICS ARCHITECTURE
Our Analytics Reference Architecture Describes The Components Needed To Build Any And All Analytics Applications And Is Used To Guide Our Technology Choices
19. 19
DEMOCRATIZING THE BASE OF THE ANALYTIC PYRAMID
• GDI&A will provide tools, infrastructure and training to support
capability across skill teams.
FOCUS OF DEMOCRATIZATION
The Base Of The Pyramid Is Where Significant Democratization Will Occur
20. 20
FOCUS ON EDUCATION & TRAINING
We Are Targeting Specific Areas For Initial Rollout Of Education & Tools
BUSINESS ANALYSTS
• Not Only Tool Training, but Data Science Training (Visualization
Techniques, Analysis Techniques, etc.)
• Multiple training methods (online, classroom, and doing together)
• Access to tools will follow education and training
21. 21
BALANCING ACCESS WITH EDUCATION
Our Efforts Will Be Focused On Building The Company’s Capabilities Versus Simply Passing Out IDs And Tools
• Democratization doesn’t make someone a Data Scientist – It is critical that
people know when and where to get help
• Minimize Duplication of Efforts
• Right Data Sources
• Right Governance
• Right Methodology
• Right Tool for the Job
• Improved Access
• Improved Tools
• Improved Training
• Access to Help (GDIA)
Replace Mike C’s name –
Title change and modifications to the articles - reduce
Introduce the DSC
Volume from transactional systems
Connected vehicle
Autonomous vehicle
Ford Pass Application
Tied directly to examples like AV and the need to analyze quicker
Ask Tom to show the OpenXC device
Validate the numbers – differentiate the model years
Get the right years and model enabling the
Is 2015 accurate or 2016 – validate model years with capability
How to organize and utilize the volumes of data?
As shown by our most ethical company award, it starts with securing and protecting our customers' data
Data protection and consumer privacy are key to the success of Ford’s data strategy
Working closely with our legal partners to ensure we have the right opt-in, permissions and controls in place
Critical to the organization: tees off from the size of the challenge and how we plan to organize and use the data in an application
Load Data – Data acquisition and data ingestion
Manage Data – Information management, identity and security, data index, data discovery, metadata,
access management and data rights, PII and data privacy, data standards, data quality, data curation and data retention
Enable Analytics – Real-time integration (API), analytics enablement (DZ etc.), self-service
Analytics – Enterprise analytics - Glo