SlideShare ist ein Scribd-Unternehmen logo
1 von 28
1© Copyright 2016 Pivotal. All rights reserved.
Data Science-Powered Apps for
the Internet of Things
Chris Rawles1 and Jarrod Vawdrey2
1. Sr. Data Scientist in New York, New York
2. Sr. Data Scientist in Atlanta, Georgia
2© Copyright 2016 Pivotal. All rights reserved.
Today’s talk
1. A real-time data science app
A. The app: a live demonstration
B. How can a data scientist build a data science application?
C. Revisiting the app
2. Generalizing the framework: Solving new data science
challenges
A. Internet of Things – Creating a smart app to prevent oil spill disasters
B. Financial data - How can retail banks influence their cardholders’
behavior?
3© Copyright 2016 Pivotal. All rights reserved.
Today’s talk
1. A real-time data science app
A. The app: a live demonstration
B. How can a data scientist build a data science application?
C. Revisiting the app
2. Generalizing the framework: Solving new data science
challenges
A. Internet of Things – Creating a smart app to prevent oil spill disasters
B. Financial data - How can retail banks influence their cardholders’
behavior?
4© Copyright 2016 Pivotal. All rights reserved.
App
5© Copyright 2016 Pivotal. All rights reserved.
Today’s talk
1. A real-time data science app
A. The app: a live demonstration
B. How can a data scientist build a data science application?
C. Revisiting the app
2. Generalizing the framework: Solving new data science
challenges
A. Internet of Things – creating a smart app
B. Financial data - How can retail banks influence their cardholders’
behavior?
6© Copyright 2016 Pivotal. All rights reserved.
Training
app
Model
Scoring as
a service
Model
Training as
a service
Sensor
app
Scoring
app
Dashboard
app
Data science workflow: Movement classification
1. Sensor + Dashboard
2. Redis
3. Training app
4. Scoring app
7© Copyright 2016 Pivotal. All rights reserved.
here is my source code
run it on the cloud for me
- Onsi Fakhouri
@onsijoe
i do not care how
8© Copyright 2016 Pivotal. All rights reserved.
cf push
 CF determines app type (Java, Python, Ruby, …)
 Installs necessary environment
 Provisions and binds data services
 Creates domain, routing, and load balancing
 Continual app health checks and restarts
9© Copyright 2016 Pivotal. All rights reserved.
Data ingestion: Accelerometric data
 Accelerometric data streamed from
mobile phone at 15 Hz (15x / second)
 Other sensor data: gyroscopic data,
magnetometer data, lon/lat, etc.
Accelerometer axes
10© Copyright 2016 Pivotal. All rights reserved.
 For real-time applications, low-latency data ingestion into
the data store is essential
 WebSocket protocol - socket.io
– Mobile phone  Webserver
– Webserver  Dashboard
 socket.io  redis
Data ingestion
Training
app
Sensor
app
11© Copyright 2016 Pivotal. All rights reserved.
Data storage
 We are using a redis store for:
– Storing training data
– Model persistence
– Storing a micro-batch of scoring data
 Other storage systems include GemFire, HAWQ/Hadoop,
Greenplum Database, PostgreSQL, …
12© Copyright 2016 Pivotal. All rights reserved.
Modeling
Scalable machine learning applications in Pivotal
Cloud Foundry
1. Training app
2. Scoring app
13© Copyright 2016 Pivotal. All rights reserved.
Modeling – Training app
 Goal: build a data-driven model that learns accelerometric
motions associated with each activity
Feature Engineering
• Time-domain
transformations
• Fast Fourier Transform
analysis
Machine Learning
Classification Model
• Random Forest Model
using 2 second time
windows (30 samples)
…
Training data
Trained
model
14© Copyright 2016 Pivotal. All rights reserved.
Model building
 20 seconds per
training activity
 Two second moving
window on training
data
 Features: time-
domain summary
statistics and Fourier
transform coefficients
15© Copyright 2016 Pivotal. All rights reserved.
Model training approaches
1. Near-real-time model training
– Use small batches to train model
2. Real-time model training
– Online machine learning algorithm : continually update model
using each new data point
3. Offline model training
– Build a model offline using batches
– Useful for models requiring finer model tuning and calibration
16© Copyright 2016 Pivotal. All rights reserved.
Feature Engineering
• Time-domain
transformations
• Fast Fourier Transform
analysis
Machine Learning
Classification Model
• Random Forest Model
using 2 second time
windows (30 samples)
Trained model
Streaming input window
Model
Prediction
API Call
Model
prediction
PCF App:
Scoring app
• Real-time model scoring
• The dashboard initiates a request via
an API call and receives a model
prediction
{ "channel": "1234",
"label": ”walking",
"label_value": 0.746 }
17© Copyright 2016 Pivotal. All rights reserved.
1. Application auto-scaling
– As the data grows, the model scales
2. Application autonomy
– The model application is independent of other applications = faster
development iterations
– Faster development = rapid feedback loop
3. Multiple applications can access model scoring app
Operationalizing scalable data science applications
Model scoring as a service
Why?
18© Copyright 2016 Pivotal. All rights reserved.
Today’s talk
1. A real-time data science app
A. The app: a live demonstration
B. How can a data scientist build a data science application?
C. Revisiting the app
2. Generalizing the framework: Solving new data science
challenges
A. Internet of Things – creating a smart app
B. Financial data - How can retail banks influence their cardholders’
behavior?
19© Copyright 2016 Pivotal. All rights reserved.
App
20© Copyright 2016 Pivotal. All rights reserved.
Today’s talk
1. A real-time data science app
A. The app: a live demonstration
B. How can a data scientist build a data science application?
C. Revisiting the app
2. Generalizing the framework: Solving new data science
challenges
A. Internet of Things – Creating a smart app to prevent oil spill disasters
B. Financial data - How can retail banks influence their cardholders’
behavior?
21© Copyright 2016 Pivotal. All rights reserved.
Gene Sequencing
Smart Grids
COST TO SEQUENCE
ONE GENOME
HAS FALLEN FROM
$100M IN
2001
TO $10K IN 2011
TO $1K IN 2014
READING SMART METERS
EVERY 15 MINUTES IS
3000X MORE
DATA INTENSIVE
Stock Market
Social Media
FACEBOOK UPLOADS
250 MILLION
PHOTOS EACH DAY
In all industries billions of data points represent
opportunities for the Internet of Things
Oil Exploration
Video Surveillance
OIL RIGS GENERATE
25000
DATA POINTS
PER SECOND
Medical Imaging
Mobile Sensors
22© Copyright 2016 Pivotal. All rights reserved.
How can we use data
to help prevent
accidents like the Macondo
Disaster ?
23© Copyright 2016 Pivotal. All rights reserved. 23© Copyright 2016 Pivotal. All rights reserved.
…by creating a Smart Application
24© Copyright 2016 Pivotal. All rights reserved.
Training
app
Model
Scoring as
a service
Model
Training as
a service
Sensor
app
Scoring
app
Dashboard
app
Data science workflow: Movement classification
25© Copyright 2016 Pivotal. All rights reserved.
Training
app
Model
Scoring as
a service
Model
Training as
a service
Sensor
app
Scoring
app
Dashboard
app
Data science workflow: Creating a smart app to
prevent oil spill disasters • Alert operator
• Send signal to control system
to change operating
parameters
• Replace old machinery
• Shut down plant
26© Copyright 2016 Pivotal. All rights reserved.
Training
app
Model
Scoring as
a service
Model
Training as
a service
Sensor
app
Scoring
app
Dashboard
app
Data science workflow: How can retail banks influence their
cardholders’ behavior? • Provide customized services
and promotions
• Next best offer
• Characterize and improve
customer satisfaction
27© Copyright 2016 Pivotal. All rights reserved.
Thank you
Questions and comments
crawles@pivotal.io
28© Copyright 2016 Pivotal. All rights reserved.

Weitere ähnliche Inhalte

Was ist angesagt?

Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Hong-Linh Truong
 
bbva_redhat-theAPIHour_IoT_Day-DavidBericat
bbva_redhat-theAPIHour_IoT_Day-DavidBericatbbva_redhat-theAPIHour_IoT_Day-DavidBericat
bbva_redhat-theAPIHour_IoT_Day-DavidBericat
David Bericat
 

Was ist angesagt? (20)

The Prospect of IoT in the Oil & Gas
The Prospect of IoT in the Oil & Gas The Prospect of IoT in the Oil & Gas
The Prospect of IoT in the Oil & Gas
 
NUS-ISS Learning Day 2018- Designing software to make the most of cloud platf...
NUS-ISS Learning Day 2018- Designing software to make the most of cloud platf...NUS-ISS Learning Day 2018- Designing software to make the most of cloud platf...
NUS-ISS Learning Day 2018- Designing software to make the most of cloud platf...
 
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
 
Ibm iot overview
Ibm   iot overviewIbm   iot overview
Ibm iot overview
 
Device to Intelligence, IOT and Big Data in Oracle
Device to Intelligence, IOT and Big Data in OracleDevice to Intelligence, IOT and Big Data in Oracle
Device to Intelligence, IOT and Big Data in Oracle
 
NUS-ISS Learning Day 2018- Harness the power of network effects and platform ...
NUS-ISS Learning Day 2018- Harness the power of network effects and platform ...NUS-ISS Learning Day 2018- Harness the power of network effects and platform ...
NUS-ISS Learning Day 2018- Harness the power of network effects and platform ...
 
Sap Leonardo IoT Overview
Sap Leonardo IoT OverviewSap Leonardo IoT Overview
Sap Leonardo IoT Overview
 
Innovation with AWS: IoT, Robotics and AI
Innovation with AWS: IoT, Robotics and AIInnovation with AWS: IoT, Robotics and AI
Innovation with AWS: IoT, Robotics and AI
 
Is cloud secure or not
Is cloud secure or notIs cloud secure or not
Is cloud secure or not
 
AIOps in 2020: A Beginner's Guide
AIOps in 2020: A Beginner's GuideAIOps in 2020: A Beginner's Guide
AIOps in 2020: A Beginner's Guide
 
Countering Cybersecurity Risk in Today's IoT World
Countering Cybersecurity Risk in Today's IoT WorldCountering Cybersecurity Risk in Today's IoT World
Countering Cybersecurity Risk in Today's IoT World
 
IoT and Big Data
IoT and Big DataIoT and Big Data
IoT and Big Data
 
Edge patterns in the IIoT
Edge patterns in the IIoTEdge patterns in the IIoT
Edge patterns in the IIoT
 
AIOps & Observability to Lead Your Digital Transformation
AIOps & Observability to Lead Your Digital TransformationAIOps & Observability to Lead Your Digital Transformation
AIOps & Observability to Lead Your Digital Transformation
 
Top 10 reasons your IoT project will fail
Top 10 reasons your IoT project will failTop 10 reasons your IoT project will fail
Top 10 reasons your IoT project will fail
 
Next Dimension + Cisco Smart Manufacturing
Next Dimension + Cisco Smart ManufacturingNext Dimension + Cisco Smart Manufacturing
Next Dimension + Cisco Smart Manufacturing
 
Internet of Things and Big Data
Internet of Things and Big DataInternet of Things and Big Data
Internet of Things and Big Data
 
Delivering Business Value from Operational Inisights at ING Bank
Delivering Business Value from Operational Inisights at ING BankDelivering Business Value from Operational Inisights at ING Bank
Delivering Business Value from Operational Inisights at ING Bank
 
bbva_redhat-theAPIHour_IoT_Day-DavidBericat
bbva_redhat-theAPIHour_IoT_Day-DavidBericatbbva_redhat-theAPIHour_IoT_Day-DavidBericat
bbva_redhat-theAPIHour_IoT_Day-DavidBericat
 
Decide if PhoneGap is for you as your mobile platform selection
Decide if PhoneGap is for you as your mobile platform selectionDecide if PhoneGap is for you as your mobile platform selection
Decide if PhoneGap is for you as your mobile platform selection
 

Andere mochten auch

Andere mochten auch (13)

Tablet Usage Study: Bangalore Chapter
Tablet Usage Study: Bangalore ChapterTablet Usage Study: Bangalore Chapter
Tablet Usage Study: Bangalore Chapter
 
Connecting devices to the internet of things
Connecting devices to the internet of thingsConnecting devices to the internet of things
Connecting devices to the internet of things
 
Confluence Keynote- Leading the change in a connected world - Pari Natarajan
Confluence Keynote- Leading the change in a connected world - Pari NatarajanConfluence Keynote- Leading the change in a connected world - Pari Natarajan
Confluence Keynote- Leading the change in a connected world - Pari Natarajan
 
Creating an Internet of Everything
Creating an Internet of Everything Creating an Internet of Everything
Creating an Internet of Everything
 
Zinnov DRAUP : Tech Mafia Disrupting Multiple Industries
Zinnov DRAUP :  Tech Mafia Disrupting Multiple IndustriesZinnov DRAUP :  Tech Mafia Disrupting Multiple Industries
Zinnov DRAUP : Tech Mafia Disrupting Multiple Industries
 
Internet of Things Innovations & Megatrends Update 12/14/16
Internet of Things Innovations & Megatrends Update 12/14/16Internet of Things Innovations & Megatrends Update 12/14/16
Internet of Things Innovations & Megatrends Update 12/14/16
 
Zinnov DRAUP : Engineering Services Outsourcing Overview
Zinnov DRAUP : Engineering Services Outsourcing OverviewZinnov DRAUP : Engineering Services Outsourcing Overview
Zinnov DRAUP : Engineering Services Outsourcing Overview
 
Dr David Soldani : Leading the disruptions | Zinnov Confluence '16 Munich
Dr David Soldani : Leading the disruptions | Zinnov Confluence '16 MunichDr David Soldani : Leading the disruptions | Zinnov Confluence '16 Munich
Dr David Soldani : Leading the disruptions | Zinnov Confluence '16 Munich
 
The Future is Internet of Things (IOT)
The Future is Internet of Things (IOT)The Future is Internet of Things (IOT)
The Future is Internet of Things (IOT)
 
The importance of strong entropy for iot
The importance of strong entropy for iotThe importance of strong entropy for iot
The importance of strong entropy for iot
 
Designing IoT Experiences in the Age of Humanised Technology by Guido Woska, ...
Designing IoT Experiences in the Age of Humanised Technology by Guido Woska, ...Designing IoT Experiences in the Age of Humanised Technology by Guido Woska, ...
Designing IoT Experiences in the Age of Humanised Technology by Guido Woska, ...
 
Digital strategy to transform GE into digital industrial leader
Digital strategy to transform GE into digital industrial leaderDigital strategy to transform GE into digital industrial leader
Digital strategy to transform GE into digital industrial leader
 
Zinnov Zones for IoT Services 2017
Zinnov Zones for IoT Services 2017Zinnov Zones for IoT Services 2017
Zinnov Zones for IoT Services 2017
 

Ähnlich wie Data Science Powered Apps for Internet of Things

Ähnlich wie Data Science Powered Apps for Internet of Things (20)

Learn How to Operationalize IoT Apps on Pivotal Cloud Foundry
Learn How to Operationalize IoT Apps on Pivotal Cloud FoundryLearn How to Operationalize IoT Apps on Pivotal Cloud Foundry
Learn How to Operationalize IoT Apps on Pivotal Cloud Foundry
 
The attention economy and the internet
The attention economy and the internetThe attention economy and the internet
The attention economy and the internet
 
Microservices: The Future-Proof Framework for IoT
Microservices: The Future-Proof Framework for IoTMicroservices: The Future-Proof Framework for IoT
Microservices: The Future-Proof Framework for IoT
 
Making your mobile testing strategy future-proof
Making your mobile testing strategy future-proofMaking your mobile testing strategy future-proof
Making your mobile testing strategy future-proof
 
WinOps Conf 2015 - John Rakowski - Militarise It for #DevOps success
WinOps Conf 2015 - John Rakowski - Militarise It for #DevOps successWinOps Conf 2015 - John Rakowski - Militarise It for #DevOps success
WinOps Conf 2015 - John Rakowski - Militarise It for #DevOps success
 
IRJET- Medicine Information Retrieval Application- Pharmaguide
IRJET- Medicine Information Retrieval Application- PharmaguideIRJET- Medicine Information Retrieval Application- Pharmaguide
IRJET- Medicine Information Retrieval Application- Pharmaguide
 
Best Practices and Advanced Insights on Browser RUM Users - AppSphere16
Best Practices and Advanced Insights on Browser RUM Users - AppSphere16Best Practices and Advanced Insights on Browser RUM Users - AppSphere16
Best Practices and Advanced Insights on Browser RUM Users - AppSphere16
 
How to Apply Machine Learning with R, H20, Apache Spark MLlib or PMML to Real...
How to Apply Machine Learning with R, H20, Apache Spark MLlib or PMML to Real...How to Apply Machine Learning with R, H20, Apache Spark MLlib or PMML to Real...
How to Apply Machine Learning with R, H20, Apache Spark MLlib or PMML to Real...
 
Handling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in FinanceHandling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in Finance
 
The Live: Stream Computing
The Live: Stream ComputingThe Live: Stream Computing
The Live: Stream Computing
 
Smart Traffic Monitoring System Report
Smart Traffic Monitoring System ReportSmart Traffic Monitoring System Report
Smart Traffic Monitoring System Report
 
HOW OPEN APIS WILL CHANGE THE FUTURE OF BANKING
HOW OPEN APIS WILL CHANGE THE FUTURE OF BANKINGHOW OPEN APIS WILL CHANGE THE FUTURE OF BANKING
HOW OPEN APIS WILL CHANGE THE FUTURE OF BANKING
 
Confluent x imply: Build the last mile to value for data streaming applications
Confluent x imply:  Build the last mile to value for data streaming applicationsConfluent x imply:  Build the last mile to value for data streaming applications
Confluent x imply: Build the last mile to value for data streaming applications
 
Android Application for Updation
Android Application for UpdationAndroid Application for Updation
Android Application for Updation
 
#Mcanada mobile apps stein 11.15
#Mcanada mobile apps stein 11.15#Mcanada mobile apps stein 11.15
#Mcanada mobile apps stein 11.15
 
Farmer’s Plaza: A Cloud-Enabled Mobile and Web Based Application For Farmers
Farmer’s Plaza: A Cloud-Enabled Mobile and Web Based Application For FarmersFarmer’s Plaza: A Cloud-Enabled Mobile and Web Based Application For Farmers
Farmer’s Plaza: A Cloud-Enabled Mobile and Web Based Application For Farmers
 
Entreprise mobility approach within digital transformation
Entreprise mobility approach within digital transformationEntreprise mobility approach within digital transformation
Entreprise mobility approach within digital transformation
 
Findability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learningFindability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learning
 
Blood Donors and Receivers Management System
Blood Donors and Receivers Management SystemBlood Donors and Receivers Management System
Blood Donors and Receivers Management System
 
7 Tips for Optimizing Mobile App Metrics
7 Tips for Optimizing Mobile App Metrics7 Tips for Optimizing Mobile App Metrics
7 Tips for Optimizing Mobile App Metrics
 

Mehr von VMware Tanzu

Mehr von VMware Tanzu (20)

What AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About ItWhat AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About It
 
Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023
 
Enhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at ScaleEnhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at Scale
 
Spring Update | July 2023
Spring Update | July 2023Spring Update | July 2023
Spring Update | July 2023
 
Platforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a ProductPlatforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a Product
 
Building Cloud Ready Apps
Building Cloud Ready AppsBuilding Cloud Ready Apps
Building Cloud Ready Apps
 
Spring Boot 3 And Beyond
Spring Boot 3 And BeyondSpring Boot 3 And Beyond
Spring Boot 3 And Beyond
 
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfSpring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
 
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
 
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
 
tanzu_developer_connect.pptx
tanzu_developer_connect.pptxtanzu_developer_connect.pptx
tanzu_developer_connect.pptx
 
Tanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - FrenchTanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - French
 
Tanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - EnglishTanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - English
 
Virtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - EnglishVirtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - English
 
Tanzu Developer Connect - French
Tanzu Developer Connect - FrenchTanzu Developer Connect - French
Tanzu Developer Connect - French
 
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
 
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring BootSpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
 
SpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software EngineerSpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software Engineer
 
SpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs PracticeSpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs Practice
 
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense SolutionsSpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
 

Kürzlich hochgeladen

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Kürzlich hochgeladen (20)

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 

Data Science Powered Apps for Internet of Things

  • 1. 1© Copyright 2016 Pivotal. All rights reserved. Data Science-Powered Apps for the Internet of Things Chris Rawles1 and Jarrod Vawdrey2 1. Sr. Data Scientist in New York, New York 2. Sr. Data Scientist in Atlanta, Georgia
  • 2. 2© Copyright 2016 Pivotal. All rights reserved. Today’s talk 1. A real-time data science app A. The app: a live demonstration B. How can a data scientist build a data science application? C. Revisiting the app 2. Generalizing the framework: Solving new data science challenges A. Internet of Things – Creating a smart app to prevent oil spill disasters B. Financial data - How can retail banks influence their cardholders’ behavior?
  • 3. 3© Copyright 2016 Pivotal. All rights reserved. Today’s talk 1. A real-time data science app A. The app: a live demonstration B. How can a data scientist build a data science application? C. Revisiting the app 2. Generalizing the framework: Solving new data science challenges A. Internet of Things – Creating a smart app to prevent oil spill disasters B. Financial data - How can retail banks influence their cardholders’ behavior?
  • 4. 4© Copyright 2016 Pivotal. All rights reserved. App
  • 5. 5© Copyright 2016 Pivotal. All rights reserved. Today’s talk 1. A real-time data science app A. The app: a live demonstration B. How can a data scientist build a data science application? C. Revisiting the app 2. Generalizing the framework: Solving new data science challenges A. Internet of Things – creating a smart app B. Financial data - How can retail banks influence their cardholders’ behavior?
  • 6. 6© Copyright 2016 Pivotal. All rights reserved. Training app Model Scoring as a service Model Training as a service Sensor app Scoring app Dashboard app Data science workflow: Movement classification 1. Sensor + Dashboard 2. Redis 3. Training app 4. Scoring app
  • 7. 7© Copyright 2016 Pivotal. All rights reserved. here is my source code run it on the cloud for me - Onsi Fakhouri @onsijoe i do not care how
  • 8. 8© Copyright 2016 Pivotal. All rights reserved. cf push  CF determines app type (Java, Python, Ruby, …)  Installs necessary environment  Provisions and binds data services  Creates domain, routing, and load balancing  Continual app health checks and restarts
  • 9. 9© Copyright 2016 Pivotal. All rights reserved. Data ingestion: Accelerometric data  Accelerometric data streamed from mobile phone at 15 Hz (15x / second)  Other sensor data: gyroscopic data, magnetometer data, lon/lat, etc. Accelerometer axes
  • 10. 10© Copyright 2016 Pivotal. All rights reserved.  For real-time applications, low-latency data ingestion into the data store is essential  WebSocket protocol - socket.io – Mobile phone  Webserver – Webserver  Dashboard  socket.io  redis Data ingestion Training app Sensor app
  • 11. 11© Copyright 2016 Pivotal. All rights reserved. Data storage  We are using a redis store for: – Storing training data – Model persistence – Storing a micro-batch of scoring data  Other storage systems include GemFire, HAWQ/Hadoop, Greenplum Database, PostgreSQL, …
  • 12. 12© Copyright 2016 Pivotal. All rights reserved. Modeling Scalable machine learning applications in Pivotal Cloud Foundry 1. Training app 2. Scoring app
  • 13. 13© Copyright 2016 Pivotal. All rights reserved. Modeling – Training app  Goal: build a data-driven model that learns accelerometric motions associated with each activity Feature Engineering • Time-domain transformations • Fast Fourier Transform analysis Machine Learning Classification Model • Random Forest Model using 2 second time windows (30 samples) … Training data Trained model
  • 14. 14© Copyright 2016 Pivotal. All rights reserved. Model building  20 seconds per training activity  Two second moving window on training data  Features: time- domain summary statistics and Fourier transform coefficients
  • 15. 15© Copyright 2016 Pivotal. All rights reserved. Model training approaches 1. Near-real-time model training – Use small batches to train model 2. Real-time model training – Online machine learning algorithm : continually update model using each new data point 3. Offline model training – Build a model offline using batches – Useful for models requiring finer model tuning and calibration
  • 16. 16© Copyright 2016 Pivotal. All rights reserved. Feature Engineering • Time-domain transformations • Fast Fourier Transform analysis Machine Learning Classification Model • Random Forest Model using 2 second time windows (30 samples) Trained model Streaming input window Model Prediction API Call Model prediction PCF App: Scoring app • Real-time model scoring • The dashboard initiates a request via an API call and receives a model prediction { "channel": "1234", "label": ”walking", "label_value": 0.746 }
  • 17. 17© Copyright 2016 Pivotal. All rights reserved. 1. Application auto-scaling – As the data grows, the model scales 2. Application autonomy – The model application is independent of other applications = faster development iterations – Faster development = rapid feedback loop 3. Multiple applications can access model scoring app Operationalizing scalable data science applications Model scoring as a service Why?
  • 18. 18© Copyright 2016 Pivotal. All rights reserved. Today’s talk 1. A real-time data science app A. The app: a live demonstration B. How can a data scientist build a data science application? C. Revisiting the app 2. Generalizing the framework: Solving new data science challenges A. Internet of Things – creating a smart app B. Financial data - How can retail banks influence their cardholders’ behavior?
  • 19. 19© Copyright 2016 Pivotal. All rights reserved. App
  • 20. 20© Copyright 2016 Pivotal. All rights reserved. Today’s talk 1. A real-time data science app A. The app: a live demonstration B. How can a data scientist build a data science application? C. Revisiting the app 2. Generalizing the framework: Solving new data science challenges A. Internet of Things – Creating a smart app to prevent oil spill disasters B. Financial data - How can retail banks influence their cardholders’ behavior?
  • 21. 21© Copyright 2016 Pivotal. All rights reserved. Gene Sequencing Smart Grids COST TO SEQUENCE ONE GENOME HAS FALLEN FROM $100M IN 2001 TO $10K IN 2011 TO $1K IN 2014 READING SMART METERS EVERY 15 MINUTES IS 3000X MORE DATA INTENSIVE Stock Market Social Media FACEBOOK UPLOADS 250 MILLION PHOTOS EACH DAY In all industries billions of data points represent opportunities for the Internet of Things Oil Exploration Video Surveillance OIL RIGS GENERATE 25000 DATA POINTS PER SECOND Medical Imaging Mobile Sensors
  • 22. 22© Copyright 2016 Pivotal. All rights reserved. How can we use data to help prevent accidents like the Macondo Disaster ?
  • 23. 23© Copyright 2016 Pivotal. All rights reserved. 23© Copyright 2016 Pivotal. All rights reserved. …by creating a Smart Application
  • 24. 24© Copyright 2016 Pivotal. All rights reserved. Training app Model Scoring as a service Model Training as a service Sensor app Scoring app Dashboard app Data science workflow: Movement classification
  • 25. 25© Copyright 2016 Pivotal. All rights reserved. Training app Model Scoring as a service Model Training as a service Sensor app Scoring app Dashboard app Data science workflow: Creating a smart app to prevent oil spill disasters • Alert operator • Send signal to control system to change operating parameters • Replace old machinery • Shut down plant
  • 26. 26© Copyright 2016 Pivotal. All rights reserved. Training app Model Scoring as a service Model Training as a service Sensor app Scoring app Dashboard app Data science workflow: How can retail banks influence their cardholders’ behavior? • Provide customized services and promotions • Next best offer • Characterize and improve customer satisfaction
  • 27. 27© Copyright 2016 Pivotal. All rights reserved. Thank you Questions and comments crawles@pivotal.io
  • 28. 28© Copyright 2016 Pivotal. All rights reserved.