SlideShare a Scribd company logo
1 of 22
Download to read offline
DATA WAREHOUSE MODERNIZATION PROGRAMME 
TOBY WOOLFE 
BIG DATA SOLUTIONS LEADER IBM
© 2014 IBM Corporation 
Case studies from around the world 
help explain what is working in the 
world of Hadoop, real time analytics 
and internet connected devices. 
IBM delivers a significant data 
warehouse modernization programme 
to General Motors 
Toby Woolfe, Big Data Industrial 
and Automotive Solutions 
Leader, IBM Europe 
Toby.woolfe@uk.ibm.com +44 7795 328 742
A New Era of Smart 
We are on the precipice of awareness of, and adoption of, internet 
connected devices far beyond the Smartphone... 
Devices: 145 actuators, 70 on-board 
computers, 4700 relays 
and 70 sensors, including radar, 
sonar, accelerometer, 
camera, rain sensors. 
433MB per minute 
1 car year = 1TB 
Plus existing enterprise data, 
social media data, 3rd party 
suppliers,...etc 
2 © 2014 IBM Corporation
A New Era of Smart 
But how fast does information currently get delivered? 
Monthly 
reporting? 
Based originally on the 
lunar cycle, the Julian 
Calendar was 
introduced by Julius 
Caesar in 46BC 
Quarterly reporting? 
Celebrated by Druids for centuries, the 
four seasons result from the yearly 
revolution of the earth around the sun 
and the tilt of the earth’s axis relative 
to the plane of revolution. 
Annual 
reporting? 
In temperate and 
polar regions, the 
seasons are marked 
by changes in the 
intensity of sunlight 
that reaches the 
Earth's surface, 
variations of which 
may cause animals to 
go into hibernation or 
to migrate, and plants 
to be dormant 
Weekly 
reporting? 
Recorded in 
Babylonian 
carvings dated 6th 
century, the origin 
of the 7 day week 
was probably 
based on a quarter 
of the lunar cycle 
(though inaccurate) 
3 
© 2014 IBM Corporation
A New Era of Smart 
“GM Opens New Data Center Modeled on Google, 
Facebook” 
CIO Randy Mott launched a new $130 
million data center 
Chief Executive Dan Akerson said ”If 
we are going to win, we must turn IT 
into a competitive advantage and not 
treat it as something that is just a 
utility,” 
The IT Operations and Command 
Center in the $130-million Enterprise 
Data Center at the General Motors 
Technical Center in Warren, Michigan. 
4 
© 2014 IBM Corporation
A New Era of Smart 
GM’s people, process and technology evolution 
 GM has hired about 1,500 software developers and engineers, up from “close to 
zero” just one year ago, according to Tim Cox, CIO of GM global development 
services 
 About 70% of the 10,000 IT staff will be focused on innovation 
 Until recently, GM was using 23 different data centers. Now the company is moving 
to two 
 The site in Warren has the capacity to hold more than 10,000 ‘pizza box’ size 
servers, as well as smaller-size servers and larger mainframes 
 The data warehouse incorporates both the cutting edge IBM BigInsights for Hadoop 
technology as well as more traditional Massively Parallel Processing technologies 
historically used for data warehousing 
Source/thanks to 
5 
© 2014 IBM Corporation
A New Era of Smart 
General Motors wiki 
Business profile 
 Global vehicle sales leader for 77 consecutive years from 1931 – 2007 
 212,000 staff in 157 countries 
Business problem 
 GM has issued 45 recalls in 2014 involving 28 million cars worldwide 
 The cost of one recall due to faulty ignition switches, which has been linked to at least 13 
deaths, is $1.3 billion 
IT transformation 
 General Motors (GM) is in the process of constructing a single global information warehouse 
that will become the foundation for all business analytics and decision support across the 
enterprise. 
 The business objectives are 
– to reduce operating expenses for data warehouses by 25-30% 
– deliver 10x the measurable business value 
– double IT project capability 
– and integrate existing data warehouse systems with new technology to offload 
data, reduce latency, reduce costs and improve performance. 
6 
© 2014 IBM Corporation
A New Era of Smart 
Abstract: IBM delivers a significant data warehouse modernization 
programme to General Motors 
 High performance and continuously available data management environment 
 329,000 users with a 10% concurrency rate 
 Grow to approximately one petabyte in size over three years 
 IBM proposed a centralized enterprise architecture leveraging a traditional data warehouse 
matched with IBM Big Insights Hadoop technology for big data analytics 
Data types: 
• Inventory control of parts 
• Manufacturing equipment and 
assembly line data 
• Warranty and services data from 
dealers 
• Telemetry data from vehicles 
• Customer services and social 
media data 
7 
© 2014 IBM Corporation
A New Era of Smart 
Big data = disparate data! 
8 
© 2014 IBM Corporation
A New Era of Smart 
Probabilistic matching 
9 
© 2014 IBM Corporation
A New Era of Smart 
Analyst validation of key IBM technologies.... 
....and Analyst validation of IBM potential future phase technologies.... 
10 
© 2014 IBM Corporation
A New Era of Smart 
Capabilities required 
1. Efficient data protocols 
2. Real time analytics 
3. Capture all data to a landing zone 
4. High performance analytics platform 
5. Application Platform 
6. Data governance & integration 
11 
© 2014 IBM Corporation
A New Era of Smart 
IBM conclusions applicable to enterprise data warehouse 
modernisation learned from work with GM 
 GM’s objectives published in WSJ: 
–Reduce operating expenses for data warehouses by 25-30% 
–Deliver 10x the measurable business value 
–Double IT project capability 
–Integrate existing data warehouse systems with new technology 
to offload data, reduce latency, reduce costs and improve 
performance 
 Educate enterprise architecture staff on the new disruptive technology capabilities 
delivered from Hadoop, and its integration into existing data architectures 
 Engage in a revue of data warehouse architecture to harness the cost savings, speed 
advantages and business focus that new technologies such as IBM BigInsights for 
Hadoop brings to market 
 Experiment with new data asset/business objective combinations with IBM’s Big Data 
experts 
12 
© 2014 IBM Corporation
A New Era of Smart 
Case study illustrates the hidden value of machine data 
For a volume 
production car fleet 
using telematics data, 
IBM correctly predicted 
warranty codes/claims 
43 days out with 86% 
accuracy and only 1% 
false positives 
For a new electric car model, IBM 
predicted warranty claims 60 days out 
with 96% accuracy and 1% false 
positives on 619 vehicles using Big 
Data and analytics technology 
13 
© 2014 IBM Corporation
A New Era of Smart 
Broad range of role based visualisation and analytical capabilities 
Data scientist Business person 
R Script support SPSS Predictive analytics Integration with Cognos BI 
Spreadsheet visualisation of data 360 degree view of machine/client/location 
Delivered using the 
BigInsights wizard 
to accelerate the 
use of Hadoop data 
14 
© 2014 IBM Corporation
A New Era of Smart 
BigInsights visualisation examples 
1 –Hadoop raw data in spreadsheet format 
2 - Sort, filter etc. 
15 
© 2014 IBM Corporation
A New Era of Smart 
...and onto many kinds of analytical capability using Cognos 
16 
© 2014 IBM Corporation
A New Era of Smart 
How is Big Data Being Used in Industry? 
Product Development & 
Engineering 
 Patent Analytics 
 Social Media Analytics 
 Quality & Warranty Analytics 
Manufacturing & 
Quality 
 Predictive Maintenance 
 Quality Early Warning 
 Operational Efficiency 
 Supply Chain Analytics 
Connected Vehicle 
 V2V, V2C, V2I, V2X 
 Automotive Telematics 
 Real-time Alerts 
 Geospatial Analytics 
 Text Analytics 
 Predictive Failsafe Analytics 
 Driver Specific Predictive Analytics 
 Condition Based Monitoring 
Marketing & Sales 
 Social Media Analytics 
 Text Analytics 
 Customer 360 
 Actionable Customer 
Intelligence 
 Next Best Action 
After Sales 
 Fraud Analytics 
 Regulatory Analytics 
 Social Media Analytics 
 Warranty Analytics 
 Customer 360 
 Next Best Action 
17 © 2014 IBM Corporation
A New Era of Smart 
Experiment with Hadoop 
18 
© 2014 IBM Corporation
A New Era of Smart 
Suggested next action: 
Create an inventory of your structured and unstructured data and assess where 
business value will be found 
Internal External 
Complaints 
Telematic 
Data 
Workflow 
Data 
Geolocation 
Txn History 
Data 
Machine 
failure data 
Financial 
Data 
Shipping 
schedule 
data 
Warranty 
History 
Payment 
History 
Machine 
Performance 
Call 
Data 
Recordings 
Google 
Alerts 
Social Media 
Maintenance 
records 
Activities 
Data 
Catastrophic 
Emails 
Web 
Traffic 
Data 
Structured Unstructured 
MDM 
Customer 
Data 
Relatively easy 
to acquire Difficult to 
acquire 
19 © 2014 IBM Corporation
A New Era of Smart 
20 
20 © 2014 IBM Corporation
17TH ~ 18th NOV 2014 
MADRID (SPAIN)

More Related Content

What's hot

Intelligent Manufacturing system Final 1
Intelligent Manufacturing system Final 1Intelligent Manufacturing system Final 1
Intelligent Manufacturing system Final 1
Harish Pant
 
Presentation-Watson_IoT_Platform-Long-08Feb2016
Presentation-Watson_IoT_Platform-Long-08Feb2016Presentation-Watson_IoT_Platform-Long-08Feb2016
Presentation-Watson_IoT_Platform-Long-08Feb2016
Nikhil Dikshit
 
The Future of Data in Telecom and the Rise of Connected Communities
The Future of Data in Telecom and the Rise of Connected CommunitiesThe Future of Data in Telecom and the Rise of Connected Communities
The Future of Data in Telecom and the Rise of Connected Communities
DataWorks Summit
 

What's hot (20)

New Technologies For The Sustainable Enterprise; keynote @Wharton
New Technologies For The Sustainable Enterprise; keynote @WhartonNew Technologies For The Sustainable Enterprise; keynote @Wharton
New Technologies For The Sustainable Enterprise; keynote @Wharton
 
HP Communications and Media | Solutions IoT Platform
HP Communications and Media | Solutions IoT Platform HP Communications and Media | Solutions IoT Platform
HP Communications and Media | Solutions IoT Platform
 
Genius of Things Houston
Genius of Things HoustonGenius of Things Houston
Genius of Things Houston
 
Intelligent Manufacturing system Final 1
Intelligent Manufacturing system Final 1Intelligent Manufacturing system Final 1
Intelligent Manufacturing system Final 1
 
Insight Control Panel for SAP PM, IBM Maximo and Infor EAM
Insight Control Panel for SAP PM, IBM Maximo and Infor EAMInsight Control Panel for SAP PM, IBM Maximo and Infor EAM
Insight Control Panel for SAP PM, IBM Maximo and Infor EAM
 
Presentation-Watson_IoT_Platform-Long-08Feb2016
Presentation-Watson_IoT_Platform-Long-08Feb2016Presentation-Watson_IoT_Platform-Long-08Feb2016
Presentation-Watson_IoT_Platform-Long-08Feb2016
 
Big data – ready for business
Big data – ready for businessBig data – ready for business
Big data – ready for business
 
Dell AI Oil and Gas Webinar
Dell AI Oil and Gas WebinarDell AI Oil and Gas Webinar
Dell AI Oil and Gas Webinar
 
Hannover Messe: Evolution of a cognitive Digital Twin
Hannover Messe: Evolution of a cognitive Digital TwinHannover Messe: Evolution of a cognitive Digital Twin
Hannover Messe: Evolution of a cognitive Digital Twin
 
Microsoft Internet of Things konference 2015 - Microsoft og Internet of Things
Microsoft Internet of Things konference 2015 - Microsoft og Internet of ThingsMicrosoft Internet of Things konference 2015 - Microsoft og Internet of Things
Microsoft Internet of Things konference 2015 - Microsoft og Internet of Things
 
The z13 and The Mobile & Analytics Tsunami Hélène Lyon
The z13 and The Mobile & Analytics Tsunami Hélène LyonThe z13 and The Mobile & Analytics Tsunami Hélène Lyon
The z13 and The Mobile & Analytics Tsunami Hélène Lyon
 
2016 ibm watson io t forum 躍升雲端 敏捷打造物聯網平台
2016 ibm watson io t forum 躍升雲端 敏捷打造物聯網平台2016 ibm watson io t forum 躍升雲端 敏捷打造物聯網平台
2016 ibm watson io t forum 躍升雲端 敏捷打造物聯網平台
 
Dell AI and HPC University Roadshow
Dell AI and HPC University RoadshowDell AI and HPC University Roadshow
Dell AI and HPC University Roadshow
 
Ac ford innovation day-fina lv2
Ac ford innovation day-fina lv2Ac ford innovation day-fina lv2
Ac ford innovation day-fina lv2
 
The Future of Data in Telecom and the Rise of Connected Communities
The Future of Data in Telecom and the Rise of Connected CommunitiesThe Future of Data in Telecom and the Rise of Connected Communities
The Future of Data in Telecom and the Rise of Connected Communities
 
IBM Spain BP Storage Day Inigo Osoro
IBM Spain BP Storage Day    Inigo OsoroIBM Spain BP Storage Day    Inigo Osoro
IBM Spain BP Storage Day Inigo Osoro
 
Systems of Intelligence - Wikibon/theCUBE
Systems of Intelligence - Wikibon/theCUBESystems of Intelligence - Wikibon/theCUBE
Systems of Intelligence - Wikibon/theCUBE
 
The world of Machine Learning, Deep Learning and PowerAI
The world of Machine Learning, Deep Learning and PowerAIThe world of Machine Learning, Deep Learning and PowerAI
The world of Machine Learning, Deep Learning and PowerAI
 
Industry 4.0 Plymouth Manufacturing Group
Industry 4.0 Plymouth Manufacturing Group Industry 4.0 Plymouth Manufacturing Group
Industry 4.0 Plymouth Manufacturing Group
 
Big Data in Global Telecom Market: Key Trends, Market Opportunities and Indus...
Big Data in Global Telecom Market: Key Trends, Market Opportunities and Indus...Big Data in Global Telecom Market: Key Trends, Market Opportunities and Indus...
Big Data in Global Telecom Market: Key Trends, Market Opportunities and Indus...
 

Viewers also liked

Viewers also liked (20)

Getting the best insights from your data using Apache Metamodel by Alberto Ro...
Getting the best insights from your data using Apache Metamodel by Alberto Ro...Getting the best insights from your data using Apache Metamodel by Alberto Ro...
Getting the best insights from your data using Apache Metamodel by Alberto Ro...
 
State of Play. Data Science on Hadoop in 2015 by SEAN OWEN at Big Data Spain ...
State of Play. Data Science on Hadoop in 2015 by SEAN OWEN at Big Data Spain ...State of Play. Data Science on Hadoop in 2015 by SEAN OWEN at Big Data Spain ...
State of Play. Data Science on Hadoop in 2015 by SEAN OWEN at Big Data Spain ...
 
The top five questions to ask about NoSQL. JONATHAN ELLIS at Big Data Spain 2012
The top five questions to ask about NoSQL. JONATHAN ELLIS at Big Data Spain 2012The top five questions to ask about NoSQL. JONATHAN ELLIS at Big Data Spain 2012
The top five questions to ask about NoSQL. JONATHAN ELLIS at Big Data Spain 2012
 
Intro to the Big Data Spain 2014 conference
Intro to the Big Data Spain 2014 conferenceIntro to the Big Data Spain 2014 conference
Intro to the Big Data Spain 2014 conference
 
Location analytics by Marc Planaguma at Big Data Spain 2014
 Location analytics by Marc Planaguma at Big Data Spain 2014 Location analytics by Marc Planaguma at Big Data Spain 2014
Location analytics by Marc Planaguma at Big Data Spain 2014
 
Big Data the potential for data to improve service and business management by...
Big Data the potential for data to improve service and business management by...Big Data the potential for data to improve service and business management by...
Big Data the potential for data to improve service and business management by...
 
Big Data Web applications for Interactive Hadoop by ENRICO BERTI at Big Data...
 Big Data Web applications for Interactive Hadoop by ENRICO BERTI at Big Data... Big Data Web applications for Interactive Hadoop by ENRICO BERTI at Big Data...
Big Data Web applications for Interactive Hadoop by ENRICO BERTI at Big Data...
 
Dataflows: The abstraction that powers Big Data by Raul Castro Fernandez at ...
 Dataflows: The abstraction that powers Big Data by Raul Castro Fernandez at ... Dataflows: The abstraction that powers Big Data by Raul Castro Fernandez at ...
Dataflows: The abstraction that powers Big Data by Raul Castro Fernandez at ...
 
ToroDB: Scaling PostgreSQL like MongoDB by Álvaro Hernández at Big Data Spain...
ToroDB: Scaling PostgreSQL like MongoDB by Álvaro Hernández at Big Data Spain...ToroDB: Scaling PostgreSQL like MongoDB by Álvaro Hernández at Big Data Spain...
ToroDB: Scaling PostgreSQL like MongoDB by Álvaro Hernández at Big Data Spain...
 
Convergent Replicated Data Types in Riak 2.0
Convergent Replicated Data Types in Riak 2.0Convergent Replicated Data Types in Riak 2.0
Convergent Replicated Data Types in Riak 2.0
 
CloudMC: A cloud computing map-reduce implementation for radiotherapy. RUBEN ...
CloudMC: A cloud computing map-reduce implementation for radiotherapy. RUBEN ...CloudMC: A cloud computing map-reduce implementation for radiotherapy. RUBEN ...
CloudMC: A cloud computing map-reduce implementation for radiotherapy. RUBEN ...
 
How to integrate Big Data onto an analytical portal, Big Data benchmarking fo...
How to integrate Big Data onto an analytical portal, Big Data benchmarking fo...How to integrate Big Data onto an analytical portal, Big Data benchmarking fo...
How to integrate Big Data onto an analytical portal, Big Data benchmarking fo...
 
Essential ingredients for real time stream processing @Scale by Kartik pParam...
Essential ingredients for real time stream processing @Scale by Kartik pParam...Essential ingredients for real time stream processing @Scale by Kartik pParam...
Essential ingredients for real time stream processing @Scale by Kartik pParam...
 
IAd-learning: A new e-learning platform by José Antonio Omedes at Big Data Sp...
IAd-learning: A new e-learning platform by José Antonio Omedes at Big Data Sp...IAd-learning: A new e-learning platform by José Antonio Omedes at Big Data Sp...
IAd-learning: A new e-learning platform by José Antonio Omedes at Big Data Sp...
 
Analyzing organization e-mails in near real time using hadoop ecosystem tools...
Analyzing organization e-mails in near real time using hadoop ecosystem tools...Analyzing organization e-mails in near real time using hadoop ecosystem tools...
Analyzing organization e-mails in near real time using hadoop ecosystem tools...
 
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
 
Geospatial and bitemporal search in C* with pluggable Lucene index by Andrés ...
Geospatial and bitemporal search in C* with pluggable Lucene index by Andrés ...Geospatial and bitemporal search in C* with pluggable Lucene index by Andrés ...
Geospatial and bitemporal search in C* with pluggable Lucene index by Andrés ...
 
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
 
A new streaming computation engine for real-time analytics by Michael Barton ...
A new streaming computation engine for real-time analytics by Michael Barton ...A new streaming computation engine for real-time analytics by Michael Barton ...
A new streaming computation engine for real-time analytics by Michael Barton ...
 
Processing large-scale graphs with Google(TM) Pregel by MICHAEL HACKSTEIN at...
 Processing large-scale graphs with Google(TM) Pregel by MICHAEL HACKSTEIN at... Processing large-scale graphs with Google(TM) Pregel by MICHAEL HACKSTEIN at...
Processing large-scale graphs with Google(TM) Pregel by MICHAEL HACKSTEIN at...
 

Similar to Data warehouse modernization programme by TOBY WOOLFE at Big Data Spain 2014

Why Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & AnalyticsWhy Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & Analytics
Rick Perret
 
Ibm symp14 referent_christian klezl_cloud
Ibm symp14 referent_christian klezl_cloudIbm symp14 referent_christian klezl_cloud
Ibm symp14 referent_christian klezl_cloud
IBM Switzerland
 
Ahluwalia ibm up con keynote (published)
Ahluwalia   ibm up con keynote (published)Ahluwalia   ibm up con keynote (published)
Ahluwalia ibm up con keynote (published)
sapenov
 
Cloud Computing - A collection of predictions, principles and providers - Feb...
Cloud Computing - A collection of predictions, principles and providers - Feb...Cloud Computing - A collection of predictions, principles and providers - Feb...
Cloud Computing - A collection of predictions, principles and providers - Feb...
William Santiago
 

Similar to Data warehouse modernization programme by TOBY WOOLFE at Big Data Spain 2014 (20)

CMOfinalpresentation.ppt
CMOfinalpresentation.pptCMOfinalpresentation.ppt
CMOfinalpresentation.ppt
 
Miguel Angel Perdiguero - Head of BIG data & analytics Atos Iberia - semanain...
Miguel Angel Perdiguero - Head of BIG data & analytics Atos Iberia - semanain...Miguel Angel Perdiguero - Head of BIG data & analytics Atos Iberia - semanain...
Miguel Angel Perdiguero - Head of BIG data & analytics Atos Iberia - semanain...
 
IBM InfoSphere Data Replication for Big Data
IBM InfoSphere Data Replication for Big DataIBM InfoSphere Data Replication for Big Data
IBM InfoSphere Data Replication for Big Data
 
IBM InfoSphere Data Replication for Big Data
IBM InfoSphere Data Replication for Big DataIBM InfoSphere Data Replication for Big Data
IBM InfoSphere Data Replication for Big Data
 
Why Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & AnalyticsWhy Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & Analytics
 
IBM Z for the Digital Enterprise 2018 - Z Keynote
IBM Z for the Digital Enterprise 2018 - Z KeynoteIBM Z for the Digital Enterprise 2018 - Z Keynote
IBM Z for the Digital Enterprise 2018 - Z Keynote
 
Big Data & Analytics Day
Big Data & Analytics Day Big Data & Analytics Day
Big Data & Analytics Day
 
Make from your it department a competitive differentiator for your business
Make from your it department a competitive differentiator for your businessMake from your it department a competitive differentiator for your business
Make from your it department a competitive differentiator for your business
 
Use cases for Hadoop and Big Data Analytics - InfoSphere BigInsights
Use cases for Hadoop and Big Data Analytics - InfoSphere BigInsightsUse cases for Hadoop and Big Data Analytics - InfoSphere BigInsights
Use cases for Hadoop and Big Data Analytics - InfoSphere BigInsights
 
Ibm symp14 referent_christian klezl_cloud
Ibm symp14 referent_christian klezl_cloudIbm symp14 referent_christian klezl_cloud
Ibm symp14 referent_christian klezl_cloud
 
Ben amaba. cloud mobile v3
Ben amaba. cloud mobile v3Ben amaba. cloud mobile v3
Ben amaba. cloud mobile v3
 
Control m customers using big data
Control m customers using big dataControl m customers using big data
Control m customers using big data
 
Mindsphere: an open cloud-based IoT operating system for Industry
Mindsphere: an open cloud-based IoT operating system for IndustryMindsphere: an open cloud-based IoT operating system for Industry
Mindsphere: an open cloud-based IoT operating system for Industry
 
IBM Power Systems Outlook and Roadmap
IBM Power Systems Outlook and RoadmapIBM Power Systems Outlook and Roadmap
IBM Power Systems Outlook and Roadmap
 
IoT World 2019 Keynote: A Story of Transformational IoT: Do machines actually...
IoT World 2019 Keynote: A Story of Transformational IoT: Do machines actually...IoT World 2019 Keynote: A Story of Transformational IoT: Do machines actually...
IoT World 2019 Keynote: A Story of Transformational IoT: Do machines actually...
 
Ahluwalia ibm up con keynote (published)
Ahluwalia   ibm up con keynote (published)Ahluwalia   ibm up con keynote (published)
Ahluwalia ibm up con keynote (published)
 
Cloud Computing - A collection of predictions, principles and providers - Feb...
Cloud Computing - A collection of predictions, principles and providers - Feb...Cloud Computing - A collection of predictions, principles and providers - Feb...
Cloud Computing - A collection of predictions, principles and providers - Feb...
 
Industry 4.0
Industry 4.0Industry 4.0
Industry 4.0
 
Cloud as a growth engine for business
Cloud as a growth engine for businessCloud as a growth engine for business
Cloud as a growth engine for business
 
Bmc joe goldberg
Bmc joe goldbergBmc joe goldberg
Bmc joe goldberg
 

More from Big Data Spain

More from Big Data Spain (20)

Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017
Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017
Big Data, Big Quality? by Irene Gonzálvez at Big Data Spain 2017
 
Scaling a backend for a big data and blockchain environment by Rafael Ríos at...
Scaling a backend for a big data and blockchain environment by Rafael Ríos at...Scaling a backend for a big data and blockchain environment by Rafael Ríos at...
Scaling a backend for a big data and blockchain environment by Rafael Ríos at...
 
AI: The next frontier by Amparo Alonso at Big Data Spain 2017
AI: The next frontier by Amparo Alonso at Big Data Spain 2017AI: The next frontier by Amparo Alonso at Big Data Spain 2017
AI: The next frontier by Amparo Alonso at Big Data Spain 2017
 
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
 
Presentation: Boost Hadoop and Spark with in-memory technologies by Akmal Cha...
Presentation: Boost Hadoop and Spark with in-memory technologies by Akmal Cha...Presentation: Boost Hadoop and Spark with in-memory technologies by Akmal Cha...
Presentation: Boost Hadoop and Spark with in-memory technologies by Akmal Cha...
 
Data science for lazy people, Automated Machine Learning by Diego Hueltes at ...
Data science for lazy people, Automated Machine Learning by Diego Hueltes at ...Data science for lazy people, Automated Machine Learning by Diego Hueltes at ...
Data science for lazy people, Automated Machine Learning by Diego Hueltes at ...
 
Training Deep Learning Models on Multiple GPUs in the Cloud by Enrique Otero ...
Training Deep Learning Models on Multiple GPUs in the Cloud by Enrique Otero ...Training Deep Learning Models on Multiple GPUs in the Cloud by Enrique Otero ...
Training Deep Learning Models on Multiple GPUs in the Cloud by Enrique Otero ...
 
Unbalanced data: Same algorithms different techniques by Eric Martín at Big D...
Unbalanced data: Same algorithms different techniques by Eric Martín at Big D...Unbalanced data: Same algorithms different techniques by Eric Martín at Big D...
Unbalanced data: Same algorithms different techniques by Eric Martín at Big D...
 
State of the art time-series analysis with deep learning by Javier Ordóñez at...
State of the art time-series analysis with deep learning by Javier Ordóñez at...State of the art time-series analysis with deep learning by Javier Ordóñez at...
State of the art time-series analysis with deep learning by Javier Ordóñez at...
 
Trading at market speed with the latest Kafka features by Iñigo González at B...
Trading at market speed with the latest Kafka features by Iñigo González at B...Trading at market speed with the latest Kafka features by Iñigo González at B...
Trading at market speed with the latest Kafka features by Iñigo González at B...
 
Unified Stream Processing at Scale with Apache Samza by Jake Maes at Big Data...
Unified Stream Processing at Scale with Apache Samza by Jake Maes at Big Data...Unified Stream Processing at Scale with Apache Samza by Jake Maes at Big Data...
Unified Stream Processing at Scale with Apache Samza by Jake Maes at Big Data...
 
The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
 The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a... The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
 
Artificial Intelligence and Data-centric businesses by Óscar Méndez at Big Da...
Artificial Intelligence and Data-centric businesses by Óscar Méndez at Big Da...Artificial Intelligence and Data-centric businesses by Óscar Méndez at Big Da...
Artificial Intelligence and Data-centric businesses by Óscar Méndez at Big Da...
 
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
Why big data didn’t end causal inference by Totte Harinen at Big Data Spain 2017
 
Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at...
Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at...Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at...
Meme Index. Analyzing fads and sensations on the Internet by Miguel Romero at...
 
Vehicle Big Data that Drives Smart City Advancement by Mike Branch at Big Dat...
Vehicle Big Data that Drives Smart City Advancement by Mike Branch at Big Dat...Vehicle Big Data that Drives Smart City Advancement by Mike Branch at Big Dat...
Vehicle Big Data that Drives Smart City Advancement by Mike Branch at Big Dat...
 
End of the Myth: Ultra-Scalable Transactional Management by Ricardo Jiménez-P...
End of the Myth: Ultra-Scalable Transactional Management by Ricardo Jiménez-P...End of the Myth: Ultra-Scalable Transactional Management by Ricardo Jiménez-P...
End of the Myth: Ultra-Scalable Transactional Management by Ricardo Jiménez-P...
 
Attacking Machine Learning used in AntiVirus with Reinforcement by Rubén Mart...
Attacking Machine Learning used in AntiVirus with Reinforcement by Rubén Mart...Attacking Machine Learning used in AntiVirus with Reinforcement by Rubén Mart...
Attacking Machine Learning used in AntiVirus with Reinforcement by Rubén Mart...
 
More people, less banking: Blockchain by Salvador Casquero at Big Data Spain ...
More people, less banking: Blockchain by Salvador Casquero at Big Data Spain ...More people, less banking: Blockchain by Salvador Casquero at Big Data Spain ...
More people, less banking: Blockchain by Salvador Casquero at Big Data Spain ...
 
Make the elephant fly, once again by Sourygna Luangsay at Big Data Spain 2017
Make the elephant fly, once again by Sourygna Luangsay at Big Data Spain 2017Make the elephant fly, once again by Sourygna Luangsay at Big Data Spain 2017
Make the elephant fly, once again by Sourygna Luangsay at Big Data Spain 2017
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 

Data warehouse modernization programme by TOBY WOOLFE at Big Data Spain 2014

  • 1. DATA WAREHOUSE MODERNIZATION PROGRAMME TOBY WOOLFE BIG DATA SOLUTIONS LEADER IBM
  • 2. © 2014 IBM Corporation Case studies from around the world help explain what is working in the world of Hadoop, real time analytics and internet connected devices. IBM delivers a significant data warehouse modernization programme to General Motors Toby Woolfe, Big Data Industrial and Automotive Solutions Leader, IBM Europe Toby.woolfe@uk.ibm.com +44 7795 328 742
  • 3. A New Era of Smart We are on the precipice of awareness of, and adoption of, internet connected devices far beyond the Smartphone... Devices: 145 actuators, 70 on-board computers, 4700 relays and 70 sensors, including radar, sonar, accelerometer, camera, rain sensors. 433MB per minute 1 car year = 1TB Plus existing enterprise data, social media data, 3rd party suppliers,...etc 2 © 2014 IBM Corporation
  • 4. A New Era of Smart But how fast does information currently get delivered? Monthly reporting? Based originally on the lunar cycle, the Julian Calendar was introduced by Julius Caesar in 46BC Quarterly reporting? Celebrated by Druids for centuries, the four seasons result from the yearly revolution of the earth around the sun and the tilt of the earth’s axis relative to the plane of revolution. Annual reporting? In temperate and polar regions, the seasons are marked by changes in the intensity of sunlight that reaches the Earth's surface, variations of which may cause animals to go into hibernation or to migrate, and plants to be dormant Weekly reporting? Recorded in Babylonian carvings dated 6th century, the origin of the 7 day week was probably based on a quarter of the lunar cycle (though inaccurate) 3 © 2014 IBM Corporation
  • 5. A New Era of Smart “GM Opens New Data Center Modeled on Google, Facebook” CIO Randy Mott launched a new $130 million data center Chief Executive Dan Akerson said ”If we are going to win, we must turn IT into a competitive advantage and not treat it as something that is just a utility,” The IT Operations and Command Center in the $130-million Enterprise Data Center at the General Motors Technical Center in Warren, Michigan. 4 © 2014 IBM Corporation
  • 6. A New Era of Smart GM’s people, process and technology evolution  GM has hired about 1,500 software developers and engineers, up from “close to zero” just one year ago, according to Tim Cox, CIO of GM global development services  About 70% of the 10,000 IT staff will be focused on innovation  Until recently, GM was using 23 different data centers. Now the company is moving to two  The site in Warren has the capacity to hold more than 10,000 ‘pizza box’ size servers, as well as smaller-size servers and larger mainframes  The data warehouse incorporates both the cutting edge IBM BigInsights for Hadoop technology as well as more traditional Massively Parallel Processing technologies historically used for data warehousing Source/thanks to 5 © 2014 IBM Corporation
  • 7. A New Era of Smart General Motors wiki Business profile  Global vehicle sales leader for 77 consecutive years from 1931 – 2007  212,000 staff in 157 countries Business problem  GM has issued 45 recalls in 2014 involving 28 million cars worldwide  The cost of one recall due to faulty ignition switches, which has been linked to at least 13 deaths, is $1.3 billion IT transformation  General Motors (GM) is in the process of constructing a single global information warehouse that will become the foundation for all business analytics and decision support across the enterprise.  The business objectives are – to reduce operating expenses for data warehouses by 25-30% – deliver 10x the measurable business value – double IT project capability – and integrate existing data warehouse systems with new technology to offload data, reduce latency, reduce costs and improve performance. 6 © 2014 IBM Corporation
  • 8. A New Era of Smart Abstract: IBM delivers a significant data warehouse modernization programme to General Motors  High performance and continuously available data management environment  329,000 users with a 10% concurrency rate  Grow to approximately one petabyte in size over three years  IBM proposed a centralized enterprise architecture leveraging a traditional data warehouse matched with IBM Big Insights Hadoop technology for big data analytics Data types: • Inventory control of parts • Manufacturing equipment and assembly line data • Warranty and services data from dealers • Telemetry data from vehicles • Customer services and social media data 7 © 2014 IBM Corporation
  • 9. A New Era of Smart Big data = disparate data! 8 © 2014 IBM Corporation
  • 10. A New Era of Smart Probabilistic matching 9 © 2014 IBM Corporation
  • 11. A New Era of Smart Analyst validation of key IBM technologies.... ....and Analyst validation of IBM potential future phase technologies.... 10 © 2014 IBM Corporation
  • 12. A New Era of Smart Capabilities required 1. Efficient data protocols 2. Real time analytics 3. Capture all data to a landing zone 4. High performance analytics platform 5. Application Platform 6. Data governance & integration 11 © 2014 IBM Corporation
  • 13. A New Era of Smart IBM conclusions applicable to enterprise data warehouse modernisation learned from work with GM  GM’s objectives published in WSJ: –Reduce operating expenses for data warehouses by 25-30% –Deliver 10x the measurable business value –Double IT project capability –Integrate existing data warehouse systems with new technology to offload data, reduce latency, reduce costs and improve performance  Educate enterprise architecture staff on the new disruptive technology capabilities delivered from Hadoop, and its integration into existing data architectures  Engage in a revue of data warehouse architecture to harness the cost savings, speed advantages and business focus that new technologies such as IBM BigInsights for Hadoop brings to market  Experiment with new data asset/business objective combinations with IBM’s Big Data experts 12 © 2014 IBM Corporation
  • 14. A New Era of Smart Case study illustrates the hidden value of machine data For a volume production car fleet using telematics data, IBM correctly predicted warranty codes/claims 43 days out with 86% accuracy and only 1% false positives For a new electric car model, IBM predicted warranty claims 60 days out with 96% accuracy and 1% false positives on 619 vehicles using Big Data and analytics technology 13 © 2014 IBM Corporation
  • 15. A New Era of Smart Broad range of role based visualisation and analytical capabilities Data scientist Business person R Script support SPSS Predictive analytics Integration with Cognos BI Spreadsheet visualisation of data 360 degree view of machine/client/location Delivered using the BigInsights wizard to accelerate the use of Hadoop data 14 © 2014 IBM Corporation
  • 16. A New Era of Smart BigInsights visualisation examples 1 –Hadoop raw data in spreadsheet format 2 - Sort, filter etc. 15 © 2014 IBM Corporation
  • 17. A New Era of Smart ...and onto many kinds of analytical capability using Cognos 16 © 2014 IBM Corporation
  • 18. A New Era of Smart How is Big Data Being Used in Industry? Product Development & Engineering  Patent Analytics  Social Media Analytics  Quality & Warranty Analytics Manufacturing & Quality  Predictive Maintenance  Quality Early Warning  Operational Efficiency  Supply Chain Analytics Connected Vehicle  V2V, V2C, V2I, V2X  Automotive Telematics  Real-time Alerts  Geospatial Analytics  Text Analytics  Predictive Failsafe Analytics  Driver Specific Predictive Analytics  Condition Based Monitoring Marketing & Sales  Social Media Analytics  Text Analytics  Customer 360  Actionable Customer Intelligence  Next Best Action After Sales  Fraud Analytics  Regulatory Analytics  Social Media Analytics  Warranty Analytics  Customer 360  Next Best Action 17 © 2014 IBM Corporation
  • 19. A New Era of Smart Experiment with Hadoop 18 © 2014 IBM Corporation
  • 20. A New Era of Smart Suggested next action: Create an inventory of your structured and unstructured data and assess where business value will be found Internal External Complaints Telematic Data Workflow Data Geolocation Txn History Data Machine failure data Financial Data Shipping schedule data Warranty History Payment History Machine Performance Call Data Recordings Google Alerts Social Media Maintenance records Activities Data Catastrophic Emails Web Traffic Data Structured Unstructured MDM Customer Data Relatively easy to acquire Difficult to acquire 19 © 2014 IBM Corporation
  • 21. A New Era of Smart 20 20 © 2014 IBM Corporation
  • 22. 17TH ~ 18th NOV 2014 MADRID (SPAIN)