SlideShare ist ein Scribd-Unternehmen logo
1 von 13
Downloaden Sie, um offline zu lesen
© 2014 IBM Corporation 
The sensor data challenge 
Innovations (not only) for the Internet of Things 
Big Data Meetup Berlin – 2014-10-23 
Stephan Reimann – IT Specialist Big Data – stephan.reimann@de.ibm.com 
@stereimann de.linkedin.com/in/stephanreimann/
© 2014 IBM Corporation 
Sensor data present an enormous business opportunity across industries 
Manufacturing 
IBM 
uses automated quality testing data for detection of anomalies in the semiconductor wafer manufacturing process to minimize wafer loss 
Source and more details 
Connected Car 
PSA Peugot Citroën 
uses IBM Big Data technologies including InfoSphere BigInsights as the basis of their con- nected services initiative to bring additional services to vehicle owners 
Source 
Industrial / Transport 
Pratt & Wittney 
“reduce maintenance costs by up to 20 percent” 
“... less disruptions, and remo- vals, and when the engine is in the shop, targeted repairs so the engine can come out of the shop quickly” 
Source 
Industrie 4.0 
Energy & Utilities 
Connected Car 
Healthcare 
... It improves efficiency & quality 
and enables new business models 
... and there are 
many more 
oppotunities 
Your fitness devices are also part of it! 
2
© 2014 IBM Corporation 
Making sense means ... 
–detecting hidden correlations 
–predicting future behavior predictive maintenance 
–detecting outliers 
It is not about having the data, it is about using analytics to make sense of it and creating value 
The hard thing of making sense is doing it, because ... 
–the topic is relatively new, there are not so much out-of-the-box solutions, so you probably have to create your own solution 
–it is a great opportunity to be innovative and gain competitive advantage 
–creating your own solution will typically require using tools such as the Hadoop framework, R, probably something for data preparation and reporting, ... 
–this often means heavy programming, ... Think of available skills and time to market 
3
© 2014 IBM Corporation 
4 
Analyzing large historical sensor data sets require flexible and easy to use tools 
Innovation #1: SQL on Hadoop
© 2014 IBM Corporation 
Sensor is very special structured data 
•A lot of different sources 
•Structure differs between sources, e.g. number of attributes, value encodings, ... And is usually evolving 
•(very) high volume 
Use SQL on Hadoop -> Big SQL 
–widely used, leverage existing skills 
–Declarative: what you want vs. how to get it 
–Use your existing tools 
Sensor data usually requires flexible schemas Analytics on sensor data isn‘t special: it should be as simple as always 
Source B 
Source B 
Source A 
Databases are not the primary choice, due to the flexible schema Hadoop is pretty well suited to analyze sensor data in its raw format 
But databases have SQL, which offers a very easy way to prepare and analyze structured data 
5
© 2014 IBM Corporation 
Big SQL combines the simplicity of SQL with the flexibility of Hadoop 
Big SQL is an IBM innovation that provides rich, robust, standards-based SQL support for data stored in InfoSphere BigInsights (IBM’s Hadoop distribution) 
–Full support for subqueries 
–OLAP operations, grouping sets, analytic aggregates, ... 
–All standard join operators (get value from combining data) 
–Use your existing queries and tools 
No propriety storage format 
–Never need to copy data to a proprietary representation 
–It is not a database, it is running in Hadoop, on standard data formats 
Big SQL = easy to do SQL combined with the flexiblity of Hadoop (like schema-on-read) 
InfoSphere BigInsights 
Big SQL 
SQL MPP Runtime 
Data Sources 
Parquet 
CSV 
Seq 
RC 
Avro 
ORC 
JSON 
Custom 
SQL-based Application 
6
© 2014 IBM Corporation 
Big SQL is architected for performance 
InfoSphere BigInsights 
Big SQL 
SQL MPP Runtime 
Data Sources 
Parquet 
CSV 
Seq 
RC 
Avro 
ORC 
JSON 
Custom 
SQL-based Application 
Uses its own engine, replace MapReduce with a modern MPP architecture 
–Compiler and runtime are native code (not java) 
–Big SQL worker daemons live directly on cluster 
–Continuously running (no startup latency) 
–Processing happens locally at the data 
Architected from the ground up for performance 
–low latency and high throughput 
–Comprehensive query rewrite and optimization (cost based optimizer) 
Operations occur in memory with the ability to spill to disk 
–Supports aggregations and sorts larger than available RAM 
7
© 2014 IBM Corporation 
A free QuickStart Editon, labs, tutorials and a developer community provides a fast start with Big SQL 
Hadoop Dev: links to videos, white paper, labs, . . . . 
https://developer.ibm.com/hadoop/ 
8 
(for the direct links, click on the pictures) 
How-to 
Cloud
© 2014 IBM Corporation 
Innovation #2: Streaming Analytics 
9 
Data don‘t have to be stored to be analyzed 
Streaming analytics is the key enabler for real time use cases with sensor data
© 2014 IBM Corporation 
Traditional approach 
– Historical fact finding 
– Analyze persisted data 
– (Micro-) Batch philosophy 
– PULL approach 
Streaming analytics 
– Analyze the current moment / the now 
– Analyze data directly “in Motion” – without 
storing it 
– Analyze data at the speed it is created 
– PUSH approach 
Data don‘t need to be persisted to be analyzed, streaming analytics 
represents a paradigm shift to enable real time use cases 
Data Repository Analysis Insight Data Analysis Insight 
10
© 2014 IBM Corporation 
InfoSphere Streams is the result of an IBM research project, designed for high-throughput, low latency and to make streaming analytics easy 
Scale out 
Millions of Events per Second 
Complex Data & Analytics 
All kinds of data 
Complex analytics: Everything you can express via an algorithm 
Low Latency 
Analyzes data at the speed it is created 
Latencies down to μs 
Immediate action in real time 
+ 
+ 
InfoSphere Streams 
Capabilities 
How it works 
–Define apps as flow graphs consisting of sources (inputs), operators & sinks (outputs) 
–Extend the functionality with your code if required for full flexibility 
–The clustered, distributed runtime on commodity HW scales nearly limitless 
–GUIs for rapid development and operations make streaming analytics easy 
11
© 2014 IBM Corporation 
Free Quickstart Edition 
Developer Community 
Streaming analytics is about analyzing all the data, continously, just in time, it enables a completely new generation of big data apps 
www.ibmdw.net/streamsdev/ 
ibm.co/streamsqs 
Stop just dreaming of real time big data 
Start with streaming analytics!!! 
+ 
Radio astronomy 
Healthcare 
TelCo 
Transport 
Smart Grid 
IoT 
Streaming Analytics is already reality 
... and is a key component of many innovations 
... 
Tutorials, 
Labs, 
Forum, ... 
Connected Car 
GitHub Community 
github.com/IBMStreams 
+ 
Toolkits, 
Toolkits, 
Toolkits 
12
© 2014 IBM Corporation 
Where technology meets business potential: Start making sense of your sensor data, everything is prepared! 
Big SQL 
Easy to do SQL analytics combined with fully flexible schema-on-read and Hadoop capabilities 
InfoSphere Streams 
Analyzes data at the speed it is created with maximum simplicity and minimum latency 
Many more, such as 
•Time series functionality 
•Efficient transport protocols 
•Cloud services (Bluemix) 
Gain value from your data 
13 
technology 
Innovations 
make it easy 
There are many opportu- nities to gain value from (not only) sensor data. Let‘s talk how to make sense of your data! 
http://www-05.ibm.com/de/events/workshop/bigdata/

Weitere ähnliche Inhalte

Was ist angesagt?

10 Good Reasons: NetApp for Machine Learning
10 Good Reasons: NetApp for Machine Learning10 Good Reasons: NetApp for Machine Learning
10 Good Reasons: NetApp for Machine LearningNetApp
 
IBM Cloud Paris meetup 20180213 - Data Science eXperience @scale
IBM Cloud Paris meetup   20180213 - Data Science eXperience @scaleIBM Cloud Paris meetup   20180213 - Data Science eXperience @scale
IBM Cloud Paris meetup 20180213 - Data Science eXperience @scaleIBM France Lab
 
NetApp HCI. Enterprise-Scale
NetApp HCI. Enterprise-ScaleNetApp HCI. Enterprise-Scale
NetApp HCI. Enterprise-ScaleNetApp
 
IBM Cloud Paris Meetup 20180213 - Data Science eXperience et Bigdata
IBM Cloud Paris Meetup   20180213 - Data Science eXperience et BigdataIBM Cloud Paris Meetup   20180213 - Data Science eXperience et Bigdata
IBM Cloud Paris Meetup 20180213 - Data Science eXperience et BigdataIBM France Lab
 
Better Business in a Flash
Better Business in a FlashBetter Business in a Flash
Better Business in a FlashNetApp
 
Analytics for Autonomous Driving with ROS
Analytics for Autonomous Driving with ROSAnalytics for Autonomous Driving with ROS
Analytics for Autonomous Driving with ROSJan Wiegelmann
 
Reducing the Total Cost of Ownership of Big Data- Impetus White Paper
Reducing the Total Cost of Ownership of Big Data- Impetus White PaperReducing the Total Cost of Ownership of Big Data- Impetus White Paper
Reducing the Total Cost of Ownership of Big Data- Impetus White PaperImpetus Technologies
 
What is BI on Cloud
What is BI on CloudWhat is BI on Cloud
What is BI on Cloudtdwiindia
 
How Market Intelligence From Hadoop on Azure Shows Trucking Companies a Clear...
How Market Intelligence From Hadoop on Azure Shows Trucking Companies a Clear...How Market Intelligence From Hadoop on Azure Shows Trucking Companies a Clear...
How Market Intelligence From Hadoop on Azure Shows Trucking Companies a Clear...DataWorks Summit
 
IBM Power Systems Update 1Q17
IBM Power Systems Update 1Q17IBM Power Systems Update 1Q17
IBM Power Systems Update 1Q17David Spurway
 
Trends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsTrends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsIgor José F. Freitas
 
IBM Cognos Business Intelligence using dashDB
IBM Cognos Business Intelligence using dashDBIBM Cognos Business Intelligence using dashDB
IBM Cognos Business Intelligence using dashDBIBM Cloud Data Services
 
Downsizing Data Centers by NetApp IT
Downsizing Data Centers by NetApp ITDownsizing Data Centers by NetApp IT
Downsizing Data Centers by NetApp ITNetApp
 
IBM Cloud Paris meetup 20180213 - Hortonworks
IBM Cloud Paris meetup   20180213 - HortonworksIBM Cloud Paris meetup   20180213 - Hortonworks
IBM Cloud Paris meetup 20180213 - HortonworksIBM France Lab
 
Beyond Big Data: Data Science and AI
Beyond Big Data: Data Science and AIBeyond Big Data: Data Science and AI
Beyond Big Data: Data Science and AIDataWorks Summit
 
10 Good Reasons: NetApp HCI
10 Good Reasons: NetApp HCI10 Good Reasons: NetApp HCI
10 Good Reasons: NetApp HCINetApp
 

Was ist angesagt? (20)

10 Good Reasons: NetApp for Machine Learning
10 Good Reasons: NetApp for Machine Learning10 Good Reasons: NetApp for Machine Learning
10 Good Reasons: NetApp for Machine Learning
 
IBM Cloud Paris meetup 20180213 - Data Science eXperience @scale
IBM Cloud Paris meetup   20180213 - Data Science eXperience @scaleIBM Cloud Paris meetup   20180213 - Data Science eXperience @scale
IBM Cloud Paris meetup 20180213 - Data Science eXperience @scale
 
NetApp HCI. Enterprise-Scale
NetApp HCI. Enterprise-ScaleNetApp HCI. Enterprise-Scale
NetApp HCI. Enterprise-Scale
 
IBM Cloud Paris Meetup 20180213 - Data Science eXperience et Bigdata
IBM Cloud Paris Meetup   20180213 - Data Science eXperience et BigdataIBM Cloud Paris Meetup   20180213 - Data Science eXperience et Bigdata
IBM Cloud Paris Meetup 20180213 - Data Science eXperience et Bigdata
 
Better Business in a Flash
Better Business in a FlashBetter Business in a Flash
Better Business in a Flash
 
Analytics for Autonomous Driving with ROS
Analytics for Autonomous Driving with ROSAnalytics for Autonomous Driving with ROS
Analytics for Autonomous Driving with ROS
 
Reducing the Total Cost of Ownership of Big Data- Impetus White Paper
Reducing the Total Cost of Ownership of Big Data- Impetus White PaperReducing the Total Cost of Ownership of Big Data- Impetus White Paper
Reducing the Total Cost of Ownership of Big Data- Impetus White Paper
 
What is BI on Cloud
What is BI on CloudWhat is BI on Cloud
What is BI on Cloud
 
How Market Intelligence From Hadoop on Azure Shows Trucking Companies a Clear...
How Market Intelligence From Hadoop on Azure Shows Trucking Companies a Clear...How Market Intelligence From Hadoop on Azure Shows Trucking Companies a Clear...
How Market Intelligence From Hadoop on Azure Shows Trucking Companies a Clear...
 
Big Data Analytics, Dave Shuttleworth - 22-9-15
Big Data Analytics, Dave Shuttleworth - 22-9-15Big Data Analytics, Dave Shuttleworth - 22-9-15
Big Data Analytics, Dave Shuttleworth - 22-9-15
 
IBM Power Systems Update 1Q17
IBM Power Systems Update 1Q17IBM Power Systems Update 1Q17
IBM Power Systems Update 1Q17
 
Trends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsTrends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systems
 
IBM Cognos Business Intelligence using dashDB
IBM Cognos Business Intelligence using dashDBIBM Cognos Business Intelligence using dashDB
IBM Cognos Business Intelligence using dashDB
 
Downsizing Data Centers by NetApp IT
Downsizing Data Centers by NetApp ITDownsizing Data Centers by NetApp IT
Downsizing Data Centers by NetApp IT
 
IBM Cloud Paris meetup 20180213 - Hortonworks
IBM Cloud Paris meetup   20180213 - HortonworksIBM Cloud Paris meetup   20180213 - Hortonworks
IBM Cloud Paris meetup 20180213 - Hortonworks
 
Beyond Big Data: Data Science and AI
Beyond Big Data: Data Science and AIBeyond Big Data: Data Science and AI
Beyond Big Data: Data Science and AI
 
Higher ROI-N
Higher ROI-NHigher ROI-N
Higher ROI-N
 
10 Good Reasons: NetApp HCI
10 Good Reasons: NetApp HCI10 Good Reasons: NetApp HCI
10 Good Reasons: NetApp HCI
 
Big Data Building Blocks with AWS Cloud
Big Data Building Blocks with AWS CloudBig Data Building Blocks with AWS Cloud
Big Data Building Blocks with AWS Cloud
 
The Ecosystem is too damn big
The Ecosystem is too damn big The Ecosystem is too damn big
The Ecosystem is too damn big
 

Andere mochten auch

Marvin, Data Science & Spark – haben wir ohne Mathematik und Technik noch ein...
Marvin, Data Science & Spark – haben wir ohne Mathematik und Technik noch ein...Marvin, Data Science & Spark – haben wir ohne Mathematik und Technik noch ein...
Marvin, Data Science & Spark – haben wir ohne Mathematik und Technik noch ein...Stephan Reimann
 
In-Time Sensor Data Analysis and Pattern Detection
In-Time Sensor Data Analysis and Pattern DetectionIn-Time Sensor Data Analysis and Pattern Detection
In-Time Sensor Data Analysis and Pattern DetectionJordan Barrette
 
Metrics @ App Academy
Metrics @ App AcademyMetrics @ App Academy
Metrics @ App AcademyNiko Vuokko
 
Opportunities in Sensor Networks and Big Data in 2014 (for NIKKEI Big Data Co...
Opportunities in Sensor Networks and Big Data in 2014 (for NIKKEI Big Data Co...Opportunities in Sensor Networks and Big Data in 2014 (for NIKKEI Big Data Co...
Opportunities in Sensor Networks and Big Data in 2014 (for NIKKEI Big Data Co...Rainer Sternfeld
 
Sensor Data in Business
Sensor Data in BusinessSensor Data in Business
Sensor Data in BusinessNiko Vuokko
 
IoT ( M2M) - Big Data - Analytics: Emulation and Demonstration
IoT ( M2M) - Big Data - Analytics: Emulation and DemonstrationIoT ( M2M) - Big Data - Analytics: Emulation and Demonstration
IoT ( M2M) - Big Data - Analytics: Emulation and DemonstrationCHAKER ALLAOUI
 
Introduction to Streaming Analytics
Introduction to Streaming AnalyticsIntroduction to Streaming Analytics
Introduction to Streaming AnalyticsGuido Schmutz
 
Real Time Data Processing using Spark Streaming | Data Day Texas 2015
Real Time Data Processing using Spark Streaming | Data Day Texas 2015Real Time Data Processing using Spark Streaming | Data Day Texas 2015
Real Time Data Processing using Spark Streaming | Data Day Texas 2015Cloudera, Inc.
 
Tiny Sensors, Big Data
Tiny Sensors, Big DataTiny Sensors, Big Data
Tiny Sensors, Big DataJake Galbreath
 

Andere mochten auch (10)

Marvin, Data Science & Spark – haben wir ohne Mathematik und Technik noch ein...
Marvin, Data Science & Spark – haben wir ohne Mathematik und Technik noch ein...Marvin, Data Science & Spark – haben wir ohne Mathematik und Technik noch ein...
Marvin, Data Science & Spark – haben wir ohne Mathematik und Technik noch ein...
 
In-Time Sensor Data Analysis and Pattern Detection
In-Time Sensor Data Analysis and Pattern DetectionIn-Time Sensor Data Analysis and Pattern Detection
In-Time Sensor Data Analysis and Pattern Detection
 
Metrics @ App Academy
Metrics @ App AcademyMetrics @ App Academy
Metrics @ App Academy
 
Opportunities in Sensor Networks and Big Data in 2014 (for NIKKEI Big Data Co...
Opportunities in Sensor Networks and Big Data in 2014 (for NIKKEI Big Data Co...Opportunities in Sensor Networks and Big Data in 2014 (for NIKKEI Big Data Co...
Opportunities in Sensor Networks and Big Data in 2014 (for NIKKEI Big Data Co...
 
Sensor Data in Business
Sensor Data in BusinessSensor Data in Business
Sensor Data in Business
 
IoT ( M2M) - Big Data - Analytics: Emulation and Demonstration
IoT ( M2M) - Big Data - Analytics: Emulation and DemonstrationIoT ( M2M) - Big Data - Analytics: Emulation and Demonstration
IoT ( M2M) - Big Data - Analytics: Emulation and Demonstration
 
Introduction to Streaming Analytics
Introduction to Streaming AnalyticsIntroduction to Streaming Analytics
Introduction to Streaming Analytics
 
Real Time Data Processing using Spark Streaming | Data Day Texas 2015
Real Time Data Processing using Spark Streaming | Data Day Texas 2015Real Time Data Processing using Spark Streaming | Data Day Texas 2015
Real Time Data Processing using Spark Streaming | Data Day Texas 2015
 
Spark+flume seattle
Spark+flume seattleSpark+flume seattle
Spark+flume seattle
 
Tiny Sensors, Big Data
Tiny Sensors, Big DataTiny Sensors, Big Data
Tiny Sensors, Big Data
 

Ähnlich wie The sensor data challenge - Innovations (not only) for the Internet of Things

IBM Smarter Analytics
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter AnalyticsAdrian Turcu
 
Get Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a ServiceGet Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a ServiceIBM Cloud Data Services
 
Analyzing Big Data - Jeff Scheel
Analyzing Big Data - Jeff ScheelAnalyzing Big Data - Jeff Scheel
Analyzing Big Data - Jeff ScheelKangaroot
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixNicolas Morales
 
InfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experienceInfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experienceWilfried Hoge
 
Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise DataWorks Summit
 
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...Cynthia Saracco
 
Enterprise analytics journey from Helene Lyon
Enterprise analytics journey from Helene LyonEnterprise analytics journey from Helene Lyon
Enterprise analytics journey from Helene LyonHelene Lyon
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data ArchitectureGuido Schmutz
 
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Pactera_US
 
How to Increase Performance in IBM Cognos
How to Increase Performance in IBM CognosHow to Increase Performance in IBM Cognos
How to Increase Performance in IBM CognosCresco International
 
Hadoop and the Future of SQL: Using BI Tools with Big Data
Hadoop and the Future of SQL: Using BI Tools with Big DataHadoop and the Future of SQL: Using BI Tools with Big Data
Hadoop and the Future of SQL: Using BI Tools with Big DataSenturus
 
Software Defined Infrastructure
Software Defined InfrastructureSoftware Defined Infrastructure
Software Defined Infrastructureinside-BigData.com
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduCloudera, Inc.
 
OC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMOC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMBig Data Joe™ Rossi
 
SD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMSD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMBig Data Joe™ Rossi
 
2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...
2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...
2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...Anand Haridass
 
2014.07.11 biginsights data2014
2014.07.11 biginsights data20142014.07.11 biginsights data2014
2014.07.11 biginsights data2014Wilfried Hoge
 
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platformPivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platformEMC
 

Ähnlich wie The sensor data challenge - Innovations (not only) for the Internet of Things (20)

IBM Smarter Analytics
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter Analytics
 
Get Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a ServiceGet Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a Service
 
Analyzing Big Data - Jeff Scheel
Analyzing Big Data - Jeff ScheelAnalyzing Big Data - Jeff Scheel
Analyzing Big Data - Jeff Scheel
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with Bluemix
 
InfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experienceInfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experience
 
Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise
 
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
 
Enterprise analytics journey from Helene Lyon
Enterprise analytics journey from Helene LyonEnterprise analytics journey from Helene Lyon
Enterprise analytics journey from Helene Lyon
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data Architecture
 
Infrastructure Matters
Infrastructure MattersInfrastructure Matters
Infrastructure Matters
 
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data
 
How to Increase Performance in IBM Cognos
How to Increase Performance in IBM CognosHow to Increase Performance in IBM Cognos
How to Increase Performance in IBM Cognos
 
Hadoop and the Future of SQL: Using BI Tools with Big Data
Hadoop and the Future of SQL: Using BI Tools with Big DataHadoop and the Future of SQL: Using BI Tools with Big Data
Hadoop and the Future of SQL: Using BI Tools with Big Data
 
Software Defined Infrastructure
Software Defined InfrastructureSoftware Defined Infrastructure
Software Defined Infrastructure
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache Kudu
 
OC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMOC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBM
 
SD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMSD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBM
 
2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...
2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...
2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...
 
2014.07.11 biginsights data2014
2014.07.11 biginsights data20142014.07.11 biginsights data2014
2014.07.11 biginsights data2014
 
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platformPivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
 

Kürzlich hochgeladen

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 

Kürzlich hochgeladen (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

The sensor data challenge - Innovations (not only) for the Internet of Things

  • 1. © 2014 IBM Corporation The sensor data challenge Innovations (not only) for the Internet of Things Big Data Meetup Berlin – 2014-10-23 Stephan Reimann – IT Specialist Big Data – stephan.reimann@de.ibm.com @stereimann de.linkedin.com/in/stephanreimann/
  • 2. © 2014 IBM Corporation Sensor data present an enormous business opportunity across industries Manufacturing IBM uses automated quality testing data for detection of anomalies in the semiconductor wafer manufacturing process to minimize wafer loss Source and more details Connected Car PSA Peugot Citroën uses IBM Big Data technologies including InfoSphere BigInsights as the basis of their con- nected services initiative to bring additional services to vehicle owners Source Industrial / Transport Pratt & Wittney “reduce maintenance costs by up to 20 percent” “... less disruptions, and remo- vals, and when the engine is in the shop, targeted repairs so the engine can come out of the shop quickly” Source Industrie 4.0 Energy & Utilities Connected Car Healthcare ... It improves efficiency & quality and enables new business models ... and there are many more oppotunities Your fitness devices are also part of it! 2
  • 3. © 2014 IBM Corporation Making sense means ... –detecting hidden correlations –predicting future behavior predictive maintenance –detecting outliers It is not about having the data, it is about using analytics to make sense of it and creating value The hard thing of making sense is doing it, because ... –the topic is relatively new, there are not so much out-of-the-box solutions, so you probably have to create your own solution –it is a great opportunity to be innovative and gain competitive advantage –creating your own solution will typically require using tools such as the Hadoop framework, R, probably something for data preparation and reporting, ... –this often means heavy programming, ... Think of available skills and time to market 3
  • 4. © 2014 IBM Corporation 4 Analyzing large historical sensor data sets require flexible and easy to use tools Innovation #1: SQL on Hadoop
  • 5. © 2014 IBM Corporation Sensor is very special structured data •A lot of different sources •Structure differs between sources, e.g. number of attributes, value encodings, ... And is usually evolving •(very) high volume Use SQL on Hadoop -> Big SQL –widely used, leverage existing skills –Declarative: what you want vs. how to get it –Use your existing tools Sensor data usually requires flexible schemas Analytics on sensor data isn‘t special: it should be as simple as always Source B Source B Source A Databases are not the primary choice, due to the flexible schema Hadoop is pretty well suited to analyze sensor data in its raw format But databases have SQL, which offers a very easy way to prepare and analyze structured data 5
  • 6. © 2014 IBM Corporation Big SQL combines the simplicity of SQL with the flexibility of Hadoop Big SQL is an IBM innovation that provides rich, robust, standards-based SQL support for data stored in InfoSphere BigInsights (IBM’s Hadoop distribution) –Full support for subqueries –OLAP operations, grouping sets, analytic aggregates, ... –All standard join operators (get value from combining data) –Use your existing queries and tools No propriety storage format –Never need to copy data to a proprietary representation –It is not a database, it is running in Hadoop, on standard data formats Big SQL = easy to do SQL combined with the flexiblity of Hadoop (like schema-on-read) InfoSphere BigInsights Big SQL SQL MPP Runtime Data Sources Parquet CSV Seq RC Avro ORC JSON Custom SQL-based Application 6
  • 7. © 2014 IBM Corporation Big SQL is architected for performance InfoSphere BigInsights Big SQL SQL MPP Runtime Data Sources Parquet CSV Seq RC Avro ORC JSON Custom SQL-based Application Uses its own engine, replace MapReduce with a modern MPP architecture –Compiler and runtime are native code (not java) –Big SQL worker daemons live directly on cluster –Continuously running (no startup latency) –Processing happens locally at the data Architected from the ground up for performance –low latency and high throughput –Comprehensive query rewrite and optimization (cost based optimizer) Operations occur in memory with the ability to spill to disk –Supports aggregations and sorts larger than available RAM 7
  • 8. © 2014 IBM Corporation A free QuickStart Editon, labs, tutorials and a developer community provides a fast start with Big SQL Hadoop Dev: links to videos, white paper, labs, . . . . https://developer.ibm.com/hadoop/ 8 (for the direct links, click on the pictures) How-to Cloud
  • 9. © 2014 IBM Corporation Innovation #2: Streaming Analytics 9 Data don‘t have to be stored to be analyzed Streaming analytics is the key enabler for real time use cases with sensor data
  • 10. © 2014 IBM Corporation Traditional approach – Historical fact finding – Analyze persisted data – (Micro-) Batch philosophy – PULL approach Streaming analytics – Analyze the current moment / the now – Analyze data directly “in Motion” – without storing it – Analyze data at the speed it is created – PUSH approach Data don‘t need to be persisted to be analyzed, streaming analytics represents a paradigm shift to enable real time use cases Data Repository Analysis Insight Data Analysis Insight 10
  • 11. © 2014 IBM Corporation InfoSphere Streams is the result of an IBM research project, designed for high-throughput, low latency and to make streaming analytics easy Scale out Millions of Events per Second Complex Data & Analytics All kinds of data Complex analytics: Everything you can express via an algorithm Low Latency Analyzes data at the speed it is created Latencies down to μs Immediate action in real time + + InfoSphere Streams Capabilities How it works –Define apps as flow graphs consisting of sources (inputs), operators & sinks (outputs) –Extend the functionality with your code if required for full flexibility –The clustered, distributed runtime on commodity HW scales nearly limitless –GUIs for rapid development and operations make streaming analytics easy 11
  • 12. © 2014 IBM Corporation Free Quickstart Edition Developer Community Streaming analytics is about analyzing all the data, continously, just in time, it enables a completely new generation of big data apps www.ibmdw.net/streamsdev/ ibm.co/streamsqs Stop just dreaming of real time big data Start with streaming analytics!!! + Radio astronomy Healthcare TelCo Transport Smart Grid IoT Streaming Analytics is already reality ... and is a key component of many innovations ... Tutorials, Labs, Forum, ... Connected Car GitHub Community github.com/IBMStreams + Toolkits, Toolkits, Toolkits 12
  • 13. © 2014 IBM Corporation Where technology meets business potential: Start making sense of your sensor data, everything is prepared! Big SQL Easy to do SQL analytics combined with fully flexible schema-on-read and Hadoop capabilities InfoSphere Streams Analyzes data at the speed it is created with maximum simplicity and minimum latency Many more, such as •Time series functionality •Efficient transport protocols •Cloud services (Bluemix) Gain value from your data 13 technology Innovations make it easy There are many opportu- nities to gain value from (not only) sensor data. Let‘s talk how to make sense of your data! http://www-05.ibm.com/de/events/workshop/bigdata/