Suche senden
Hochladen
InfoSphere BigInsights
•
5 gefällt mir
•
10,338 views
Wilfried Hoge
Folgen
Presentation about InfoSphere BigInsights from IM Forum 2013 in Berlin
Weniger lesen
Mehr lesen
Technologie
Melden
Teilen
Melden
Teilen
1 von 28
Jetzt herunterladen
Downloaden Sie, um offline zu lesen
Empfohlen
2.3 bayesian classification
2.3 bayesian classification
Krish_ver2
Big Data: Technical Introduction to BigSheets for InfoSphere BigInsights
Big Data: Technical Introduction to BigSheets for InfoSphere BigInsights
Cynthia Saracco
Big data unit i
Big data unit i
Navjot Kaur
Learning With Complete Data
Learning With Complete Data
Vishnuprabhu Gopalakrishnan
Physical organization of parallel platforms
Physical organization of parallel platforms
Syed Zaid Irshad
Introduction to Distributed System
Introduction to Distributed System
Sunita Sahu
Communication costs in parallel machines
Communication costs in parallel machines
Syed Zaid Irshad
Distributed information system
Distributed information system
District Administration
Empfohlen
2.3 bayesian classification
2.3 bayesian classification
Krish_ver2
Big Data: Technical Introduction to BigSheets for InfoSphere BigInsights
Big Data: Technical Introduction to BigSheets for InfoSphere BigInsights
Cynthia Saracco
Big data unit i
Big data unit i
Navjot Kaur
Learning With Complete Data
Learning With Complete Data
Vishnuprabhu Gopalakrishnan
Physical organization of parallel platforms
Physical organization of parallel platforms
Syed Zaid Irshad
Introduction to Distributed System
Introduction to Distributed System
Sunita Sahu
Communication costs in parallel machines
Communication costs in parallel machines
Syed Zaid Irshad
Distributed information system
Distributed information system
District Administration
Distributed system architecture
Distributed system architecture
Yisal Khan
Deductive databases
Deductive databases
Dabbal Singh Mahara
Ensemble learning
Ensemble learning
Haris Jamil
Distribution transparency and Distributed transaction
Distribution transparency and Distributed transaction
shraddha mane
Hadoop Distributed File System
Hadoop Distributed File System
Rutvik Bapat
Distributed file system
Distributed file system
Anamika Singh
Ddb 1.6-design issues
Ddb 1.6-design issues
Esar Qasmi
Mobile Network Layer
Mobile Network Layer
Rahul Hada
Common Standards in Cloud Computing
Common Standards in Cloud Computing
mrzahidfaiz.blogspot.com
Map reduce in BIG DATA
Map reduce in BIG DATA
GauravBiswas9
Chapter 6 synchronization
Chapter 6 synchronization
Alagappa Government Arts College, Karaikudi
Parallel computing
Parallel computing
Vinay Gupta
Design issues of dos
Design issues of dos
vanamali_vanu
Eucalyptus, Nimbus & OpenNebula
Eucalyptus, Nimbus & OpenNebula
Amar Myana
Distributed dbms architectures
Distributed dbms architectures
Pooja Dixit
Routing Protocols in WSN
Routing Protocols in WSN
Darpan Dekivadiya
Deployment Models of Cloud Computing.pptx
Deployment Models of Cloud Computing.pptx
Jaya Silwal
2.4 rule based classification
2.4 rule based classification
Krish_ver2
Parallel Database
Parallel Database
VESIT/University of Mumbai
Big data lecture notes
Big data lecture notes
Mohit Saini
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Cynthia Saracco
Machine Data Analytics
Machine Data Analytics
Nicolas Morales
Weitere ähnliche Inhalte
Was ist angesagt?
Distributed system architecture
Distributed system architecture
Yisal Khan
Deductive databases
Deductive databases
Dabbal Singh Mahara
Ensemble learning
Ensemble learning
Haris Jamil
Distribution transparency and Distributed transaction
Distribution transparency and Distributed transaction
shraddha mane
Hadoop Distributed File System
Hadoop Distributed File System
Rutvik Bapat
Distributed file system
Distributed file system
Anamika Singh
Ddb 1.6-design issues
Ddb 1.6-design issues
Esar Qasmi
Mobile Network Layer
Mobile Network Layer
Rahul Hada
Common Standards in Cloud Computing
Common Standards in Cloud Computing
mrzahidfaiz.blogspot.com
Map reduce in BIG DATA
Map reduce in BIG DATA
GauravBiswas9
Chapter 6 synchronization
Chapter 6 synchronization
Alagappa Government Arts College, Karaikudi
Parallel computing
Parallel computing
Vinay Gupta
Design issues of dos
Design issues of dos
vanamali_vanu
Eucalyptus, Nimbus & OpenNebula
Eucalyptus, Nimbus & OpenNebula
Amar Myana
Distributed dbms architectures
Distributed dbms architectures
Pooja Dixit
Routing Protocols in WSN
Routing Protocols in WSN
Darpan Dekivadiya
Deployment Models of Cloud Computing.pptx
Deployment Models of Cloud Computing.pptx
Jaya Silwal
2.4 rule based classification
2.4 rule based classification
Krish_ver2
Parallel Database
Parallel Database
VESIT/University of Mumbai
Big data lecture notes
Big data lecture notes
Mohit Saini
Was ist angesagt?
(20)
Distributed system architecture
Distributed system architecture
Deductive databases
Deductive databases
Ensemble learning
Ensemble learning
Distribution transparency and Distributed transaction
Distribution transparency and Distributed transaction
Hadoop Distributed File System
Hadoop Distributed File System
Distributed file system
Distributed file system
Ddb 1.6-design issues
Ddb 1.6-design issues
Mobile Network Layer
Mobile Network Layer
Common Standards in Cloud Computing
Common Standards in Cloud Computing
Map reduce in BIG DATA
Map reduce in BIG DATA
Chapter 6 synchronization
Chapter 6 synchronization
Parallel computing
Parallel computing
Design issues of dos
Design issues of dos
Eucalyptus, Nimbus & OpenNebula
Eucalyptus, Nimbus & OpenNebula
Distributed dbms architectures
Distributed dbms architectures
Routing Protocols in WSN
Routing Protocols in WSN
Deployment Models of Cloud Computing.pptx
Deployment Models of Cloud Computing.pptx
2.4 rule based classification
2.4 rule based classification
Parallel Database
Parallel Database
Big data lecture notes
Big data lecture notes
Andere mochten auch
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Cynthia Saracco
Machine Data Analytics
Machine Data Analytics
Nicolas Morales
The Eclipse Modeling Framework and MDA
The Eclipse Modeling Framework and MDA
elliando dias
InfoSphere Streams Technical Overview - Use Cases Big Data - Jerome CHAILLOUX
InfoSphere Streams Technical Overview - Use Cases Big Data - Jerome CHAILLOUX
IBMInfoSphereUGFR
InfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experience
Wilfried Hoge
2014.07.11 biginsights data2014
2014.07.11 biginsights data2014
Wilfried Hoge
The European Conference on Software Architecture (ECSA) 14 - IBM BigData Refe...
The European Conference on Software Architecture (ECSA) 14 - IBM BigData Refe...
Romeo Kienzler
Big SQL 3.0 - Fast and easy SQL on Hadoop
Big SQL 3.0 - Fast and easy SQL on Hadoop
Wilfried Hoge
Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714
Niu Bai
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
SAP Technology
MSP Best Practice: Using Service Blueprints and Strategic IT Roadmaps to Get ...
MSP Best Practice: Using Service Blueprints and Strategic IT Roadmaps to Get ...
Kaseya
Big Data & Analytics Architecture
Big Data & Analytics Architecture
Arvind Sathi
Overview - IBM Big Data Platform
Overview - IBM Big Data Platform
Vikas Manoria
Big Data Architecture
Big Data Architecture
Guido Schmutz
Planning, implementation, monitoring and evaluation of health education progr...
Planning, implementation, monitoring and evaluation of health education progr...
Jimma University
5 Steps To Effective Jad Sessions
5 Steps To Effective Jad Sessions
LizLavaveshkul
Andere mochten auch
(16)
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
Machine Data Analytics
Machine Data Analytics
The Eclipse Modeling Framework and MDA
The Eclipse Modeling Framework and MDA
InfoSphere Streams Technical Overview - Use Cases Big Data - Jerome CHAILLOUX
InfoSphere Streams Technical Overview - Use Cases Big Data - Jerome CHAILLOUX
InfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experience
2014.07.11 biginsights data2014
2014.07.11 biginsights data2014
The European Conference on Software Architecture (ECSA) 14 - IBM BigData Refe...
The European Conference on Software Architecture (ECSA) 14 - IBM BigData Refe...
Big SQL 3.0 - Fast and easy SQL on Hadoop
Big SQL 3.0 - Fast and easy SQL on Hadoop
Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
MSP Best Practice: Using Service Blueprints and Strategic IT Roadmaps to Get ...
MSP Best Practice: Using Service Blueprints and Strategic IT Roadmaps to Get ...
Big Data & Analytics Architecture
Big Data & Analytics Architecture
Overview - IBM Big Data Platform
Overview - IBM Big Data Platform
Big Data Architecture
Big Data Architecture
Planning, implementation, monitoring and evaluation of health education progr...
Planning, implementation, monitoring and evaluation of health education progr...
5 Steps To Effective Jad Sessions
5 Steps To Effective Jad Sessions
Ähnlich wie InfoSphere BigInsights
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
DataStax Academy
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache Kudu
Cloudera, Inc.
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Streamsets Inc.
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
DATAVERSITY
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Cloudera, Inc.
Capgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with Cloudera
Capgemini
Data & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft Platforms
Sonata Software
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
Denodo
SPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDS
Nicolas Georgeault
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
MapR Technologies
SoftWatch Overview_short (1)
SoftWatch Overview_short (1)
Dror Leshem
SoftWatch Overview_short (1)
SoftWatch Overview_short (1)
Moshe Kozlovski
Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar
Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar
Impetus Technologies
25 Best Data Mining Tools in 2022
25 Best Data Mining Tools in 2022
Kavika Roy
Webinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDB
MongoDB
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Cynthia Saracco
CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014
Hortonworks
zData BI & Advanced Analytics Platform + 8 Week Pilot Programs
zData BI & Advanced Analytics Platform + 8 Week Pilot Programs
zData Inc.
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data
Pactera_US
EMC Pivotal overview deck
EMC Pivotal overview deck
mister_moun
Ähnlich wie InfoSphere BigInsights
(20)
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache Kudu
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Capgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with Cloudera
Data & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft Platforms
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
SPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDS
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
SoftWatch Overview_short (1)
SoftWatch Overview_short (1)
SoftWatch Overview_short (1)
SoftWatch Overview_short (1)
Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar
Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar
25 Best Data Mining Tools in 2022
25 Best Data Mining Tools in 2022
Webinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDB
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014
zData BI & Advanced Analytics Platform + 8 Week Pilot Programs
zData BI & Advanced Analytics Platform + 8 Week Pilot Programs
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data
EMC Pivotal overview deck
EMC Pivotal overview deck
Mehr von Wilfried Hoge
Cloud Data Services - from prototyping to scalable analytics on cloud
Cloud Data Services - from prototyping to scalable analytics on cloud
Wilfried Hoge
Is it harder to find a taxi when it is raining?
Is it harder to find a taxi when it is raining?
Wilfried Hoge
innovations born in the cloud - cloud data services from IBM to prototype you...
innovations born in the cloud - cloud data services from IBM to prototype you...
Wilfried Hoge
2015.05.07 watson rp15
2015.05.07 watson rp15
Wilfried Hoge
Twitter analytics in Bluemix
Twitter analytics in Bluemix
Wilfried Hoge
2013.12.12 big data heise webcast
2013.12.12 big data heise webcast
Wilfried Hoge
2012.04.26 big insights streams im forum2
2012.04.26 big insights streams im forum2
Wilfried Hoge
IBM - Big Value from Big Data
IBM - Big Value from Big Data
Wilfried Hoge
Mehr von Wilfried Hoge
(8)
Cloud Data Services - from prototyping to scalable analytics on cloud
Cloud Data Services - from prototyping to scalable analytics on cloud
Is it harder to find a taxi when it is raining?
Is it harder to find a taxi when it is raining?
innovations born in the cloud - cloud data services from IBM to prototype you...
innovations born in the cloud - cloud data services from IBM to prototype you...
2015.05.07 watson rp15
2015.05.07 watson rp15
Twitter analytics in Bluemix
Twitter analytics in Bluemix
2013.12.12 big data heise webcast
2013.12.12 big data heise webcast
2012.04.26 big insights streams im forum2
2012.04.26 big insights streams im forum2
IBM - Big Value from Big Data
IBM - Big Value from Big Data
Kürzlich hochgeladen
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
presentation ICT roal in 21st century education
presentation ICT roal in 21st century education
jfdjdjcjdnsjd
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
Maria Levchenko
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
Andrey Devyatkin
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
The Digital Insurer
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
Khem
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
DianaGray10
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
Anna Loughnan Colquhoun
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Miguel Araújo
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
The Digital Insurer
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
debabhi2
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
Martijn de Jong
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
apidays
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
ThousandEyes
🐬 The future of MySQL is Postgres 🐘
🐬 The future of MySQL is Postgres 🐘
RTylerCroy
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
hans926745
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
wesley chun
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
sammart93
Kürzlich hochgeladen
(20)
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
presentation ICT roal in 21st century education
presentation ICT roal in 21st century education
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
🐬 The future of MySQL is Postgres 🐘
🐬 The future of MySQL is Postgres 🐘
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
InfoSphere BigInsights
1.
InfoSphere BigInsights Hadoop business
ready Wilfried Hoge IT Architect Big Data
2.
© 2013 International
Business Machines Corporation 2 Getting the Value from Big Data – Why a Platform? § Almost all big data use cases require an integrated set of big data technologies to address the business pain completely § Reduce time and cost and provide quick ROI by leveraging pre-integrated components § Be flexible in the combination of technologies § Start small with a single project and progress to others over your big data journey Accelerators Information Integration & Governance Data Warehouse Stream Computing Hadoop System DiscoveryApplication Development Systems Management Data Media Content Machine Social BIG DATA PLATFORM
3.
© 2013 International
Business Machines Corporation 3 Accelerators Information Integration & Governance Data Warehouse Stream Computing Hadoop System DiscoveryApplication Development Systems Management Data Media Content Machine Social BIG DATA PLATFORM InfoSphere BigInsights is IBM‘s distribution of Hadoop that delivers additional value Accelerators Speed time to value with analytic and application accelerators InfoSphere BigInsights Bringing Hadoop to the enterprise
4.
© 2013 International
Business Machines Corporation 4 New Architecture to Leverage All Data and Analytics Data in Mo)on Data at Rest Data in Many Forms Information Ingestion and Operational Information Decision Management BI and Predictive Analytics Navigation and Discovery Intelligence Analysis Landing Area, Analytics Zone and Archive § Raw Data § Structured Data § Text Analytics § Data Mining § Entity Analytics § Machine Learning Real-time Analytics § Video/Audio § Network/Sensor § Entity Analytics § Predictive Exploration, Integrated Warehouse, and Mart Zones § Discovery § Deep Reflection § Operational § Predictive § Stream Processing § Data Integration § Master Data Streams Information Governance, Security and Business Continuity
5.
© 2013 International
Business Machines Corporation 5 New Architecture to Leverage All Data and Analytics Data in Mo)on Data at Rest Data in Many Forms Information Ingestion and Operational Information Decision Management BI and Predictive Analytics Navigation and Discovery Intelligence Analysis Landing Area, Analytics Zone and Archive § Raw Data § Structured Data § Text Analytics § Data Mining § Entity Analytics § Machine Learning Real-time Analytics § Video/Audio § Network/Sensor § Entity Analytics § Predictive Exploration, Integrated Warehouse, and Mart Zones § Discovery § Deep Reflection § Operational § Predictive § Stream Processing § Data Integration § Master Data Streams Information Governance, Security and Business Continuity • brings Hadoop to the Enterprise • enhances ease of use and consumability • takes the complexity out of getting started with Hadoop • users across the organization can build applications, and get insights at their fingertips without having to learn new skill sets InfoSphere BigInsights
6.
© 2013 International
Business Machines Corporation 6 Tools for Administrators 6 • Monitoring capabilities provide a centralized dashboard view to visualize key performance indicators including CPU, disk, and memory and network usage for the cluster, data services such as HDFS, HBase, Zookeeper and Flume, and application services including MapReduce, Hive, and Oozie • Status information and control over the major cluster capabilities • Advanced capabilities to control application permissions and deployment • Capability to view and control all applications from a single page
7.
© 2013 International
Business Machines Corporation 7 BigSheets to analyze and visualize • Model “big data” collected from various sources in spreadsheet-like structures • Filter and enrich content with built-in functions • Combine data in different workbooks • Visualize results through spreadsheets, charts • Export data into common formats (if desired) No programming knowledge needed!
8.
© 2013 International
Business Machines Corporation 8 8 A centralized dashboard to visualize analytic results: • BigSheets collections • Analytic application results • Monitoring metrics • Ability to view BigSheets data flows between and across data sets to quickly navigate and relate analysis and charts • Visualize inner outer joins, enhanced filters for BigSheets columns, column data-type mapping for collections and application of analytics to BigSheets columns, … etc Centralized dashboard & data flows
9.
© 2013 International
Business Machines Corporation 9 9 Editors • A workflow editor that greatly simplifies the creation of complex Oozie workflows with a consumable interface • A Pig/Jaql Editor with content assist and syntax highlighting that enables users to create and execute new applications using Pig or Jaql in local or cluster mode from the Eclipse IDE Application development & deployment • Enablement of BigSheets macro and BigSheets reader development • Text Analytics development, including support for modular rule sets • Publish new application: BigSheets Macro, BigSheets Reader, AQL module, Jaql module Tools for Developers 1. Sample your Data 2. Develop your application using BigInsights tools 3. Test your application 4. Package and publish your application 5. Deploy your application on the cluster
10.
© 2013 International
Business Machines Corporation 10 Running Applications on Big Data • Browse available applications • Deploy published applications (administrators only) • Launch (or schedule for launch) a deployed application • Monitor job (application) execution status • Predefined applications • Import & Export Data • Database & Files • Web and Social • Analyze and Query • Predictive Analytics • Text Analytics • SQL/Hive, Jaql, Pig, Hbase • Accelerators
11.
© 2013 International
Business Machines Corporation 11 Application linking and interfaces to build new apps 11 • Compose new applications from existing applications and BigSheets • Invoke analytics applications from the web console, including integration within BigSheets • REST data source App that enables users to load data from any data source supporting REST APIs into BigInsights, including popular social media services • Sampling App that enables users to sample data for analysis • Subsetting App that enables users to subset data for data analysis
12.
© 2013 International
Business Machines Corporation 12 Collaborative Big Data for many roles • Business Users can get their hands on big data and use big data applications and BigSheets to get insights into their data § Data scientists can perform deeper analysis and get richer insights § Administrators are empowered to be more agile through better controls and views into key performance indicators § Developers can leverage unified tooling in a Big Data Application Development Lifecycle and are able to create and deploy new types of applications, with enhancements that simplify even complex workflows
13.
© 2013 International
Business Machines Corporation 13 Build-in accelerators • Software components that accelerate development and/or implementation of specific solutions or use cases on top of the Big Data platform • Provide business logic, data processing, and UI/visualization, tailored for a given use case • Bundled with Big Data platform components – InfoSphere BigInsights and InfoSphere Streams • Key Benefits – Time to value – Leverage best practices around implementation of a given use case. • Analytical Accelerators – Text analytics – Geospatial analytics – Machine learning – Time series – Data mining • Application Accelerators – Machine Data Analytics – operational data including logs for operations efficiency – Social Data Analytics – sentiment analytics, Intent to purchase – Telecommunications – CDR streaming analytics deep customer event analytics – Finance Analysis – streaming options, trading, Insurance and banking DW models
14.
© 2013 International
Business Machines Corporation 14 Machine Data Analytics Accelerator What does it do? § Provides the ability to ingest, parse and extract a wide variety of machine data – Faceted search enables easy navigation and discovery – Visualization enables easy analysis of the data Machine Data Analytics Example Application: Facilities Management • Use real time data from building devices such as meters, sensors and motion detectors to monitor and manage power usage Why should you care? § It enables clients to gain insights into operations, customer experience, transactions and behavior, processing machine data in minutes instead of days and weeks § With these insights, clients can: – Proactively plan to increase operational efficiency – Troubleshoot problems and investigate security incidents – Monitor end-to-end infrastructure to avoid service degradation or outages
15.
© 2013 International
Business Machines Corporation 15 Machine Data Analytics Accelerator High-Level Workflow © 2013 IBM Corporation
16.
© 2013 International
Business Machines Corporation 16 Use the Machine Data Analytics Accelerator by starting the predefined applications
17.
© 2013 International
Business Machines Corporation 17 © 2013 IBM Corporation View results of MDA in web, BigSheets and dashboard
18.
© 2013 International
Business Machines Corporation 18 BigInsights Enterprise Edition Connectivity and Integration Streams Netezza Text processing engine and library JDBC Flume Infrastructure Jaql Hive Pig HBase MapReduce HDFS ZooKeeper Indexing Lucene Adaptive MapReduce Oozie Text compression Enhanced security Flexible scheduler Optional IBM and partner offerings Analytics and discovery “Apps” DB2 BigSheets Web Crawler Distrib file copy DB export Boardreader DB import Ad hoc query Machine learning Data processing . . . Administrative and development tools Web console • Monitor cluster health, jobs, etc. • Add / remove nodes • Start / stop services • Inspect job status • Inspect workflow status • Deploy applications • Launch apps / jobs • Work with distrib file system • Work with spreadsheet Interface • Support REST-based API • . . . R Eclipse tools • Text analytics • MapReduce programming • Jaql, Hive, Pig development • BigSheets plug-in development • Oozie workflow generation Integrated installer Open Source IBMIBM Cognos BI GPFS (EAP) Accelerator for machine data analysis Accelerator for social data analysis Guardium DataStageData Explorer Sqoop HCatalog
19.
© 2013 International
Business Machines Corporation 19 BigInsights: Value Beyond Open Source Enterprise Capabilities Administration & Security Workload Optimization Connectors Open source components Advanced Engines Visualization & Exploration Development Tools IBM-certified Apache Hadoop or or … Key differentiators • Built-in analytics • Enterprise software integration • Spreadsheet-style analysis • Integrated installation of supported open source and other components • Web Console for admin and application access • Platform enrichment: additional security, performance features, . . . • World-class support • Full open source compatibility Business benefits • Quicker time-to-value due to IBM technology and support • Reduced operational risk • Enhanced business knowledge with flexible analytical platform • Leverages and complements existing software
20.
© 2013 International
Business Machines Corporation 20 If this were easy, everyone would already be leveraging big data “Big Data offers big business gains but hidden costs and complexity present barriers that most organizations will struggle with” - The Cost of Big Data, Eric Savitz, Forbes 5/2012 § Open source Apache Hadoop for enterprise usage is incomplete § Hadoop skills are in short supply § Custom built solutions lack integrated cluster management § Requires integration effort within the existing analytic ecosystem § Most integrated solutions do not help with archival
21.
© 2013 International
Business Machines Corporation 21 Simplifying Big Data for the Enterprise The new PureData System for Hadoop § Accelerate time to value § Accelerate time to insight § Simplify big data adoption and consumption § Extend the value of the data warehouse § Implement enterprise class big data § Minimize system setup and administration § Available in 2H2013 System for Hadoop
22.
© 2013 International
Business Machines Corporation 22 Accelerate Big Data Time to Value Simplify Big Data Adoption & Consumption Implement Enterprise Class Big Data 1 Based on IBM internal testing and customer feedback. "Custom built clusters" refer to clusters that are not professionally pre- built, pre-tested and optimized. Individual results may vary. 2 Based on current commercially available Big Data appliance product data sheets from large vendors. US ONLY CLAIM. Built-in Expertise Simplified Experience Integration by Design Benefits of IBM PureData System for Hadoop § Deploy 8x faster than custom-built solutions1 § Built-in visualization to accelerate insight § Built-in analytic accelerators2 unlike big data appliances on the market § Single system console for full system administration § Rapid maintenance updates with automation § No assembly required, data load ready in hours § Only integrated Hadoop system with built-in archiving tools2 § Delivered with more robust security than open source software § Architected for high availability
23.
© 2013 International
Business Machines Corporation 23 SQL Access for Hadoop: Why? • Data warehouse augmentation is a leading Hadoop use case • MapReduce is difficult – MapReduce Java API is tedious and requires programming expertise – Unfamiliar languages (ie. Pig) also require special skills • SQL support would open the data to a much wider audience – Familiar, widely known syntax – Common catalog for identifying data and structure – Declarative – clear separation of the what (the data you’re after) vs. the how (processing) Pre-Processing Hub Query-able Archive Exploratory Analysis Information Integration Data Warehouse Streams Real-time processing BigInsights Landing zone for all data Data Warehouse BigInsights Can combine with unstructured information Data Warehouse 1 2 3
24.
© 2013 International
Business Machines Corporation 24 SQL for Hadoop: What’s the Problem? • SQL Access to data in Hadoop is challenging – Data is in many formats • CSV, JSON, Hive RCFile, HBase, ... • Some formats (HBase composite keys) don’t map cleanly to relational models – No schemas or statistics – Hadoop was not designed to be a query engine • Hive (with HiveQL): limited query access for Hadoop – SQL-like, but NOT SQL • Limited data types – no varchar(n), decimal(p,s), etc… • Limited join support • No subqueries • No windowed aggregates – Very limited JDBC/ODBC driver – Everything executes in MapReduce • Even very small queries requiring little processing
25.
© 2013 International
Business Machines Corporation 25 Big SQL: Native SQL Query Access for Hadoop • Native SQL access to data stored in BigInsights – ANSI SQL 92+ – Standard syntax support (joins, data types, …) • Real JDBC/ODBC drivers – Prepared statements – Cancel support – Database metadata API support – Secure socket connections (SSL) • Optimization – Leveraging MapReduce parallelism or… – Direct access for low-latency queries • Varied data sources – HBase (including secondary indexes) – CSV, Delimited files, Sequence files – JSON – Hive tables Big SQL Engine BigInsights Data Sources SQL Hive Tables HBase tables CSV Files Application JDBC / ODBC Server JDBC / ODBC Driver
26.
© 2013 International
Business Machines Corporation 26 From Getting Starting to Enterprise Deployment InfoSphere BigInsights Brings Hadoop to the Enterprise Basic Edition Enterprise Edition - Accelerators - Performance Optimization - Visualization Capabilities - Pre-built applications - Text analytics - Spreadsheet-style tool - RDBMS, warehouse connectivity - Administrative tools, security - Eclipse development tools - Enterprise Integration . . . . - Web-based mgmt console - Jaql - Integrated install Breadth of capabilities Enterpriseclass Free download Sold by # of terabytes managed Apache Hadoop PureData for Hadoop - Appliance simplicity for the enterprise
27.
© 2013 International
Business Machines Corporation 27 Where to start with BigInsights? • Learn it at BigDataUniversity.com • Try it on Smart Cloud Enterprise: ibm.biz/Bdx8FF • Read about it in “Harness the Power of Big Data” at ibm.biz/Bdx8RP • Learn about Big Data at www.ibmbigdatahub.com • Register for “Big Data at the speed of business” event on April 30th at ibm.co/bigdataevent • Try BigSQL: bigsql.imdemocloud.com • YouTube Videos - Big Data Channel: youtube.com/user/ibmbigdata
28.
© 2013 International
Business Machines Corporation 28 IBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice at IBM’s sole discretion. Information regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing decision. The information mentioned regarding potential future products is not a commitment, promise, or legal obligation to deliver any material, code or functionality. Information about potential future products may not be incorporated into any contract. The development, release, and timing of any future features or functionality described for our products remains at our sole discretion. Performance is based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput or performance that any user will experience will vary depending upon many factors, including considerations such as the amount of multiprogramming in the user’s job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve results similar to those stated here. Please Note
Jetzt herunterladen