© 2013 Gartner, Inc. and/or its affiliates. All rights reserved. Gartner is a registered trademark of Gartner, Inc. or its affiliates. This publication may not be reproduced or distributed in any form without Gartner's prior written
permission. If you are authorized to access this publication, your use of it is subject to the Usage Guidelines for Gartner Services posted on gartner.com. The information contained in this publication has been obtained
from sources believed to be reliable. Gartner disclaims all warranties as to the accuracy, completeness or adequacy of such information and shall have no liability for errors, omissions or inadequacies in such information.
This publication consists of the opinions of Gartner's research organization and should not be construed as statements of fact. The opinions expressed herein are subject to change without notice. Although Gartner
research may include a discussion of related legal issues, Gartner does not provide legal advice or services and its research should not be construed or used as such. Gartner is a public company, and its shareholders
may include firms and funds that have financial interests in entities covered in Gartner research. Gartner's Board of Directors may include senior managers of these firms or funds. Gartner research is produced
independently by its research organization without input or influence from these firms, funds or their managers. For further information on the independence and integrity of Gartner research, see "Guiding Principles on
Independence and Objectivity."
Merv Adrian
Research Vice President, Information Management
Twitter: @merv
Blogs.gartner.com/merv-adrian
Hadoop — Entering Phase Two?
Nexus of Forces Drives Innovation
Extreme Networking
Pervasive Access
Global-Class Delivery
"Big," Rich Context
Cameras and microphones widely deployed
New routes to market via intelligent objects
Content and services via connected products
Everything has a URL
Remote sensing of objects and environment
Augmented reality
Situational decision support
Building and infrastructure management
Over 50% of Internet connections are things:
2011: 15+ billion permanent, 50+ billion intermittent
2020: 30+ billion permanent, >200 billion intermittent
Audio
GPRS Wi-Fi NFC
Higher-resolution display
LTE
Flash
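As a rough reading of the connection counts above, the implied yearly growth can be worked out as follows. This is only a sketch: it treats the "15+", "30+", "50+" and ">200" billion figures as point values, which the slide does not claim, and the function name is illustrative.

```python
def implied_annual_growth(start: float, end: float, years: int) -> float:
    """Compound annual growth rate that takes `start` to `end` over `years` years."""
    return (end / start) ** (1 / years) - 1


YEARS = 2020 - 2011  # nine years between the two data points

# Figures in billions, taken from the slide and treated as exact for illustration.
permanent_growth = implied_annual_growth(15, 30, YEARS)
intermittent_growth = implied_annual_growth(50, 200, YEARS)

print(f"permanent connections: {permanent_growth:.1%} per year")
print(f"intermittent connections: {intermittent_growth:.1%} per year")
```

Because the slide gives lower bounds, these rates are best read as orders of magnitude rather than forecasts.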
Gartner Definition of Big Data: High-volume, velocity and variety
information assets that demand cost-effective, innovative forms of
information processing for enhanced insight and decision making.
Gartner Research Circle 2013 Big Data Survey
687 Respondents Worldwide
$3.2B Mean Company Size
5,100 Mean Employees
60% Mainstream Adopters
18% Focused on Running/Maintaining
Are They Investing?
30% Have
31% No plans at this time
19% Plan to within the next year
15% Plan to within two years
5% Don't know
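The percentages on this slide can be combined to see how many respondents have invested or plan to. The sketch below does that arithmetic; the head counts are approximations that assume all 687 survey respondents answered this question, which the slide does not state.

```python
# Percent of respondents per answer, as shown on the slide.
responses = {
    "Have invested": 30,
    "Plan to within the next year": 19,
    "Plan to within two years": 15,
    "No plans at this time": 31,
    "Don't know": 5,
}
RESPONDENTS = 687  # survey base reported for 2013

# Share that has invested or plans to within two years.
committed = sum(
    responses[answer]
    for answer in (
        "Have invested",
        "Plan to within the next year",
        "Plan to within two years",
    )
)

# Approximate head counts (assumption: every respondent answered).
approx_counts = {
    answer: round(RESPONDENTS * pct / 100) for answer, pct in responses.items()
}

print(f"Invested or planning to invest: {committed}%")
for answer, count in approx_counts.items():
    print(f"{answer}: about {count} respondents")
```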
How Does That Compare to Last Year?
Note — Survey base increased from 473 in 2012 to 687 in 2013
[Bar chart: percent of respondents, 2012 vs. 2013; axis 0 to 40]
2013: Have invested 30, Within next year 19, Within two years 15, No plans 31, Don't know 5
2012 (values as extracted): 27, 15, 16, 11
Things Are Done Differently in Silicon Valley …
Traditional IM
• Requirements based
• Top-down design
• Integration and reuse
• Technology consolidation
• World of DW and ECM
• Competence centers
• Better decisions
• Commercial software
"Big Data" Style
• Opportunity oriented
• Bottom-up experimentation
• Immediate use
• Tool proliferation
• "World of Hadoop"
• Hackathons
• Better business
• Open source
Introducing: The Open-Source Car!
Apache Hadoop is a set of standard open-source software projects
that provide a framework for using massive amounts of data across
a distributed network
The standards steward, the Apache Software Foundation, manages
and distributes many typical components of the "Hadoop" platform
Many distributions exist, built and/or marketed by pure-play specialists
or major vendors, and they include additional open-source and commercial components
Clients Ask: Which Projects Are "Hadoop"?
• Minimum set (from Apache website):
- Apache HDFS
- Apache MapReduce
- Apache Yarn
• Other independent Apache projects:
Ambari, Avro, Cassandra, Chukwa, HBase, Hive, Mahout,
Pig, ZooKeeper
- The virtuous circle of open-source community
• Apache Hadoop is at version 1.0. Version 2.0,
which includes Yarn, is in alpha.
Rich, Complex Set of Functional Choices
Ingest/Propagate
Persist
Describe, Develop
Monitor, Administer
Analytics, Machine Learning
Compute, Search
Ingest/Propagate
Apache Flume, Apache Kafka, Apache Sqoop, HDFS NFS,
Informatica HParser, DBMS vendor utilities, Talend, WebHDFS
Import data into HDFS
(or alternatives)
• Commercial DBMS, DI or OSS
• "Big data" ≠ Hadoop —
import is not always required
− MapReduce inside DBMSs, HPCC,
SAS, Splunk, others
Export data into RDBMS
(or alternatives)
• NoSQL DBMSs are supported or
offer integration
• On the same cluster (HBase),
even the same nodes (Hadapt)
Also included here: "intercept-based" data remediation
Develop refers to coding functions, as in Pig, for execution elsewhere,
such as MapReduce
Metadata (Hive, HCatalog) describes data for other stack components
and external ones; e.g., DI and BI tools
Describe, Develop
Apache Crunch, Apache Hive, Apache Pig, Apache Tika, Cascading,
Cloudera Hue, DataFu, Dataguise, IBM Jaql
Runtime execution for programs created to run against HDFS
or HBase data
With Apache Hadoop 2.0, MapReduce will begin to lose its exclusivity
in "the basic stack" with Yarn support
MapReduce was first, but others have emerged as additions/alternatives/supplements
Compute, Search
Apache Blur, Apache Drill, Apache Giraph, Apache Hama, Apache Lucene, Apache MapReduce,
Apache Solr, Cloudera Impala, HP HAVEn, IBM BigSQL, IBM InfoSphere Streams, HStreaming,
Pivotal HAWQ, SQLstream, Storm, Teradata SQL-H
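For readers new to the MapReduce model named on this slide, here is a minimal in-process sketch of its three steps (map, shuffle, reduce) applied to a word count. It is an illustration in plain Python, not the Hadoop API, and all function names are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group all emitted values by key, as the framework does between phases."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Collapse the values for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    mapped = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(key, values) for key, values in shuffle(mapped).items())


counts = word_count(["Hadoop stores data", "MapReduce processes data"])
print(counts)
```

In a real cluster the map and reduce functions run on many nodes and the shuffle moves data over the network; the sketch keeps everything in one process to show only the data flow.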
File system: Append only, access methods at OS level
Database: Collected and structured to facilitate storage, retrieval, modification,
and deletion in online, not only batch, mode
Serialized: Format that can be stored in a database, eliminating
byte-ordering dependence and adding metadata
Persist
File System: Apache HDFS, IBM GPFS, Lustre, MapR Data Platform
Serialization: Apache Avro, RCFile (and ORCFile), SequenceFile, Text, Trevni
DBMS: Apache Accumulo, Apache Cassandra, Apache HBase, Google Dremel, Hadapt,
HP Vertica, IBM DB2, Kognitio, Oracle, Oracle MySQL, RainStor, Teradata Aster, Teradata, others
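The point about serialization (fixed byte order plus metadata that travels with the data) can be illustrated with a small sketch. The layout below is hypothetical and invented for this example; it is not the Avro, SequenceFile or RCFile format. It writes a JSON header describing the fields, followed by rows packed in big-endian order so that they read the same on any machine.

```python
import json
import struct


def serialize(records, fields):
    """Pack records as: 4-byte header length, JSON header (schema), fixed-width rows.

    `fields` is a list of (name, struct format character) pairs.
    """
    header = json.dumps({"fields": fields, "count": len(records)}).encode("utf-8")
    row_format = ">" + "".join(fmt for _, fmt in fields)  # ">" forces big-endian
    body = b"".join(struct.pack(row_format, *row) for row in records)
    return struct.pack(">I", len(header)) + header + body


def deserialize(blob):
    """Read the header first, then use it to decode the rows."""
    (header_len,) = struct.unpack_from(">I", blob, 0)
    header = json.loads(blob[4:4 + header_len].decode("utf-8"))
    row_format = ">" + "".join(fmt for _, fmt in header["fields"])
    row_size = struct.calcsize(row_format)
    names = [name for name, _ in header["fields"]]
    offset = 4 + header_len
    rows = []
    for i in range(header["count"]):
        values = struct.unpack_from(row_format, blob, offset + i * row_size)
        rows.append(dict(zip(names, values)))
    return rows


# Hypothetical sensor readings: an unsigned int id and a double value.
schema = [("sensor_id", "I"), ("reading", "d")]
blob = serialize([(1, 20.5), (2, 21.25)], schema)
restored = deserialize(blob)
print(restored)
```

Because the schema is stored with the data, a reader needs no outside knowledge of the field names or types to decode the rows.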
System health and administration
Cloud configuration and connection to resources
Virtualization and resource management
Job management and orchestration
Monitor, Administer
Apache Ambari, Apache Chukwa, Apache Falcon, Apache Oozie, Apache Whirr,
Apache ZooKeeper, Cloudera Manager, Ganglia, Nagios, Pivotal Serengeti
Analytics, Machine Learning
Apache Drill, Apache Hive, Apache Mahout, Datameer, IBM BigSheets, IBM BigSQL,
Karmasphere, Microsoft Excel, Platfora, Revolution Analytics RHadoop, SAS, Skytree
This is where the future is — it's not just "a part of the stack" but why it exists
Machine learning, advanced statistical analysis, scenario modeling
"BI for Hadoop": Statistical libraries for use in programs, spreadsheets,
reporting, visualization tools
Go Ahead — Pick the Pieces You Need
Ingest/Propagate
Persist
Describe, Develop
Monitor, Administer
Analytics, Machine Learning
Compute, Search
Distribution Vendors Sort It Out for You
Megavendors:
Amazon, EMC
Pivotal, IBM, Intel
Megapartners:
Dell, HP, NetApp,
Microsoft, Oracle,
Teradata
Leading pure plays:
Cloudera, Hortonworks, MapR
Others:
Datastax, LucidWorks, RainStor, Sqrrl,
WANdisco, Zettaset
Hadoop's Great Leap Forward
Hadoop has moved to the next stage with Apache Hadoop 2.0.
• Mainstream vendors are all interested, contributing and adding value
• Skills development is ramping up rapidly
From: Single-stack
To: Yarn-based multistyle environment, supporting multiple engines
From: Batch-only, file-based stack
To: Interactive capabilities with multiple optional databases
From: SQL translation with Hive
To: "SQL in front of Hadoop": Cloudera Impala, IBM Big SQL, Pivotal Hawq, Platfora, others
From: Relatively unmanaged
To: Ambari-based beginnings of real management
What's Next?
Search
Advanced prebuilt analytic functions
Cluster, appliance or cloud?
Virtualization
Graph processing
What's Still Needed?
Security
Data Warehousing Tools
Governance
Distributed Optimization
Subproject Optimization Skills
By 2015, big data demand will reach
4.4 million jobs worldwide,
but only one-third of those jobs will be filled.
[Chart: big data jobs by region (Americas, EMEA, APJ); axis 0 to 2,500,000]
Industries shown: Education; Wholesale Trade; Healthcare Providers; Transportation; Utilities; Retail; Insurance; Communications, Media & Services; Government; Banking & Securities; Manufacturing & Natural Resources
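The headline figure on this slide implies a sizable gap between demand and supply. A quick sketch of the arithmetic, treating "one-third" as exactly 1/3 (an assumption; the slide gives no more precise figure):

```python
from fractions import Fraction

DEMAND = 4_400_000             # big data jobs worldwide by 2015, per the slide
FILLED_SHARE = Fraction(1, 3)  # "only one-third of those jobs will be filled"

filled = round(DEMAND * FILLED_SHARE)
unfilled = DEMAND - filled

print(f"Filled:   about {filled:,}")
print(f"Unfilled: about {unfilled:,}")
```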
Recommendations
• Audit your data: find "dark data" and map it to business opportunities to identify pilot projects
• Familiarize yourself with the capabilities of available Hadoop distributions
• Build skills and recruit within the organization from early experimenters for a data science lab
• Consider cloud pilots to minimize capital expenditure
Thank you!
http://www.flickr.com/photos/orinrobertjohn/3267286885/sizes/o/in/photostream/
Hadoop Turns a Corner and Sees the Future

 

Mehr von DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Kürzlich hochgeladen

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 

Kürzlich hochgeladen (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 

Hadoop Turns a Corner and Sees the Future

  • 1.
  • 2. © 2013 Gartner, Inc. and/or its affiliates. All rights reserved. Gartner is a registered trademark of Gartner, Inc. or its affiliates. This publication may not be reproduced or distributed in any form without Gartner's prior written permission. If you are authorized to access this publication, your use of it is subject to the Usage Guidelines for Gartner Services posted on gartner.com. The information contained in this publication has been obtained from sources believed to be reliable. Gartner disclaims all warranties as to the accuracy, completeness or adequacy of such information and shall have no liability for errors, omissions or inadequacies in such information. This publication consists of the opinions of Gartner's research organization and should not be construed as statements of fact. The opinions expressed herein are subject to change without notice. Although Gartner research may include a discussion of related legal issues, Gartner does not provide legal advice or services and its research should not be construed or used as such. Gartner is a public company, and its shareholders may include firms and funds that have financial interests in entities covered in Gartner research. Gartner's Board of Directors may include senior managers of these firms or funds. Gartner research is produced independently by its research organization without input or influence from these firms, funds or their managers. For further information on the independence and integrity of Gartner research, see "Guiding Principles on Independence and Objectivity." Merv Adrian Research Vice President, Information Management Twitter: @merv Blogs.gartner.com/merv-adrian Hadoop — Entering Phase Two?
  • 3. Nexus of Forces Drives Innovation
      - Extreme Networking
      - Pervasive Access
      - Global-Class Delivery
      - "Big," Rich Context
  • 4.
      - Cameras and microphones widely deployed
      - New routes to market via intelligent objects
      - Content and services via connected products
      - Everything has a URL
      - Remote sensing of objects and environment
      - Augmented reality
      - Situational decision support
      - Building and infrastructure management
      - Over 50% of Internet connections are things:
          - 2011: 15+ billion permanent, 50+ billion intermittent
          - 2020: 30+ billion permanent, >200 billion intermittent
      - Audio, GPRS, Wi-Fi, NFC, higher-resolution display, LTE, Flash
  • 5. Gartner Definition of Big Data
      High-volume, high-velocity and high-variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making.
      Gartner Research Circle 2013 Big Data Survey:
      - 687 respondents worldwide
      - $3.2B mean company size
      - 5,100 mean employees
      - 60% mainstream adopters
      - 18% focused on running/maintaining
  • 6. Are They Investing?
      - 30% Have invested
      - 31% No plans at this time
      - 19% Plan to within the next year
      - 15% Plan to within two years
      - 5% Don't know
  • 7. How Does That Compare to Last Year?
      - Note: survey base increased from 473 in 2012 to 687 in 2013
      - [Bar chart, 2012 vs. 2013, axis 0 to 40. Categories: Have invested, Within next year, Within two years, No plans, Don't know. Data labels as extracted: 27, 15, 16, 11, 30, 19, 15, 31, 5]
  • 8. Things Are Done Differently in Silicon Valley …
      Traditional IM             | "Big Data" Style
      Requirements based         | Opportunity oriented
      Top-down design            | Bottom-up experimentation
      Integration and reuse      | Immediate use
      Technology consolidation   | Tool proliferation
      World of DW and ECM        | "World of Hadoop"
      Competence centers         | Hackathons
      Better decisions           | Better business
      Commercial software        | Open source
  • 9. Introducing: The Open-Source Car!
  • 10. Apache Hadoop
      - Apache Hadoop is a set of standard open-source software projects that provide a framework for using massive amounts of data across a distributed network
      - The standards steward, the Apache Software Foundation, manages and distributes many typical components of the "Hadoop" platform
      - Many distributions exist, built and/or marketed by pure-play specialists or major vendors; they include additional open-source and commercial components
  • 12. Clients Ask: Which Projects Are "Hadoop"?
      - Minimum set (from Apache website):
          - Apache HDFS
          - Apache MapReduce
          - Apache YARN
      - Other independent Apache projects: Ambari, Avro, Cassandra, Chukwa, HBase, Hive, Mahout, Pig, ZooKeeper
          - The virtuous circle of open-source community
      - Apache Hadoop is at version 1.0. Version 2.0, including YARN, is in alpha.
  • 13. Rich, Complex Set of Functional Choices
      - Ingest/Propagate
      - Persist
      - Describe, Develop
      - Monitor, Administer
      - Analytics, Machine Learning
      - Compute, Search
  • 14. Ingest/Propagate
      - Apache Flume, Apache Kafka, Apache Sqoop, HDFS NFS, Informatica HParser, DBMS vendor utilities, Talend, WebHDFS
      - Import data into HDFS (or alternatives)
          - Commercial DBMS, DI or OSS
          - "Big data" ≠ Hadoop: import is not always required
              - MapReduce inside DBMSs, HPCC, SAS, Splunk, others
      - Export data into RDBMS (or alternatives)
          - NoSQL DBMS supported, or offer integration
          - On same cluster (HBase), even same nodes (Hadapt)
  • 15. Describe, Develop
      - Apache Crunch, Apache Hive, Apache Pig, Apache Tika, Cascading, Cloudera Hue, DataFu, Dataguise, IBM Jaql
      - Metadata (Hive, HCatalog) describes data for other stack components and external ones, e.g., DI and BI tools
      - Develop refers to coding functions, as in Pig, for execution elsewhere, such as MapReduce
      - Also included here: "intercept-based" data remediation
  • 16. Compute, Search
      - Apache Blur, Apache Drill, Apache Giraph, Apache Hama, Apache Lucene, Apache MapReduce, Apache Solr, Cloudera Impala, HP HAVEn, IBM Big SQL, IBM InfoSphere Streams, HStreaming, Pivotal HAWQ, SQLstream, Storm, Teradata SQL-H
      - Runtime execution for programs created to run against HDFS or HBase data
      - MapReduce was first, but others have emerged as additions/alternatives/supplements
      - With Apache Hadoop 2.0 and its YARN support, MapReduce will begin to lose its exclusivity in "the basic stack"
  • 17. Persist
      - File system: append only, access methods at OS level
          - Apache HDFS, IBM GPFS, Lustre, MapR Data Platform
      - Database: collected and structured to facilitate storage, retrieval, modification and deletion in online, not only batch, mode
          - Apache Accumulo, Apache Cassandra, Apache HBase, Google Dremel, Hadapt, HP Vertica, IBM DB2, Kognitio, Oracle, Oracle MySQL, RainStor, Teradata Aster, Teradata, others
      - Serialized: format that can be stored in a database, eliminating byte-ordering differences, adding metadata
          - Apache Avro, RCFile (and ORCFile), SequenceFile, Text, Trevni
  • 18. Monitor, Administer
      - Apache Ambari, Apache Chukwa, Apache Falcon, Apache Oozie, Apache Whirr, Apache ZooKeeper, Cloudera Manager, Ganglia, Nagios, Pivotal Serengeti
      - System health and administration
      - Cloud configuration and connection to resources
      - Virtualization and resource management
      - Job management and orchestration
  • 19. Analytics, Machine Learning
      - Apache Drill, Apache Hive, Apache Mahout, Datameer, IBM BigSheets, IBM Big SQL, Karmasphere, Microsoft Excel, Platfora, Revolution Analytics RHadoop, SAS, Skytree
      - This is where the future is: it's not just "a part of the stack" but why the stack exists
      - Machine learning, advanced statistical analysis, scenario modeling
      - "BI for Hadoop": statistical libraries for use in programs, spreadsheets, reporting, visualization tools
  • 20. Go Ahead: Pick the Pieces You Need
      - Ingest/Propagate
      - Persist
      - Describe, Develop
      - Monitor, Administer
      - Analytics, Machine Learning
      - Compute, Search
  • 21. Distribution Vendors Sort It Out for You
      - Megavendors: Amazon, EMC Pivotal, IBM, Intel
      - Megapartners: Dell, HP, NetApp, Microsoft, Oracle, Teradata
      - Leading pure plays: Cloudera, Hortonworks, MapR
      - Others: DataStax, LucidWorks, RainStor, Sqrrl, WANdisco, Zettaset
  • 22. Hadoop's Great Leap Forward
      Hadoop has moved to the next stage with Apache Hadoop 2.0.
      - Mainstream vendors are all interested, contributing and adding value
      - Skills development is ramping up rapidly
      From                          | To
      Single-stack                  | YARN-based multistyle environment, supporting multiple engines
      Batch-only, file-based stack  | Interactive capabilities with multiple optional databases
      SQL translation with Hive     | "SQL in front of Hadoop": Cloudera Impala, IBM Big SQL, Pivotal HAWQ, Platfora, others
      Relatively unmanaged          | Ambari-based beginnings of real management
  • 23. What's Next?
      - Search
      - Advanced prebuilt analytic functions
      - Cluster, appliance or cloud?
      - Virtualization
      - Graph processing
  • 24. What's Still Needed? Security Data Warehousing Tools Governance Distributed Optimization Subproject Optimization Skills
  • 25. By 2015, big data demand will reach 4.4 million jobs worldwide, but only one-third of those jobs will be filled.
      - [Bar chart, axis 0 to 2,500,000. Regions: Americas, EMEA, APJ. Industries: Education, Wholesale Trade, Healthcare Providers, Transportation, Utilities, Retail, Insurance, Communications, Media & Services, Government, Banking & Securities, Manufacturing & Natural Resources]
  • 26. Recommendations
      - Audit your data: find "dark data" and map it to business opportunities to identify pilot projects
      - Familiarize yourself with the capabilities of available Hadoop distributions
      - Build skills and recruit within the organization from early experimenters for a data science lab
      - Consider cloud pilots to minimize capital expenditure
  • 27. Thank you! http://www.flickr.com/photos/orinrobertjohn/3267286885/sizes/o/in/photostream/
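
Slide 16 names MapReduce as the first runtime engine in the stack. As a rough illustration of the programming model only, the sketch below runs the three conceptual phases (map, shuffle, reduce) in a single Python process. It does not use any Hadoop API; the function names and the word-count task are assumptions chosen for illustration.

```python
from __future__ import annotations

from collections import defaultdict
from typing import Iterable, Iterator


def map_phase(lines: Iterable[str]) -> Iterator[tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs: Iterable[tuple[str, int]]) -> dict[str, list[int]]:
    """Shuffle: group all emitted values by their key."""
    grouped: dict[str, list[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(grouped: dict[str, list[int]]) -> dict[str, int]:
    """Reduce: collapse the values for each key into a single count."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(lines: Iterable[str]) -> dict[str, int]:
    """Chain the three phases; a real cluster would distribute each one."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    sample = ["Hadoop stores data", "Hadoop processes data in batch"]
    print(word_count(sample))
```

In a cluster the map and reduce steps would run in parallel across nodes and the shuffle would move data over the network; here everything runs sequentially so the data flow is easy to follow.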

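Slide 17 describes a serialized format as one that removes byte-ordering concerns and adds metadata. The sketch below shows that idea with Python's standard `struct` and `json` modules: a fixed big-endian layout plus a small schema header. The record layout, `SCHEMA` and the function names are hypothetical, and this is not the on-disk format of Avro, SequenceFile or any other format named on the slide.

```python
import json
import struct

# Hypothetical schema for illustration: one unsigned 32-bit id and one 64-bit float.
SCHEMA = {"fields": [["sensor_id", "uint32"], ["reading", "float64"]]}

# ">" forces big-endian layout with no padding, whatever the host CPU uses.
RECORD = struct.Struct(">Id")


def serialize(records):
    """Write a length-prefixed JSON schema header, then fixed-size records."""
    header = json.dumps(SCHEMA).encode("utf-8")
    out = struct.pack(">I", len(header)) + header
    for sensor_id, reading in records:
        out += RECORD.pack(sensor_id, reading)
    return out


def deserialize(blob):
    """Read the schema header back, then unpack each fixed-size record."""
    (header_len,) = struct.unpack_from(">I", blob, 0)
    schema = json.loads(blob[4:4 + header_len].decode("utf-8"))
    body = blob[4 + header_len:]
    records = [
        RECORD.unpack_from(body, offset)
        for offset in range(0, len(body), RECORD.size)
    ]
    return schema, records


if __name__ == "__main__":
    blob = serialize([(1, 20.5), (2, 21.25)])
    print(deserialize(blob))
```

Because the header travels with the data, a reader on a machine with a different native byte order can still recover both the field layout and the values.
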
Editor's Notes

  1. Economic Disruption: The Growth of Data