© 2014 Trace3, All rights reserved.

BIG DATA INTELLIGENCE PRACTICE

HADOOP: PAST, PRESENT AND FUTURE
  
Roadmap (~1 hour)

1. What Makes Up Hadoop 1.x?
2. What’s New In Hadoop 2.x?
3. The Future Of Hadoop …
  
WHAT MAKES UP HADOOP 1.0?
  
What’s a “Node”?

Node aka Server

• Compute
• Storage
• Operating System
• Memory
  
Hadoop 1.0: HDFS + MapReduce

[Diagram: a Client, a NameNode, a JobTracker, and four DataNode / TaskTracker nodes. Labels shown: 1-1, 1-2, 1-3.]
  
Hadoop 1.0: HDFS + MapReduce

[Diagram: a Client, a NameNode, a JobTracker, and four DataNode / TaskTracker nodes running Map and Reduce tasks. Labels shown: 1-1, 1-2, 1-3, 2-1, 2-2, 2-3, 3-1, 3-2, 3-3, 4-1, 4-2, 4-3.]
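
The Map and Reduce phases named on this slide can be illustrated with a minimal single-process sketch. It assumes a word-count job; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample input are hypothetical and are not part of the Hadoop API. Each input string stands in for one block of a file, which on a real cluster a TaskTracker would process as one map task.

```python
from collections import defaultdict


def map_phase(split):
    """Map: emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in split.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by key, between Map and Reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Reduce: sum the values collected for each key."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(splits):
    # Each split stands in for one block handled by one map task.
    emitted = [pair for split in splits for pair in map_phase(split)]
    return reduce_phase(shuffle(emitted))


if __name__ == "__main__":
    print(word_count(["Hadoop stores data", "Hadoop processes data"]))
```

The sketch runs everything in one process; the point is only the shape of the computation (independent map tasks, a grouping step, then reduce tasks), not how Hadoop distributes it.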
  
MapReduce v1 Limitations

Scalability
Maximum cluster size is 4,000 nodes and maximum concurrent tasks is 40,000

Availability
JobTracker failure kills all queued and running jobs

Resources Partitioned into Map and Reduce
Hard partitioning of Map and Reduce slots led to low resource utilization

No Support for Alternate Paradigms / Services
Only MapReduce batch jobs, nothing else
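
The utilization problem caused by hard-partitioned Map and Reduce slots can be shown with a little arithmetic. The sketch below is illustrative only: the slot counts, the workload, and the function names are assumptions, and every task is assumed to need exactly one unit of capacity.

```python
def runnable_with_slots(map_tasks, reduce_tasks, map_slots, reduce_slots):
    """Hard partitioning: map tasks may only use map slots,
    reduce tasks may only use reduce slots."""
    return min(map_tasks, map_slots) + min(reduce_tasks, reduce_slots)


def runnable_with_pool(map_tasks, reduce_tasks, total_capacity):
    """Shared pool: any task may use any free unit of capacity."""
    return min(map_tasks + reduce_tasks, total_capacity)


def utilization(running, capacity):
    """Fraction of the capacity that is doing work."""
    return running / capacity


if __name__ == "__main__":
    # Hypothetical node: 8 map slots + 4 reduce slots = 12 units in total.
    # Hypothetical workload: map-heavy, 12 map tasks and no reduce tasks yet.
    slotted = runnable_with_slots(12, 0, map_slots=8, reduce_slots=4)
    pooled = runnable_with_pool(12, 0, total_capacity=12)
    print("hard-partitioned utilization:", utilization(slotted, 12))
    print("pooled utilization:", utilization(pooled, 12))
```

With these assumed numbers, the four reduce slots sit idle while map tasks wait, which is the effect the slide describes.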
  
Hadoop 1.0: Single Use System

HADOOP 1.0
Single Use System
Batch Apps

Stack, top to bottom:
• Pig, Hive
• MapReduce (cluster resource management and data processing)
• HDFS (redundant, reliable storage)
  
WHAT’S NEW IN HADOOP 2.0?
  
YARN

YARN Replaces MapReduce as the Cluster Resource Manager

Yet Another Resource Negotiator

YARN will be the de-facto distributed operating system for Big Data
  
YARN: No Longer Just Batch Apps

Store DATA in one place
Interact with that data in MULTIPLE WAYS
with Predictable Performance and Quality of Service

Applications Run Natively IN Hadoop

Stack, top to bottom:
• BATCH (MapReduce), INTERACTIVE (Tez), ONLINE (HBase), STREAMING (DataTorrent), GRAPH (Giraph)
• YARN (cluster resource management)
• HDFS2 (redundant, reliable storage)
  
YARN: Applications

Running all on the same Hadoop cluster to give applications access to all the same source data!

• MapReduce v2
• Stream Processing
• Master-Worker
• Online
• In-Memory
• Apache Storm
  
YARN: Quickly Maturing

[Timeline: 2010, 2011, 2012, 2013, 2014, Today]

• Conceived at Yahoo!
• Alpha Releases – 2.0
• Beta Releases – 2.1
• GA Released – 2.2
• 100,000+ nodes, 400,000+ jobs daily
• 10 million+ hours of compute daily
• Version 2.3
• Version 2.4
  
YARN: Dr. Evil Approved
  
YARN: What Has Changed?

YARN                                          MRv1
RM (ResourceManager), AM (ApplicationMaster)  JT (JobTracker)
Scheduler                                     Scheduler
NM (NodeManager)                              TT (TaskTracker)
Container                                     Map & Reduce Slot

[Diagram: on the YARN side, a ResourceManager with its Scheduler, and NodeManagers hosting an ApplicationMaster and Containers; on the MRv1 side, a JobTracker with its Scheduler, and TaskTrackers hosting Map and Reduce slots.]
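
The shift from typed Map and Reduce slots to generic Containers can be sketched with a toy model. This is not the YARN API: the class names only mirror the terms on the slide, the first-fit placement rule is an assumption made for brevity (real YARN schedulers are considerably more involved), and memory is the only resource modeled.

```python
from dataclasses import dataclass, field


@dataclass
class NodeManager:
    """Toy worker node that tracks the containers placed on it."""
    name: str
    memory_mb: int
    containers: list = field(default_factory=list)  # (app_id, memory_mb) pairs

    def free_mb(self):
        return self.memory_mb - sum(size for _, size in self.containers)


class ResourceManager:
    """Toy scheduler: first-fit placement of container requests."""

    def __init__(self, nodes):
        self.nodes = nodes

    def allocate(self, app_id, memory_mb):
        """Place one container; return the node name, or None if nothing fits."""
        for node in self.nodes:
            if node.free_mb() >= memory_mb:
                node.containers.append((app_id, memory_mb))
                return node.name
        return None


if __name__ == "__main__":
    nodes = [NodeManager("nm1", 4096), NodeManager("nm2", 4096)]
    rm = ResourceManager(nodes)
    # Hypothetical applications of different kinds share the same nodes.
    for app_id, size in [("mapreduce-job", 3072), ("storm-topology", 2048), ("hbase", 1024)]:
        print(app_id, "->", rm.allocate(app_id, size))
```

Because a container is sized by resources rather than typed as "map" or "reduce", any kind of application can use whatever capacity is free.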
  
The 6 Benefits Of YARN

• Scale
• New programming models and services
• Improved cluster utilization
• Agility
• Backwards compatible with MapReduce v1
• Mixed workloads on the same source of data
  
THE FUTURE OF HADOOP
  
SQL on Hadoop

Speed
Deliver interactive query performance.

SQL
Support array of SQL semantics for analytic applications running against Hadoop.

Scale
SQL interface to Hadoop designed for queries that scale from Terabytes to Petabytes
  
	
  
SQL on Hadoop

• Hive on Apache Tez (Hortonworks HDP2)
• Hive on Apache Spark (Cloudera CDH5)
• Apache Drill (MapR M7)
• Cloudera Impala (Cloudera CDH5)
• Pivotal HAWQ (Pivotal Big Data Suite)
  
HOYA: HBase (NoSQL) on YARN

Dynamic Scaling
On-demand cluster size. Increase and decrease the size with load.

Easier Deployment
APIs to create, start, stop and delete HBase clusters.

Availability
Recover from Region Server loss with a new container.
  
Microsoft REEF

Retainable Evaluator Execution Framework

Machine Learning
Framework well suited for building machine learning jobs.

Scalable / Fault Tolerant
Makes it easy to implement scalable, fault-tolerant runtime environments for a range of computational models.

Maintain State
Users can build jobs that utilize data from where it’s needed and also maintain state after jobs are done.
  
Heterogeneous Storage

THEN: NameNode, Storage
NOW: NameNode, SATA, SSD, Fusion IO
  
Hadoop Roadmap

• Apache Hadoop 2.5 (Q3 2014)
–  NodeManager Restart w/o disruption
–  Dynamic Resource Configuration

• Apache Hadoop 2.6 (Q4 2014)
–  Memory As Storage Tier
–  Support For Docker Containers
  
HADOOP: PAST, PRESENT & FUTURE

I KNOW YOU HAVE QUESTIONS

NO SUCH THING AS A STUPID QUESTION.
  
ONE LAST THING …

SD Big Data Meetup
meetup.com/sdbigdata
2nd Wednesday Of The Month
Next: August 13th @ 5:45P
  

Weitere ähnliche Inhalte

Was ist angesagt?

An Introduction to Apache Hadoop Yarn
An Introduction to Apache Hadoop YarnAn Introduction to Apache Hadoop Yarn
An Introduction to Apache Hadoop YarnMike Frampton
 
Hive+Tez: A performance deep dive
Hive+Tez: A performance deep diveHive+Tez: A performance deep dive
Hive+Tez: A performance deep divet3rmin4t0r
 
Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...
Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...
Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...Simplilearn
 
Apache Hadoop YARN - Enabling Next Generation Data Applications
Apache Hadoop YARN - Enabling Next Generation Data ApplicationsApache Hadoop YARN - Enabling Next Generation Data Applications
Apache Hadoop YARN - Enabling Next Generation Data ApplicationsHortonworks
 
The Future of Hadoop: MapR VP of Product Management, Tomer Shiran
The Future of Hadoop: MapR VP of Product Management, Tomer ShiranThe Future of Hadoop: MapR VP of Product Management, Tomer Shiran
The Future of Hadoop: MapR VP of Product Management, Tomer ShiranMapR Technologies
 
Hadoop Internals (2.3.0 or later)
Hadoop Internals (2.3.0 or later)Hadoop Internals (2.3.0 or later)
Hadoop Internals (2.3.0 or later)Emilio Coppa
 
Hadoop - Past, Present and Future - v1.2
Hadoop - Past, Present and Future - v1.2Hadoop - Past, Present and Future - v1.2
Hadoop - Past, Present and Future - v1.2Big Data Joe™ Rossi
 
Big Data Performance and Capacity Management
Big Data Performance and Capacity ManagementBig Data Performance and Capacity Management
Big Data Performance and Capacity Managementrightsize
 
MapR M7: Providing an enterprise quality Apache HBase API
MapR M7: Providing an enterprise quality Apache HBase APIMapR M7: Providing an enterprise quality Apache HBase API
MapR M7: Providing an enterprise quality Apache HBase APImcsrivas
 
NextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduceNextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduceHortonworks
 
Apache Hadoop YARN 2015: Present and Future
Apache Hadoop YARN 2015: Present and FutureApache Hadoop YARN 2015: Present and Future
Apache Hadoop YARN 2015: Present and FutureDataWorks Summit
 
Hive at Yahoo: Letters from the trenches
Hive at Yahoo: Letters from the trenchesHive at Yahoo: Letters from the trenches
Hive at Yahoo: Letters from the trenchesDataWorks Summit
 
Hadoop Summit San Jose 2015: Towards SLA-based Scheduling on YARN Clusters
Hadoop Summit San Jose 2015: Towards SLA-based Scheduling on YARN Clusters Hadoop Summit San Jose 2015: Towards SLA-based Scheduling on YARN Clusters
Hadoop Summit San Jose 2015: Towards SLA-based Scheduling on YARN Clusters Sumeet Singh
 
Hadoop Summit San Jose 2014: Costing Your Big Data Operations
Hadoop Summit San Jose 2014: Costing Your Big Data Operations Hadoop Summit San Jose 2014: Costing Your Big Data Operations
Hadoop Summit San Jose 2014: Costing Your Big Data Operations Sumeet Singh
 
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduceBIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduceMahantesh Angadi
 

Was ist angesagt? (20)

An Introduction to Apache Hadoop Yarn
An Introduction to Apache Hadoop YarnAn Introduction to Apache Hadoop Yarn
An Introduction to Apache Hadoop Yarn
 
Hive+Tez: A performance deep dive
Hive+Tez: A performance deep diveHive+Tez: A performance deep dive
Hive+Tez: A performance deep dive
 
Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...
Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...
Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...
 
Apache Hadoop YARN - Enabling Next Generation Data Applications
Apache Hadoop YARN - Enabling Next Generation Data ApplicationsApache Hadoop YARN - Enabling Next Generation Data Applications
Apache Hadoop YARN - Enabling Next Generation Data Applications
 
The Future of Hadoop: MapR VP of Product Management, Tomer Shiran
The Future of Hadoop: MapR VP of Product Management, Tomer ShiranThe Future of Hadoop: MapR VP of Product Management, Tomer Shiran
The Future of Hadoop: MapR VP of Product Management, Tomer Shiran
 
Hadoop Internals (2.3.0 or later)
Hadoop Internals (2.3.0 or later)Hadoop Internals (2.3.0 or later)
Hadoop Internals (2.3.0 or later)
 
Hadoop - Past, Present and Future - v1.2
Hadoop - Past, Present and Future - v1.2Hadoop - Past, Present and Future - v1.2
Hadoop - Past, Present and Future - v1.2
 
Big Data Performance and Capacity Management
Big Data Performance and Capacity ManagementBig Data Performance and Capacity Management
Big Data Performance and Capacity Management
 
February 2014 HUG : Pig On Tez
February 2014 HUG : Pig On TezFebruary 2014 HUG : Pig On Tez
February 2014 HUG : Pig On Tez
 
Hadoop scheduler
Hadoop schedulerHadoop scheduler
Hadoop scheduler
 
MapR M7: Providing an enterprise quality Apache HBase API
MapR M7: Providing an enterprise quality Apache HBase APIMapR M7: Providing an enterprise quality Apache HBase API
MapR M7: Providing an enterprise quality Apache HBase API
 
NextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduceNextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduce
 
Apache Hadoop YARN 2015: Present and Future
Apache Hadoop YARN 2015: Present and FutureApache Hadoop YARN 2015: Present and Future
Apache Hadoop YARN 2015: Present and Future
 
10c introduction
10c introduction10c introduction
10c introduction
 
Yarn
YarnYarn
Yarn
 
Hive at Yahoo: Letters from the trenches
Hive at Yahoo: Letters from the trenchesHive at Yahoo: Letters from the trenches
Hive at Yahoo: Letters from the trenches
 
Hadoop Summit San Jose 2015: Towards SLA-based Scheduling on YARN Clusters
Hadoop Summit San Jose 2015: Towards SLA-based Scheduling on YARN Clusters Hadoop Summit San Jose 2015: Towards SLA-based Scheduling on YARN Clusters
Hadoop Summit San Jose 2015: Towards SLA-based Scheduling on YARN Clusters
 
Hadoop Summit San Jose 2014: Costing Your Big Data Operations
Hadoop Summit San Jose 2014: Costing Your Big Data Operations Hadoop Summit San Jose 2014: Costing Your Big Data Operations
Hadoop Summit San Jose 2014: Costing Your Big Data Operations
 
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduceBIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
 
Hadoop - Introduction to HDFS
Hadoop - Introduction to HDFSHadoop - Introduction to HDFS
Hadoop - Introduction to HDFS
 

Andere mochten auch

Building enterprise advance analytics platform
Building enterprise advance analytics platformBuilding enterprise advance analytics platform
Building enterprise advance analytics platformHaoran Du
 
Trace 3 interview questions and answers
Trace 3 interview questions and answersTrace 3 interview questions and answers
Trace 3 interview questions and answersselinasimpson205
 
Driving Retail Success with Machine Data Intelligence
Driving Retail Success with Machine Data IntelligenceDriving Retail Success with Machine Data Intelligence
Driving Retail Success with Machine Data IntelligenceSumo Logic
 
8 disciplines of Enterprise Modernizartion
8 disciplines of Enterprise Modernizartion8 disciplines of Enterprise Modernizartion
8 disciplines of Enterprise ModernizartionDr. Mohan K. Bavirisetty
 
Data scientist enablement dse 400 week 6 roadmap
Data scientist enablement   dse 400   week 6 roadmapData scientist enablement   dse 400   week 6 roadmap
Data scientist enablement dse 400 week 6 roadmapDr. Mohan K. Bavirisetty
 
Data scientist enablement dse 400 - week 1
Data scientist enablement   dse 400 - week 1Data scientist enablement   dse 400 - week 1
Data scientist enablement dse 400 - week 1Dr. Mohan K. Bavirisetty
 
Mohan k. bavirisetty introduction to semantic soa & bpm sept 14 2010 v 1.0
Mohan k. bavirisetty    introduction to semantic soa & bpm sept 14 2010 v 1.0Mohan k. bavirisetty    introduction to semantic soa & bpm sept 14 2010 v 1.0
Mohan k. bavirisetty introduction to semantic soa & bpm sept 14 2010 v 1.0Dr. Mohan K. Bavirisetty
 
Data scientist enablement dse 400 - week 1 roadmap
Data scientist enablement   dse 400 - week 1 roadmapData scientist enablement   dse 400 - week 1 roadmap
Data scientist enablement dse 400 - week 1 roadmapDr. Mohan K. Bavirisetty
 
Building Big Data Analytics Center of Excellence v 3.0 Final
Building Big Data Analytics Center of Excellence v 3.0 FinalBuilding Big Data Analytics Center of Excellence v 3.0 Final
Building Big Data Analytics Center of Excellence v 3.0 FinalDr. Mohan K. Bavirisetty
 
Data scientist enablement dse 400 week 8 roadmap
Data scientist enablement   dse 400   week 8 roadmap Data scientist enablement   dse 400   week 8 roadmap
Data scientist enablement dse 400 week 8 roadmap Dr. Mohan K. Bavirisetty
 
Dr Mohan K Bavirisetty - 8 Disciplines of Enterprise Modernization - Final
Dr  Mohan K  Bavirisetty - 8 Disciplines of Enterprise Modernization - FinalDr  Mohan K  Bavirisetty - 8 Disciplines of Enterprise Modernization - Final
Dr Mohan K Bavirisetty - 8 Disciplines of Enterprise Modernization - FinalDr. Mohan K. Bavirisetty
 
Advanced Analytics - Frameworks, Platforms and Metholodologies v 1.0
Advanced Analytics - Frameworks, Platforms and Metholodologies v 1.0Advanced Analytics - Frameworks, Platforms and Metholodologies v 1.0
Advanced Analytics - Frameworks, Platforms and Metholodologies v 1.0Dr. Mohan K. Bavirisetty
 
Data scientist enablement dse 400 week 7 roadmap
Data scientist enablement   dse 400   week 7 roadmapData scientist enablement   dse 400   week 7 roadmap
Data scientist enablement dse 400 week 7 roadmapDr. Mohan K. Bavirisetty
 
Dr Mohan K Bavirisetty - 8 Disciplines of Enterprise Modernization - Final
Dr  Mohan K  Bavirisetty - 8 Disciplines of Enterprise Modernization - FinalDr  Mohan K  Bavirisetty - 8 Disciplines of Enterprise Modernization - Final
Dr Mohan K Bavirisetty - 8 Disciplines of Enterprise Modernization - FinalDr. Mohan K. Bavirisetty
 
Polyglot Processing - An Introduction 1.0
Polyglot Processing - An Introduction 1.0 Polyglot Processing - An Introduction 1.0
Polyglot Processing - An Introduction 1.0 Dr. Mohan K. Bavirisetty
 
Business Analytics Competency centre: A strategic Differentiator
Business Analytics Competency centre: A strategic Differentiator Business Analytics Competency centre: A strategic Differentiator
Business Analytics Competency centre: A strategic Differentiator BSGAfrica
 
BICC - A key element to your BI strategy
BICC - A key element to your BI strategyBICC - A key element to your BI strategy
BICC - A key element to your BI strategyGuyVanderSande
 
Center of Excellence Building Blocks
Center of Excellence Building BlocksCenter of Excellence Building Blocks
Center of Excellence Building BlocksArup Dutta
 
The Road to Becoming a Center of Excellence
The Road to Becoming a Center of ExcellenceThe Road to Becoming a Center of Excellence
The Road to Becoming a Center of ExcellenceLisa D'Adamo-Weinstein
 

Andere mochten auch (20)

Building enterprise advance analytics platform
Building enterprise advance analytics platformBuilding enterprise advance analytics platform
Building enterprise advance analytics platform
 
Trace 3 interview questions and answers
Trace 3 interview questions and answersTrace 3 interview questions and answers
Trace 3 interview questions and answers
 
Driving Retail Success with Machine Data Intelligence
Driving Retail Success with Machine Data IntelligenceDriving Retail Success with Machine Data Intelligence
Driving Retail Success with Machine Data Intelligence
 
8 disciplines of Enterprise Modernizartion
8 disciplines of Enterprise Modernizartion8 disciplines of Enterprise Modernizartion
8 disciplines of Enterprise Modernizartion
 
Data scientist enablement dse 400 week 6 roadmap
Data scientist enablement   dse 400   week 6 roadmapData scientist enablement   dse 400   week 6 roadmap
Data scientist enablement dse 400 week 6 roadmap
 
Data scientist enablement dse 400 - week 1
Data scientist enablement   dse 400 - week 1Data scientist enablement   dse 400 - week 1
Data scientist enablement dse 400 - week 1
 
Mohan k. bavirisetty introduction to semantic soa & bpm sept 14 2010 v 1.0
Mohan k. bavirisetty    introduction to semantic soa & bpm sept 14 2010 v 1.0Mohan k. bavirisetty    introduction to semantic soa & bpm sept 14 2010 v 1.0
Mohan k. bavirisetty introduction to semantic soa & bpm sept 14 2010 v 1.0
 
Data scientist enablement dse 400 - week 1 roadmap
Data scientist enablement   dse 400 - week 1 roadmapData scientist enablement   dse 400 - week 1 roadmap
Data scientist enablement dse 400 - week 1 roadmap
 
Building Big Data Analytics Center of Excellence v 3.0 Final
Building Big Data Analytics Center of Excellence v 3.0 FinalBuilding Big Data Analytics Center of Excellence v 3.0 Final
Building Big Data Analytics Center of Excellence v 3.0 Final
 
Data scientist enablement dse 400 week 8 roadmap
Data scientist enablement   dse 400   week 8 roadmap Data scientist enablement   dse 400   week 8 roadmap
Data scientist enablement dse 400 week 8 roadmap
 
Dr Mohan K Bavirisetty - 8 Disciplines of Enterprise Modernization - Final
Dr  Mohan K  Bavirisetty - 8 Disciplines of Enterprise Modernization - FinalDr  Mohan K  Bavirisetty - 8 Disciplines of Enterprise Modernization - Final
Dr Mohan K Bavirisetty - 8 Disciplines of Enterprise Modernization - Final
 
Advanced Analytics - Frameworks, Platforms and Metholodologies v 1.0
Advanced Analytics - Frameworks, Platforms and Metholodologies v 1.0Advanced Analytics - Frameworks, Platforms and Metholodologies v 1.0
Advanced Analytics - Frameworks, Platforms and Metholodologies v 1.0
 
Data scientist enablement dse 400 week 7 roadmap
Data scientist enablement   dse 400   week 7 roadmapData scientist enablement   dse 400   week 7 roadmap
Data scientist enablement dse 400 week 7 roadmap
 
Dr Mohan K Bavirisetty - 8 Disciplines of Enterprise Modernization - Final
Dr  Mohan K  Bavirisetty - 8 Disciplines of Enterprise Modernization - FinalDr  Mohan K  Bavirisetty - 8 Disciplines of Enterprise Modernization - Final
Dr Mohan K Bavirisetty - 8 Disciplines of Enterprise Modernization - Final
 
Data Scientist Enablement roadmap 1.0
Data Scientist Enablement roadmap 1.0Data Scientist Enablement roadmap 1.0
Data Scientist Enablement roadmap 1.0
 
Polyglot Processing - An Introduction 1.0
Polyglot Processing - An Introduction 1.0 Polyglot Processing - An Introduction 1.0
Polyglot Processing - An Introduction 1.0
 
Business Analytics Competency centre: A strategic Differentiator
Business Analytics Competency centre: A strategic Differentiator Business Analytics Competency centre: A strategic Differentiator
Business Analytics Competency centre: A strategic Differentiator
 
BICC - A key element to your BI strategy
BICC - A key element to your BI strategyBICC - A key element to your BI strategy
BICC - A key element to your BI strategy
 
Center of Excellence Building Blocks
Center of Excellence Building BlocksCenter of Excellence Building Blocks
Center of Excellence Building Blocks
 
The Road to Becoming a Center of Excellence
The Road to Becoming a Center of ExcellenceThe Road to Becoming a Center of Excellence
The Road to Becoming a Center of Excellence
 

Ähnlich wie Hadoop - Past, Present and Future - v2.0

2013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.0
2013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.02013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.0
2013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.0Adam Muise
 
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarnBikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarnhdhappy001
 
Apache Hadoop YARN: Understanding the Data Operating System of Hadoop
Apache Hadoop YARN: Understanding the Data Operating System of HadoopApache Hadoop YARN: Understanding the Data Operating System of Hadoop
Apache Hadoop YARN: Understanding the Data Operating System of HadoopHortonworks
 
Combine SAS High-Performance Capabilities with Hadoop YARN
Combine SAS High-Performance Capabilities with Hadoop YARNCombine SAS High-Performance Capabilities with Hadoop YARN
Combine SAS High-Performance Capabilities with Hadoop YARNHortonworks
 
Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR)
Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR)Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR)
Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR)BigDataEverywhere
 
Hadoop 2 - Beyond MapReduce
Hadoop 2 - Beyond MapReduceHadoop 2 - Beyond MapReduce
Hadoop 2 - Beyond MapReduceUwe Printz
 
Tez: Accelerating Data Pipelines - fifthel
Tez: Accelerating Data Pipelines - fifthelTez: Accelerating Data Pipelines - fifthel
Tez: Accelerating Data Pipelines - fifthelt3rmin4t0r
 
How YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in HadoopHow YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in HadoopPOSSCON
 
Developing YARN Applications - Integrating natively to YARN July 24 2014
Developing YARN Applications - Integrating natively to YARN July 24 2014Developing YARN Applications - Integrating natively to YARN July 24 2014
Developing YARN Applications - Integrating natively to YARN July 24 2014Hortonworks
 
Introduction sur Tez par Olivier RENAULT de HortonWorks Meetup du 25/11/2014
Introduction sur Tez par Olivier RENAULT de HortonWorks Meetup du 25/11/2014Introduction sur Tez par Olivier RENAULT de HortonWorks Meetup du 25/11/2014
Introduction sur Tez par Olivier RENAULT de HortonWorks Meetup du 25/11/2014Modern Data Stack France
 
Welcome to Hadoop2Land!
Welcome to Hadoop2Land!Welcome to Hadoop2Land!
Welcome to Hadoop2Land!Uwe Printz
 
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...EMC
 
YARN - Presented At Dallas Hadoop User Group
YARN - Presented At Dallas Hadoop User GroupYARN - Presented At Dallas Hadoop User Group
YARN - Presented At Dallas Hadoop User GroupRommel Garcia
 
YARN - Next Generation Compute Platform fo Hadoop
YARN - Next Generation Compute Platform fo HadoopYARN - Next Generation Compute Platform fo Hadoop
YARN - Next Generation Compute Platform fo HadoopHortonworks
 

Ähnlich wie Hadoop - Past, Present and Future - v2.0 (20)

Huhadoop - v1.1
Huhadoop - v1.1Huhadoop - v1.1
Huhadoop - v1.1
 
2013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.0
2013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.02013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.0
2013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.0
 
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarnBikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
 
Apache Hadoop YARN: Understanding the Data Operating System of Hadoop
Apache Hadoop YARN: Understanding the Data Operating System of HadoopApache Hadoop YARN: Understanding the Data Operating System of Hadoop
Apache Hadoop YARN: Understanding the Data Operating System of Hadoop
 
Combine SAS High-Performance Capabilities with Hadoop YARN
Combine SAS High-Performance Capabilities with Hadoop YARNCombine SAS High-Performance Capabilities with Hadoop YARN
Combine SAS High-Performance Capabilities with Hadoop YARN
 
Yarn
YarnYarn
Yarn
 
MapR Unique features
MapR Unique featuresMapR Unique features
MapR Unique features
 
Yarns About Yarn
Yarns About YarnYarns About Yarn
Yarns About Yarn
 
Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR)
Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR)Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR)
Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR)
 
Hadoop 2 - Beyond MapReduce
Hadoop 2 - Beyond MapReduceHadoop 2 - Beyond MapReduce
Hadoop 2 - Beyond MapReduce
 
Tez: Accelerating Data Pipelines - fifthel
Tez: Accelerating Data Pipelines - fifthelTez: Accelerating Data Pipelines - fifthel
Tez: Accelerating Data Pipelines - fifthel
 
How YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in HadoopHow YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in Hadoop
 
Developing YARN Applications - Integrating natively to YARN July 24 2014
Developing YARN Applications - Integrating natively to YARN July 24 2014Developing YARN Applications - Integrating natively to YARN July 24 2014
Developing YARN Applications - Integrating natively to YARN July 24 2014
 
MHUG - YARN
MHUG - YARNMHUG - YARN
MHUG - YARN
 
Introduction sur Tez par Olivier RENAULT de HortonWorks Meetup du 25/11/2014
Introduction sur Tez par Olivier RENAULT de HortonWorks Meetup du 25/11/2014Introduction sur Tez par Olivier RENAULT de HortonWorks Meetup du 25/11/2014
Introduction sur Tez par Olivier RENAULT de HortonWorks Meetup du 25/11/2014
 
Yarnthug2014
Yarnthug2014Yarnthug2014
Yarnthug2014
 
Welcome to Hadoop2Land!
Welcome to Hadoop2Land!Welcome to Hadoop2Land!
Welcome to Hadoop2Land!
 
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...
 
YARN - Presented At Dallas Hadoop User Group
YARN - Presented At Dallas Hadoop User GroupYARN - Presented At Dallas Hadoop User Group
YARN - Presented At Dallas Hadoop User Group
 
YARN - Next Generation Compute Platform fo Hadoop
YARN - Next Generation Compute Platform fo HadoopYARN - Next Generation Compute Platform fo Hadoop
YARN - Next Generation Compute Platform fo Hadoop
 

Hadoop - Past, Present and Future - v2.0

  • 1. © 2014 Trace3, All rights reserved. BIG DATA INTELLIGENCE PRACTICE. HADOOP: PAST, PRESENT AND FUTURE
  • 2. Roadmap (~1 hour)
      1 - What Makes Up Hadoop 1.x?
      2 - What’s New In Hadoop 2.x?
      3 - The Future Of Hadoop …
  • 3. WHAT MAKES UP HADOOP 1.0?
  • 4. What’s a “Node”? A node, aka server: Compute, Storage, Operating System, Memory.
  • 5. Hadoop 1.0: HDFS + MapReduce. [Diagram: Client, NameNode, JobTracker, and four DataNode / TaskTracker nodes; items labeled 1-1, 1-2, 1-3.]
  • 6. Hadoop 1.0: HDFS + MapReduce. [Diagram: Client, NameNode, JobTracker, and four DataNode / TaskTracker nodes; items labeled 1-1 through 4-3; Map and Reduce stages.]
  • 7. MapReduce v1 Limitations
      – Scalability: Maximum cluster size is 4,000 nodes and maximum concurrent tasks is 40,000
      – Availability: JobTracker failure kills all queued and running jobs
      – Resources Partitioned into Map and Reduce: Hard partitioning of Map and Reduce slots led to low resource utilization
      – No Support for Alternate Paradigms / Services: Only MapReduce batch jobs, nothing else
  • 8. Hadoop 1.0: Single Use System (Batch Apps). Stack: HDFS (redundant, reliable storage); MapReduce (cluster resource management and data processing); Pig; Hive.
  • 9. WHAT’S NEW IN HADOOP 2.0?
  • 10. YARN: Yet Another Resource Negotiator. YARN Replaces MapReduce. YARN will be the de facto distributed operating system for Big Data.
  • 11. YARN: No Longer Just Batch Apps. Store DATA in one place. Interact with that data in MULTIPLE WAYS with Predictable Performance and Quality of Service. Applications Run Natively IN Hadoop: BATCH (MapReduce), INTERACTIVE (Tez), ONLINE (HBase), STREAMING (DataTorrent), GRAPH (Giraph), on YARN (cluster resource management) and HDFS2 (redundant, reliable storage).
  • 12. YARN: Applications. Running all on the same Hadoop cluster to give applications access to all the same source data! Labels shown: MapReduce v2, Stream Processing, Master-Worker, Online, In-Memory, Apache Storm.
  • 13. YARN: Quickly Maturing. [Timeline axis: 2010, 2011, 2012, 2013, 2014, Today.] Milestones shown: Conceived at Yahoo!; Alpha Releases – 2.0; Beta Releases – 2.1; GA Released – 2.2; 100,000+ nodes, 400,000+ jobs daily; 10 million+ hours of compute daily; Version 2.3; Version 2.4.
  • 14. YARN: Dr. Evil Approved
  • 15. YARN: What Has Changed? (YARN component → MRv1 counterpart)
      – ResourceManager (RM) and ApplicationMaster (AM) → JobTracker (JT)
      – Scheduler → Scheduler
      – NodeManager (NM) → TaskTracker (TT)
      – Container → Map & Reduce Slot
      [Diagram: the YARN side shows a ResourceManager with a Scheduler, and NodeManagers hosting an ApplicationMaster and Containers; the MRv1 side shows a JobTracker with a Scheduler, and TaskTrackers with Map and Reduce slots.]
  • 16. The 6 Benefits Of YARN
      – Scale
      – New programming models and services
      – Improved cluster utilization
      – Agility
      – Backwards compatible with MapReduce v1
      – Mixed workloads on the same source of data
  • 17. THE FUTURE OF HADOOP
  • 18. SQL on Hadoop
      – Speed: Deliver interactive query performance.
      – SQL: Support an array of SQL semantics for analytic applications running against Hadoop.
      – Scale: SQL interface to Hadoop designed for queries that scale from Terabytes to Petabytes.
  • 19. SQL on Hadoop (engine: distribution)
      – Hive on Apache Tez: Hortonworks HDP2
      – Hive on Apache Spark: Cloudera CDH5
      – Apache Drill: MapR M7
      – Cloudera Impala: Cloudera CDH5
      – Pivotal HAWQ: Pivotal Big Data Suite
  • 20. HOYA: HBase (NoSQL) on YARN
      – Dynamic Scaling: On-demand cluster size. Increase and decrease the size with load.
      – Easier Deployment: APIs to create, start, stop and delete HBase clusters.
      – Availability: Recover from Region Server loss with a new container.
  • 21. Microsoft REEF (Retainable Evaluator Execution Framework)
      – Machine Learning: Framework well suited for building machine learning jobs.
      – Scalable / Fault Tolerant: Makes it easy to implement scalable, fault-tolerant runtime environments for a range of computational models.
      – Maintain State: Users can build jobs that utilize data from where it’s needed and also maintain state after jobs are done.
  • 22. Heterogeneous Storage. THEN: NameNode → Storage. NOW: NameNode → SATA, SSD, Fusion IO.
  • 23. Hadoop Roadmap
      – Apache Hadoop 2.5 (Q3 2014): NodeManager Restart w/o disruption; Dynamic Resource Configuration
      – Apache Hadoop 2.6 (Q4 2014): Memory As Storage Tier; Support For Docker Containers
  • 24. HADOOP: PAST, PRESENT & FUTURE. I KNOW YOU HAVE QUESTIONS. NO SUCH THING AS A STUPID QUESTION.
  • 25. ONE LAST THING … SD Big Data Meetup: meetup.com/sdbigdata. 2nd Wednesday Of The Month. Next: August 13th @ 5:45P
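
Slide 7 attributes low utilization in MapReduce v1 to the hard split between Map and Reduce slots, and slide 15 shows YARN replacing those slots with generic containers. The sketch below is a rough, hypothetical illustration of that difference and is not taken from the deck. It assumes uniform task sizes and a fixed number of slots or containers, whereas real YARN containers are sized by resource requests; the function names and the numbers are invented for the example and are not Hadoop APIs.

```python
# Toy model only: real YARN containers are sized by memory/CPU requests,
# and the function names below are hypothetical, not Hadoop APIs.


def slot_utilization(map_slots, reduce_slots, map_tasks, reduce_tasks):
    """MRv1-style: map tasks may only use map slots, reduce tasks only reduce slots."""
    busy = min(map_tasks, map_slots) + min(reduce_tasks, reduce_slots)
    return busy / (map_slots + reduce_slots)


def container_utilization(containers, map_tasks, reduce_tasks):
    """YARN-style (simplified): any runnable task may use any free container."""
    busy = min(map_tasks + reduce_tasks, containers)
    return busy / containers


if __name__ == "__main__":
    # A map-heavy moment: 18 map tasks and 2 reduce tasks are runnable.
    print(slot_utilization(10, 10, 18, 2))
    print(container_utilization(20, 18, 2))
```

With these made-up numbers, the slot model keeps 12 of 20 slots busy because 8 reduce slots sit idle while 8 map tasks wait, whereas the container model can run all 20 tasks at once.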