SlideShare ist ein Scribd-Unternehmen logo
1 von 31
1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Apache Hadoop YARN:
Past, Present and
Future
Dublin, April 2016
Varun Vasudev
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
About myself
⬢ Apache Hadoop contributor since 2014
⬢ Apache Hadoop committer
⬢ Currently working for Hortonworks
⬢ vvasudev@apache.org
3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Introduction to Apache Hadoop YARN
YARN: Data Operating System
(Cluster Resource Management)
1 ° ° ° ° ° ° °
° ° ° ° ° ° ° °
Script
Pig
SQL
Hive
TezTez
Java
Scala
Cascading
Tez
° °
° °
° ° ° ° °
° ° ° ° °
Others
ISV
Engines
HDFS
(Hadoop Distributed File System)
Stream
Storm
Search
Solr
NoSQL
HBase
Accumulo
Slider Slider
BATCH, INTERACTIVE & REAL-TIME DATA ACCESS
In-Memory
Spark
YARN
The Architectural
Center of Hadoop
• Common data platform, many applications
• Support multi-tenant access & processing
• Batch, interactive & real-time use cases
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Introduction to Apache Hadoop YARN
⬢ Architectural center of big data workloads
⬢ Enterprise adoption accelerating
–Secure mode becoming more widespread
–Multi-tenant support
–Diverse workloads
⬢ SLAs
–Tolerance for slow running jobs decreasing
–Consistent performance desired
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Past – Apache Hadoop 2.6, 2.7
6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Apache Hadoop YARN
ResourceManager
(active)
ResourceManager
(standby)
NodeManager1
NodeManager2
NodeManager3
NodeManager4
Resources: 128G, 16 vcores
Label: SAS
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Scheduler
Inter queue pre-emption
Application
Queue B – 25%
Queue C – 25%
Label: SAS (exclusive)
Queue A – 50%
FIFO
ResourceManager
(active)
Application, Queue A, 4G, 1 vcore
Reservation for application
User
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Node 1
NodeManager128G, 16 vcores
Launch Applicaton 1 AMAM process
Launch AM process via
ContainerExecutor – DCE, LCE, WSCE.
Monitor/isolate memory and cpu
Application Lifecycle
ResourceManager
(active)
Request containers
Allocate containers
Container 1 process
Container 2 process
Launch containers on node using
DCE, LCE, WSCE. Monitor/isolate
memory and cpu
History Server(ATS – leveldb,
JHS - HDFS)
HDFS
Log aggregation
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Operational support
⬢ Support added for work preserving restarts in the RM and the NM
⬢ Support added for rolling upgrades and downgrades from 2.6 onwards
1
0
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Recent releases
⬢ 2.6 and 2.7 maintenance releases are carried out
–Only blockers and critical fixes are added
⬢ Apache Hadoop 2.7
–2.7.3 should be out soon
–2.7.2 released in January, 2016
–2.7.1 released in July, 2015
⬢ Apache Hadoop 2.6
–2.6.4 released in February, 2016
–2.6.3 released in December, 2015
–2.6.2 released in October, 2015
1
1
© Hortonworks Inc. 2011 – 2016. All Rights Reserved1
1
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Present – Apache Hadoop 2.8
1
2
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
YARN
ResourceManager
(active)
ResourceManager
(standby)
NodeManager1
NodeManager2
NodeManager3
NodeManager4
Resources: 128G, 16 vcores
Auto-calculate node resources
Label: SAS
Dynamic NodeManager
resource configuration
1
3
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
NodeManager resource management
⬢ Options to report NM resources based on node hardware
–YARN-160
–Restart of the NM required to enable feature
⬢ Alternatively, admins can use the rmadmin command to update the node’s resources
–YARN-291
–Looks at the dynamic-resource.xml
–No restart of the NM or the RM required
1
4
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
YARN Scheduler
Inter queue pre-emption
Improvements to pre-emption
Application
Queue B – 25%
Queue C – 25%
Label: SAS (non-exclusive)
Queue A – 50%
Priority/FIFO, Fair
ResourceManager
(active)
Application, Queue A, 4G, 1 vcore
Support for application priority
Reservation for application
Support for cost based placement
agent
User
1
5
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Scheduler
⬢ Support for application priority within a queue
–YARN-1963
–Users can specify application priority
–Specified as an integer, higher number is higher priority
–Application priority can be updated while it’s running
⬢ Improvements to reservations
–YARN-2572
–Support for cost based placement agent added in addition to greedy
⬢ Queue allocation policy can be switched to fair sharing
–YARN-3319
–Containers allocated on a fair share basis instead of FIFO
1
6
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Scheduler
⬢ Support for non-exclusive node labels
–YARN-3214
–Improvement over partition that existed earlier
–Better for cluster utilization
⬢ Improvements to pre-emption
1
7
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Node 1
NodeManager
Support added for graceful
decomissioning
128G, 16 vcores
Launch Applicaton 1 AMAM process/Docker container(alpha)
Launch AM via ContainerExecutor –
DCE, LCE, WSCE. Monitor/isolate
memory and cpu. Support added for
disk and network isolation via
CGroups(alpha)
Application Lifecycle
ResourceManager
(active)
Request containers
Allocate containers
Support added to resize containers. Container 1 process/Docker
container(alpha)
Container 2 process/Docker
container(alpha)
Launch containers on node using DCE,
LCE, WSCE. Monitor/isolate memory and
cpu. Support added for disk and network
isolation using Cgroups(alpha).
History Server(ATS 1.5– leveldb
+ HDFS, JHS - HDFS)
HDFS
Log aggregation
1
8
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Application Lifecycle
⬢ Graceful decommissioning of NodeManagers
–YARN-914
–Drains a node that’s being decommissioned to allow running containers to finish
⬢ Resource isolation support for disk and network
–YARN-2619, YARN-2140
–Containers get a fair share of disk and network resources using CGroups
–Alpha feature
⬢ Docker support in LinuxContainerExecutor
–YARN-3853
–Support to launch Docker containers alongside process containers
–Alpha feature
–Talk by Sidharta Seethana at 12:20 tomorrow in Liffey A
1
9
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Application Lifecycle
⬢ Support for container resizing
–YARN-1197
–Allows applications to change the size of an existing container
⬢ ATS 1.5
–YARN-4233
–Store timeline events on HDFS
–Better scalability and reliability
2
0
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Operational support
⬢ Improvements to existing tools(like yarn logs)
⬢ New tools added(yarn top)
⬢ Improvements to the RM UI to expose more details about running applications
2
1
© Hortonworks Inc. 2011 – 2016. All Rights Reserved2
1
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Future
2
2
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Drivers for changes
⬢ Changing workload types
–Workloads have moved from batch to batch + interactive
–Workloads will change to batch + interactive + services
⬢ Big data workloads continue to evolve
–Spark on YARN the most popular way to run Spark in production
⬢ Containerization has taken off
–Docker becoming extremely popular
⬢ Improve ease of operations
–Easier to debug application failures/poor performance
–Make overall cluster management easier
–Improve existing tools such as yarn logs, yarn top, etc
2
3
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Apache Hadoop YARN
ResourceManager
(active)
ResourceManager
(standby)
NodeManager1
NodeManager2
NodeManager3
NodeManager4
Resources: 128G, 16 vcores
Add support for arbitrary resource types
Label: SAS
Add support for
federation – allow YARN
to scale
New RM UI
2
4
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Future work
⬢ Support for arbitrary resource types and resource profiles
–YARN-3926
–Admins can add arbitrary resource types for scheduling
–Users can specify resource profile name instead of individual resources
⬢ YARN federation
–YARN-2915
–Allows YARN to scale out to tens of thousands of nodes
–Cluster of clusters which appear as a single cluster to an end user
⬢ New RM UI
–YARN-3368
–Enhanced usability
–Easier to add new features
2
5
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Scheduler
Inter queue pre-emption
Support for intra queue pre-emption
Application
Queue B – 25%
Queue C – 25%
Label: SAS (non-exclusive)
Queue A – 50%
Priority/FIFO, Fair
ResourceManager
(active)
Application, Queue A
Add support for resource profiles
Reservation for application
User
New scheduler API
Schedule based on actual resource usage
2
6
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Future work
⬢ New scheduler features
–YARN-4902
–Support richer placement strategies such as affinity, anti-affinity
⬢ Support pre-emption within a queue
–YARN-4781
⬢ More improvements to pre-emption
–YARN-4108, YARN-4390
⬢ Scheduling based on actual resource usage
–YARN-1011
–Nodes report actual memory and cpu usage to the scheduler to make better decisions
2
7
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Node 1
NodeManager
Add distributed scheduling
128G, 16 vcores
Launch Applicaton 1 AMAM process/Docker container
Launch AM process via
ContainerExecutor – DCE, LCE, WSCE.
Monitor/isolate memory and cpu.
Support for disk and network isolation
Application Lifecycle
ResourceManager
(active)
Request containers
Allocate containers
New scheduler API to allow far more
powerful placement strategies
Container 1 process/Docker
container. Support container restart.
Container 2 process/Docker
container. Support container restart.
Launch containers on node using DCE,
LCE, WSCE. Monitor/isolate memory and
cpu. Support for disk and network
isolation.
History Server(ATS v2 - HBase,
JHS - HDFS)
HDFS
Log aggregation
DNS sevice
2
8
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Future work
⬢ Distributed scheduling
–YARN-2877, YARN-4742
–NMs run a local scheduler
–Allows faster scheduling turnaround
⬢ Better support for disk and network isolation
–Tied to supporting arbitrary resource types
⬢ Enhance Docker support
–YARN-3611
–Support to mount volumes
–Isolate containers using CGroups
2
9
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Future work – support for services
⬢ YARN-4692
⬢ Container restart
–YARN-3988
–Allow container restart without losing allocation
⬢ Service discovery via DNS
–YARN-4757
–Running services can be discovered via DNS
⬢ Allocation re-use
–YARN-4726
–Allow AMs to stop a container but not lose resources on the node
–Required for application upgrades
3
0
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Future work
⬢ ATS v2
–YARN-2928
–Run timeline service on Hbase
–Support for more data, better performance
⬢ Also in the pipeline
–Switch to Java 8 with Hadoop 3.0
–Add support for GPU isolation
–Better tools to detect limping nodes
3
1
© Hortonworks Inc. 2011 – 2016. All Rights Reserved3
1
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Thank you!

Weitere ähnliche Inhalte

Was ist angesagt?

Apache Hadoop YARN: best practices
Apache Hadoop YARN: best practicesApache Hadoop YARN: best practices
Apache Hadoop YARN: best practices
DataWorks Summit
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
DataWorks Summit
 
Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...
Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...
Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...
Simplilearn
 

Was ist angesagt? (20)

Apache Hadoop 3.0 What's new in YARN and MapReduce
Apache Hadoop 3.0 What's new in YARN and MapReduceApache Hadoop 3.0 What's new in YARN and MapReduce
Apache Hadoop 3.0 What's new in YARN and MapReduce
 
Apache Hadoop YARN: best practices
Apache Hadoop YARN: best practicesApache Hadoop YARN: best practices
Apache Hadoop YARN: best practices
 
Hadoop 3.0 features
Hadoop 3.0 featuresHadoop 3.0 features
Hadoop 3.0 features
 
Streamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache AmbariStreamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache Ambari
 
Cloudy with a chance of Hadoop - DataWorks Summit 2017 San Jose
Cloudy with a chance of Hadoop - DataWorks Summit 2017 San JoseCloudy with a chance of Hadoop - DataWorks Summit 2017 San Jose
Cloudy with a chance of Hadoop - DataWorks Summit 2017 San Jose
 
Apache Hadoop YARN 2015: Present and Future
Apache Hadoop YARN 2015: Present and FutureApache Hadoop YARN 2015: Present and Future
Apache Hadoop YARN 2015: Present and Future
 
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
Apache Phoenix and HBase: Past, Present and Future of SQL over HBaseApache Phoenix and HBase: Past, Present and Future of SQL over HBase
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
 
Application Timeline Server - Past, Present and Future
Application Timeline Server - Past, Present and FutureApplication Timeline Server - Past, Present and Future
Application Timeline Server - Past, Present and Future
 
Query Engines for Hive: MR, Spark, Tez with LLAP – Considerations!
Query Engines for Hive: MR, Spark, Tez with LLAP – Considerations!Query Engines for Hive: MR, Spark, Tez with LLAP – Considerations!
Query Engines for Hive: MR, Spark, Tez with LLAP – Considerations!
 
State of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache ZeppelinState of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache Zeppelin
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
 
Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...
Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...
Hadoop YARN | Hadoop YARN Architecture | Hadoop YARN Tutorial | Hadoop Tutori...
 
Yarn
YarnYarn
Yarn
 
Running Non-MapReduce Big Data Applications on Apache Hadoop
Running Non-MapReduce Big Data Applications on Apache HadoopRunning Non-MapReduce Big Data Applications on Apache Hadoop
Running Non-MapReduce Big Data Applications on Apache Hadoop
 
Jun 2017 HUG: YARN Scheduling – A Step Beyond
Jun 2017 HUG: YARN Scheduling – A Step BeyondJun 2017 HUG: YARN Scheduling – A Step Beyond
Jun 2017 HUG: YARN Scheduling – A Step Beyond
 
Tuning Apache Ambari performance for Big Data at scale with 3000 agents
Tuning Apache Ambari performance for Big Data at scale with 3000 agentsTuning Apache Ambari performance for Big Data at scale with 3000 agents
Tuning Apache Ambari performance for Big Data at scale with 3000 agents
 
Ozone- Object store for Apache Hadoop
Ozone- Object store for Apache HadoopOzone- Object store for Apache Hadoop
Ozone- Object store for Apache Hadoop
 
YARN and the Docker container runtime
YARN and the Docker container runtimeYARN and the Docker container runtime
YARN and the Docker container runtime
 
NextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduceNextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduce
 
Enabling Diverse Workload Scheduling in YARN
Enabling Diverse Workload Scheduling in YARNEnabling Diverse Workload Scheduling in YARN
Enabling Diverse Workload Scheduling in YARN
 

Ähnlich wie Apache Hadoop YARN: Past, Present and Future

Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
DataWorks Summit
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
DataWorks Summit
 
YARN - Past, Present, & Future
YARN - Past, Present, & FutureYARN - Past, Present, & Future
YARN - Past, Present, & Future
DataWorks Summit
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
DataWorks Summit
 
Accumulo Summit 2016: Apache Accumulo on Docker with YARN Native Services
Accumulo Summit 2016: Apache Accumulo on Docker with YARN Native ServicesAccumulo Summit 2016: Apache Accumulo on Docker with YARN Native Services
Accumulo Summit 2016: Apache Accumulo on Docker with YARN Native Services
Accumulo Summit
 
Apache Hadoop YARN: State of the Union
Apache Hadoop YARN: State of the UnionApache Hadoop YARN: State of the Union
Apache Hadoop YARN: State of the Union
DataWorks Summit
 

Ähnlich wie Apache Hadoop YARN: Past, Present and Future (20)

Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
 
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The UnionDataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
 
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...
 
Big data spain keynote nov 2016
Big data spain keynote nov 2016Big data spain keynote nov 2016
Big data spain keynote nov 2016
 
Hadoop & cloud storage object store integration in production (final)
Hadoop & cloud storage  object store integration in production (final)Hadoop & cloud storage  object store integration in production (final)
Hadoop & cloud storage object store integration in production (final)
 
YARN - Past, Present, & Future
YARN - Past, Present, & FutureYARN - Past, Present, & Future
YARN - Past, Present, & Future
 
Hadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionHadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in Production
 
Hadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionHadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in Production
 
Scheduling Policies in YARN
Scheduling Policies in YARNScheduling Policies in YARN
Scheduling Policies in YARN
 
Hadoop Summit - Scheduling policies in YARN - San Jose 2016
Hadoop Summit - Scheduling policies in YARN - San Jose 2016Hadoop Summit - Scheduling policies in YARN - San Jose 2016
Hadoop Summit - Scheduling policies in YARN - San Jose 2016
 
Hive edw-dataworks summit-eu-april-2017
Hive edw-dataworks summit-eu-april-2017Hive edw-dataworks summit-eu-april-2017
Hive edw-dataworks summit-eu-april-2017
 
An Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseAn Apache Hive Based Data Warehouse
An Apache Hive Based Data Warehouse
 
Apache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration storyApache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration story
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
 
Accumulo Summit 2016: Apache Accumulo on Docker with YARN Native Services
Accumulo Summit 2016: Apache Accumulo on Docker with YARN Native ServicesAccumulo Summit 2016: Apache Accumulo on Docker with YARN Native Services
Accumulo Summit 2016: Apache Accumulo on Docker with YARN Native Services
 
Apache Hadoop YARN: State of the Union
Apache Hadoop YARN: State of the UnionApache Hadoop YARN: State of the Union
Apache Hadoop YARN: State of the Union
 
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and FutureHadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
 
Cloudy with a chance of Hadoop - real world considerations
Cloudy with a chance of Hadoop - real world considerationsCloudy with a chance of Hadoop - real world considerations
Cloudy with a chance of Hadoop - real world considerations
 

Mehr von DataWorks Summit/Hadoop Summit

How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient
DataWorks Summit/Hadoop Summit
 
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS HadoopBreaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
DataWorks Summit/Hadoop Summit
 

Mehr von DataWorks Summit/Hadoop Summit (20)

Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in ProductionRunning Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
 
Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerUnleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache Ranger
 
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformEnabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science Platform
 
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and ZeppelinRevolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
 
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDouble Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSense
 
Hadoop Crash Course
Hadoop Crash CourseHadoop Crash Course
Hadoop Crash Course
 
Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Apache Spark Crash Course
Apache Spark Crash CourseApache Spark Crash Course
Apache Spark Crash Course
 
Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
 
Schema Registry - Set you Data Free
Schema Registry - Set you Data FreeSchema Registry - Set you Data Free
Schema Registry - Set you Data Free
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
 
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLMool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and ML
 
How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient
 
HBase in Practice
HBase in Practice HBase in Practice
HBase in Practice
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)
 
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS HadoopBreaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
 
Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop
 
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesScaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
 

Kürzlich hochgeladen

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 

Apache Hadoop YARN: Past, Present and Future

  • 1. 1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Apache Hadoop YARN: Past, Present and Future Dublin, April 2016 Varun Vasudev
  • 2. 2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved About myself ⬢ Apache Hadoop contributor since 2014 ⬢ Apache Hadoop committer ⬢ Currently working for Hortonworks ⬢ vvasudev@apache.org
  • 3. 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Introduction to Apache Hadoop YARN YARN: Data Operating System (Cluster Resource Management) 1 ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° Script Pig SQL Hive TezTez Java Scala Cascading Tez ° ° ° ° ° ° ° ° ° ° ° ° ° ° Others ISV Engines HDFS (Hadoop Distributed File System) Stream Storm Search Solr NoSQL HBase Accumulo Slider Slider BATCH, INTERACTIVE & REAL-TIME DATA ACCESS In-Memory Spark YARN The Architectural Center of Hadoop • Common data platform, many applications • Support multi-tenant access & processing • Batch, interactive & real-time use cases
  • 4. 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Introduction to Apache Hadoop YARN ⬢ Architectural center of big data workloads ⬢ Enterprise adoption accelerating –Secure mode becoming more widespread –Multi-tenant support –Diverse workloads ⬢ SLAs –Tolerance for slow running jobs decreasing –Consistent performance desired
  • 5. 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Past – Apache Hadoop 2.6, 2.7
  • 6. 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Apache Hadoop YARN ResourceManager (active) ResourceManager (standby) NodeManager1 NodeManager2 NodeManager3 NodeManager4 Resources: 128G, 16 vcores Label: SAS
  • 7. 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Scheduler Inter queue pre-emption Application Queue B – 25% Queue C – 25% Label: SAS (exclusive) Queue A – 50% FIFO ResourceManager (active) Application, Queue A, 4G, 1 vcore Reservation for application User
  • 8. 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Node 1 NodeManager128G, 16 vcores Launch Applicaton 1 AMAM process Launch AM process via ContainerExecutor – DCE, LCE, WSCE. Monitor/isolate memory and cpu Application Lifecycle ResourceManager (active) Request containers Allocate containers Container 1 process Container 2 process Launch containers on node using DCE, LCE, WSCE. Monitor/isolate memory and cpu History Server(ATS – leveldb, JHS - HDFS) HDFS Log aggregation
  • 9. 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Operational support ⬢ Support added for work preserving restarts in the RM and the NM ⬢ Support added for rolling upgrades and downgrades from 2.6 onwards
  • 10. 1 0 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Recent releases ⬢ 2.6 and 2.7 maintenance releases are carried out –Only blockers and critical fixes are added ⬢ Apache Hadoop 2.7 –2.7.3 should be out soon –2.7.2 released in January, 2016 –2.7.1 released in July, 2015 ⬢ Apache Hadoop 2.6 –2.6.4 released in February, 2016 –2.6.3 released in December, 2015 –2.6.2 released in October, 2015
  • 11. 1 1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved1 1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Present – Apache Hadoop 2.8
  • 12. 1 2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved YARN ResourceManager (active) ResourceManager (standby) NodeManager1 NodeManager2 NodeManager3 NodeManager4 Resources: 128G, 16 vcores Auto-calculate node resources Label: SAS Dynamic NodeManager resource configuration
  • 13. 1 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved NodeManager resource management ⬢ Options to report NM resources based on node hardware –YARN-160 –Restart of the NM required to enable feature ⬢ Alternatively, admins can use the rmadmin command to update the node’s resources –YARN-291 –Looks at the dynamic-resource.xml –No restart of the NM or the RM required
  • 14. 1 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved YARN Scheduler Inter queue pre-emption Improvements to pre-emption Application Queue B – 25% Queue C – 25% Label: SAS (non-exclusive) Queue A – 50% Priority/FIFO, Fair ResourceManager (active) Application, Queue A, 4G, 1 vcore Support for application priority Reservation for application Support for cost based placement agent User
  • 15. 1 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Scheduler ⬢ Support for application priority within a queue –YARN-1963 –Users can specify application priority –Specified as an integer, higher number is higher priority –Application priority can be updated while it’s running ⬢ Improvements to reservations –YARN-2572 –Support for cost based placement agent added in addition to greedy ⬢ Queue allocation policy can be switched to fair sharing –YARN-3319 –Containers allocated on a fair share basis instead of FIFO
  • 16. 1 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Scheduler ⬢ Support for non-exclusive node labels –YARN-3214 –Improvement over partition that existed earlier –Better for cluster utilization ⬢ Improvements to pre-emption
  • 17. 1 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Node 1 NodeManager Support added for graceful decomissioning 128G, 16 vcores Launch Applicaton 1 AMAM process/Docker container(alpha) Launch AM via ContainerExecutor – DCE, LCE, WSCE. Monitor/isolate memory and cpu. Support added for disk and network isolation via CGroups(alpha) Application Lifecycle ResourceManager (active) Request containers Allocate containers Support added to resize containers. Container 1 process/Docker container(alpha) Container 2 process/Docker container(alpha) Launch containers on node using DCE, LCE, WSCE. Monitor/isolate memory and cpu. Support added for disk and network isolation using Cgroups(alpha). History Server(ATS 1.5– leveldb + HDFS, JHS - HDFS) HDFS Log aggregation
  • 18. 1 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Application Lifecycle ⬢ Graceful decommissioning of NodeManagers –YARN-914 –Drains a node that’s being decommissioned to allow running containers to finish ⬢ Resource isolation support for disk and network –YARN-2619, YARN-2140 –Containers get a fair share of disk and network resources using CGroups –Alpha feature ⬢ Docker support in LinuxContainerExecutor –YARN-3853 –Support to launch Docker containers alongside process containers –Alpha feature –Talk by Sidharta Seethana at 12:20 tomorrow in Liffey A
  • 19. 1 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Application Lifecycle ⬢ Support for container resizing –YARN-1197 –Allows applications to change the size of an existing container ⬢ ATS 1.5 –YARN-4233 –Store timeline events on HDFS –Better scalability and reliability
  • 20. 2 0 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Operational support ⬢ Improvements to existing tools(like yarn logs) ⬢ New tools added(yarn top) ⬢ Improvements to the RM UI to expose more details about running applications
  • 21. 2 1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved2 1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Future
  • 22. 2 2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Drivers for changes ⬢ Changing workload types –Workloads have moved from batch to batch + interactive –Workloads will change to batch + interactive + services ⬢ Big data workloads continue to evolve –Spark on YARN the most popular way to run Spark in production ⬢ Containerization has taken off –Docker becoming extremely popular ⬢ Improve ease of operations –Easier to debug application failures/poor performance –Make overall cluster management easier –Improve existing tools such as yarn logs, yarn top, etc
  • 23. 2 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Apache Hadoop YARN ResourceManager (active) ResourceManager (standby) NodeManager1 NodeManager2 NodeManager3 NodeManager4 Resources: 128G, 16 vcores Add support for arbitrary resource types Label: SAS Add support for federation – allow YARN to scale New RM UI
  • 24. 2 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Future work ⬢ Support for arbitrary resource types and resource profiles –YARN-3926 –Admins can add arbitrary resource types for scheduling –Users can specify resource profile name instead of individual resources ⬢ YARN federation –YARN-2915 –Allows YARN to scale out to tens of thousands of nodes –Cluster of clusters which appear as a single cluster to an end user ⬢ New RM UI –YARN-3368 –Enhanced usability –Easier to add new features
  • 25. 2 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Scheduler Inter queue pre-emption Support for intra queue pre-emption Application Queue B – 25% Queue C – 25% Label: SAS (non-exclusive) Queue A – 50% Priority/FIFO, Fair ResourceManager (active) Application, Queue A Add support for resource profiles Reservation for application User New scheduler API Schedule based on actual resource usage
  • 26. 2 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Future work ⬢ New scheduler features –YARN-4902 –Support richer placement strategies such as affinity, anti-affinity ⬢ Support pre-emption within a queue –YARN-4781 ⬢ More improvements to pre-emption –YARN-4108, YARN-4390 ⬢ Scheduling based on actual resource usage –YARN-1011 –Nodes report actual memory and cpu usage to the scheduler to make better decisions
  • 27. 2 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Node 1 NodeManager Add distributed scheduling 128G, 16 vcores Launch Applicaton 1 AMAM process/Docker container Launch AM process via ContainerExecutor – DCE, LCE, WSCE. Monitor/isolate memory and cpu. Support for disk and network isolation Application Lifecycle ResourceManager (active) Request containers Allocate containers New scheduler API to allow far more powerful placement strategies Container 1 process/Docker container. Support container restart. Container 2 process/Docker container. Support container restart. Launch containers on node using DCE, LCE, WSCE. Monitor/isolate memory and cpu. Support for disk and network isolation. History Server(ATS v2 - HBase, JHS - HDFS) HDFS Log aggregation DNS sevice
  • 28. 2 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Future work ⬢ Distributed scheduling –YARN-2877, YARN-4742 –NMs run a local scheduler –Allows faster scheduling turnaround ⬢ Better support for disk and network isolation –Tied to supporting arbitrary resource types ⬢ Enhance Docker support –YARN-3611 –Support to mount volumes –Isolate containers using CGroups
  • 29. 2 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Future work – support for services ⬢ YARN-4692 ⬢ Container restart –YARN-3988 –Allow container restart without losing allocation ⬢ Service discovery via DNS –YARN-4757 –Running services can be discovered via DNS ⬢ Allocation re-use –YARN-4726 –Allow AMs to stop a container but not lose resources on the node –Required for application upgrades
  • 30. 3 0 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Future work ⬢ ATS v2 –YARN-2928 –Run timeline service on Hbase –Support for more data, better performance ⬢ Also in the pipeline –Switch to Java 8 with Hadoop 3.0 –Add support for GPU isolation –Better tools to detect limping nodes
  • 31. 3 1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved3 1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Thank you!