SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Downloaden Sie, um offline zu lesen
1© 2016 Pivotal Software, Inc. All rights reserved. 1© 2016 Pivotal Software, Inc. All rights reserved.
Understanding	YARN	
and	HAWQ	Resource	Mgmt
Dan	Baskette
Director	of	Technical	Marketing
Apache HAWQ
2© 2016 Pivotal Software, Inc. All rights reserved.
Agenda
Basics	of	Hadoop	YARN
HAWQ	&	YARN	integration
Demo
Q&A
3© 2016 Pivotal Software, Inc. All rights reserved.
What	is	YARN
● YARN	Basics
● Why	do	you	care?
● YARN	Components	and	Architecture
4© 2016 Pivotal Software, Inc. All rights reserved.
YARN	101
● Why	YARN?	(limitation	of	mapreduce	1)
○ Multi-tenancy	(spark,	hbase,tez,accumulo,hawq...)
○ Resource	utilization	(SLA)
○ Scalability	(throttling	resource	dynamically)
○ Compatibility
● Cluster	resource	management
○ Global	resource	allocation
○ Local	resource	assignment	and	reporting
● What	resources	(resource	model)
○ Heterogeneous	servers/racks	
○ CPUs
○ Memory
5© 2016 Pivotal Software, Inc. All rights reserved.
How	Yarn	works
● Yarn	components
○ Resource	manager:	master	for	managing	global
resources
○ Node	managers:	master	for	local resource	
allocations.	
○ Application	master:	per	application,	manages	
application	life	cycle	and	task	scheduling.
○ Container:		abstracted	set	of	resources (cpu	
core/memory)	granted	by	resource	manager	to	run	
a	task	of	application.	
● Yarn	application	life	cycle
6© 2016 Pivotal Software, Inc. All rights reserved.
Capacity	scheduler	and	queues
● Capacity	scheduler
○ Elasticity	with	Multi-tenancy
○ Security
○ Resource	awareness
○ Granular	scheduling
○ Locality
○ Scheduling	policies
● Queues
○ Hierarchical
■ Parent-queues	don't	accept	applications.	
■ Leaf-queues	have	no	child	queue	and	accept	
applications.
○ Access	controls
© Copyright 2013 Pivotal. All rights reserved.
Apache	HAWQ	Resource	Management	
and	Integration	with	YARN
8© 2016 Pivotal Software, Inc. All rights reserved.
Cluster	level
Global	
(YARN)
HAWQ
(Resource	Qs)
Query	
(Internal)
Cluster-Admin	defined
Hardware	efficiency
Share	with	MR/Hive/+
Defined	in	XML
HAWQ	Internal
HAWQ-Admin	defined
Multi-tenancy
Workload	prioritization
Defined	in	DDL
Query	level
System	defined
Query	Optimization
Operator	prioritization
Dynamic
Resource	Management	in	HAWQ
3-Tier	Resource	Management
© Copyright 2013 Pivotal. All rights reserved.
Apache	HAWQ	High-Level	Architecture
YARN NameNode
Resource Broker
libYARN
Resource Manager
Fault Tolerance
Service
Optimizer
Parser/ Analyzer
Dispatcher Catalog Service
HDFS Catalog Cache
Client
Client
Client
YARN
Node Manager
HDFS DataNode
Segment
Virtual SegmentVirtual SegmentVirtual SegmentVirtual Segment
YARN
Node Manager
HDFS DataNode
Segment
Virtual SegmentVirtual SegmentVirtual SegmentVirtual Segment
YARN
Node Manager
HDFS DataNode
Segment
Virtual SegmentVirtual SegmentVirtual SegmentVirtual Segment
External System
© Copyright 2013 Pivotal. All rights reserved.
Apache	HAWQ	Resource	Concepts
Global resource manager (GRM)
containers are assigned to one
HAWQ segment. Resource
granularity is usually as large as
GBs per container.YARN is
considered as one GRM.
Host
HAWQ Segment
HAWQ Segment manages the resource consumable for QEs.
One host has at most one segmentdeployed for one HAWQ cluster.
TotalCapacity
AvailCapacity
Used
One Virtual Segments (VSEG) are
allocated containing query resource for
a group of QEs. Resource granularity
is usually small, 256MB for example.
VSEG
VSEG
VSEG GRM Container
GRM Container
GRM Container
GRM Container
Large granularity,Low frequency resource negotiation
between Apache HAWQ and GRM server.
Small granularity,High frequency resource
negotiation between QD and Apache HAWQ RM.
© Copyright 2013 Pivotal. All rights reserved.
High-Level	Flow	of	Query	Execution
Parser DispatcherPlanner
query
Parse
tree plan
slice and
resource
quota
Executor
result
QD QE
Resource
Manager
query resource
YARN
containers
In case running in YARN mode
© Copyright 2013 Pivotal. All rights reserved.
▪ Apache HAWQ forks resource broker process to register itself as a YARN application to YARN Resource Manager.
▪ Resource broker process acts as an unmanaged application master.An unmanaged application master is
managed outside ofYARN resource manager.There isn’ta container launched for application master.
▪ A thread in resource broker process is created sending heartbeatto YARN resource manager on behalfof Apache
HAWQ to keep registered application active and alive.
▪ QEs are running on segments with Node Manager running.If a HAWQ segmenthas no Node Manager running,it
will be marked as DOWN by HAWQ and will not be used by resource pool.
▪ Apache HAWQ activates YARN containers by dummy processes once getting the allocated container list.
▪ Apache HAWQ enforces resource usage in all segments by itself.
Apache	HAWQ	in	YARN	Mode
© Copyright 2013 Pivotal. All rights reserved.
Configure	to	Integrate	Apache	HAWQ	with	YARN
▪ YARN configuration required by Apache HAWQ
▪ Capacity Scheduler is required
▪ Assign one queue exclusively to Apache HAWQ
▪ Configure hawq-site.xml to enable yarn mode and let Apache HAWQ connect to YARNProperty Definition
hawq_global_rm_type Apache HAWQ global resource manager type that indicates Apache HAWQ runs in corresponding modes.
‘yarn’ means enabling yarn mode.
hawq_rm_yarn_address YARN resource manager address including.
hawq_rm_yarn_scheduler_address YARN scheduler address.
hawq_rm_yarn_queue_name YARN queue name that used by Apache HAWQ resource manager.
hawq_rm_yarn_app_name Apache HAWQ YARN application name.
© Copyright 2013 Pivotal. All rights reserved.
Friendly	Apache	HAWQ	YARN	Resource	Return
Apache HAWQ resource manager friendly uses YARN allocated resource,which makes it flexibly co-exist with another
YARN applications
▪ Only acquire resource on-demand
▪ Configurable minimum water-level ofholding resource
▪ Pause allocating query resource and return YARN containers when resource manager overuses YARN queue and it
finds the YARN queue is busy
▪ Return resource when resource in-use water mark becomes lower
Time(s)
In-use
resource
Configurable window size
Resource in-use water mark
Current
© Copyright 2013 Pivotal. All rights reserved.
Even	Resource	Consumption	among	Segments
▪ Apache HAWQ resource manager tries its best to negotiate with global resource manager ( YARN for example) to get
evenly allocated resource
▪ Apache HAWQ resource manager tries its best to allocate VSEGs among segments to make all available segments
evenly utilized
▪ Preferred hosts are passed to global resource manager to acquire containers
▪ When Apache HAWQ resource manager decides to return containers, the segments having most allocated containers are
considered preferentially
▪ Combined workload ordered index is dynamically maintained in resource pool so that the segments having lower workload and
more available resource are considered in priority
▪ Round-robin strategy is adopted generally to ensure that for one query execution, all possible available segments are utilized, and
the segments have almost the same number of VSEGs
▪ Data-locality based VSEG allocation strategy is restricted in some cases
© Copyright 2013 Pivotal. All rights reserved.
Allocating	VSEGs	According	to	HDFS	Data	Locality
▪ Apache HAWQ holds target table’s data locality information that is passed to resource manager when
acquiring query resource.
▪ Apache HAWQ resource manager allocates VSEGs among segments if it is appropriate to follow data
locality information.
▪ It works for small scale queries requiring a few VSEGs only
▪ If a lot of VSEGs are required for a large scale query execution, this strategy is not adopted
▪ If the VSEG allocation intensifies the balanceof resource usageamong segments too much, this strategy is not adopted
© Copyright 2013 Pivotal. All rights reserved.
Apache	HAWQ	Resource	Queues
DDL	for	manipulating	resource	queues
CREATE RESOURCE QUEUE queue_name WITH (queue_attr_list);
ALTER RESOURCE QUEUE queue_name WITH (queue_attr_list);
DROP RESOURCE QUEUE queue_name;
queue_attr_list ::= queue_attr=value | queue_attr=value,queue_attr_listAttribute Value
PARENT* The parent queue’s name.
ACTIVE_STATEMENTS The limit of number of parallel active statements. Resource queue’s concurrency control.
MEMORY_LIMIT_CLUSTER* The percentage of memory resource consumable from its parent queue’s capacity.
CORE_LIMIT_CLUSTER* The percentage of virtual core resource consumable from its parent queue’s capacity. Currently, this value must be identical with
MEMORY_LIMIT_CLUSTER.
ALLOCATION_POLICY The resource allocation policy, currently, only ‘even’ policy is supported.
VSEG_RESOURCE_QUOTA The resource quota of one virtual segment (VSEG).
RESOURCE_OVERCOMMIT_FACTOR The factor of overusing resource from the other resource queues.
NVSEG_UPPER_LIMIT The absolute maximum number of VSEGs for one statement. Default value 0.
NVSEG_LOWER_LIMIT The absolute minimum number of VSEGs for one statement. Default value 0.
NVSEG_UPPER_LIMIT_PERSEG NVSEG_UPPER_LIMIT_PERSEG times available segment count derives the absolute maximum number of VSEGs for one statement. Default value 0.0.
NVSEG_LOWER_LIMIT_PERSEG NVSEG_LOWER_LIMIT_PERSEG times available segment count derives the absolute minimum number of VSEGs for one statement. Default value 0.0.
* The attributes must be specified when creating a resource queue.
© Copyright 2013 Pivotal. All rights reserved.
Leaf queue
pg_root
Tree-structured	Resource	Queues
memory_limit_cluster 100%
core_limit_cluster 100%
pg_default
memory_limit_cluster 50%
core_limit_cluster 50%
active_statements 20
allocation_policy even
vseg_resource_quota 256M
overcommit_factor 2
pg_dep1
memory_limit_cluster 50%
core_limit_cluster 50%
active_statements 100
allocation_policy even
vseg_resource_quota 512M
overcommit_factor 1
CREATE RESOURCE QUEUE pg_dep1 WITH(
PARENT=‘pg_root’,
ACTIVE_STATEMENTS=100,
VSEG_RESOURCE_QUOTA=‘mem:512mb’,
OVERCOMMIT_FACTOR=1
);
ALTER RESOURCE QUEUE pg_default WITH(
MEMORY_LIMIT_CLUSTER=20%,
CORE_LIMIT_CLUSTER=20%
);
memory_limit_cluster 20%
core_limit_cluster 20%
active_statements 20
allocation_policy even
vseg_resource_quota 256M
overcommit_factor 2
CREATE ROLE demorole RESOURCE QUEUE pg_dep1;
ALTER RESOURCE QUEUE pg_dep1 WITH(
MEMORY_LIMIT_CLUSTER=80%,
CORE_LIMIT_CLUSTER=80%,
OVERCOMMIT_FACTOR=2
);
SET ROLE demorole;
SELECT COUNT(*) FROM foo;
memory_limit_cluster 80%
core_limit_cluster 80%
active_statements 100
allocation_policy even
vseg_resource_quota 512M
overcommit_factor 2
80GB
20 CORE
40GB 10 CORE
MAX 80GB 20 CORE
40GB 10 CORE
MAX 40GB 10 CORE
16GB 4 CORE
MAX 32GB 8 CORE
64GB 16 CORE
MAX 80GB 20 CORE
Resource queues’ capacities are automatically estimated based on dynamic cluster resource capacity and resource queue DDL manipulations.
Statements consume resource from leaf queues.
Branch queue
Leaf queue
© Copyright 2013 Pivotal. All rights reserved.
DEMO
© Copyright 2013 Pivotal. All rights reserved.
Thank	you

Weitere ähnliche Inhalte

Was ist angesagt?

a Secure Public Cache for YARN Application Resources
a Secure Public Cache for YARN Application Resourcesa Secure Public Cache for YARN Application Resources
a Secure Public Cache for YARN Application Resources
DataWorks Summit
 
De-Bugging Hive with Hadoop-in-the-Cloud
De-Bugging Hive with Hadoop-in-the-CloudDe-Bugging Hive with Hadoop-in-the-Cloud
De-Bugging Hive with Hadoop-in-the-Cloud
DataWorks Summit
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
DataWorks Summit
 

Was ist angesagt? (20)

Hortonworks Technical Workshop: HBase and Apache Phoenix
Hortonworks Technical Workshop: HBase and Apache Phoenix Hortonworks Technical Workshop: HBase and Apache Phoenix
Hortonworks Technical Workshop: HBase and Apache Phoenix
 
Cloudera Impala: A Modern SQL Engine for Apache Hadoop
Cloudera Impala: A Modern SQL Engine for Apache HadoopCloudera Impala: A Modern SQL Engine for Apache Hadoop
Cloudera Impala: A Modern SQL Engine for Apache Hadoop
 
a Secure Public Cache for YARN Application Resources
a Secure Public Cache for YARN Application Resourcesa Secure Public Cache for YARN Application Resources
a Secure Public Cache for YARN Application Resources
 
NYC HUG - Application Architectures with Apache Hadoop
NYC HUG - Application Architectures with Apache HadoopNYC HUG - Application Architectures with Apache Hadoop
NYC HUG - Application Architectures with Apache Hadoop
 
Cloudera Impala
Cloudera ImpalaCloudera Impala
Cloudera Impala
 
SQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for ImpalaSQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for Impala
 
Apache HBase + Spark: Leveraging your Non-Relational Datastore in Batch and S...
Apache HBase + Spark: Leveraging your Non-Relational Datastore in Batch and S...Apache HBase + Spark: Leveraging your Non-Relational Datastore in Batch and S...
Apache HBase + Spark: Leveraging your Non-Relational Datastore in Batch and S...
 
Hive on spark berlin buzzwords
Hive on spark berlin buzzwordsHive on spark berlin buzzwords
Hive on spark berlin buzzwords
 
De-Bugging Hive with Hadoop-in-the-Cloud
De-Bugging Hive with Hadoop-in-the-CloudDe-Bugging Hive with Hadoop-in-the-Cloud
De-Bugging Hive with Hadoop-in-the-Cloud
 
Application architectures with Hadoop – Big Data TechCon 2014
Application architectures with Hadoop – Big Data TechCon 2014Application architectures with Hadoop – Big Data TechCon 2014
Application architectures with Hadoop – Big Data TechCon 2014
 
Application Architectures with Hadoop
Application Architectures with HadoopApplication Architectures with Hadoop
Application Architectures with Hadoop
 
Disaster Recovery and Cloud Migration for your Apache Hive Warehouse
Disaster Recovery and Cloud Migration for your Apache Hive WarehouseDisaster Recovery and Cloud Migration for your Apache Hive Warehouse
Disaster Recovery and Cloud Migration for your Apache Hive Warehouse
 
Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)
Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)
Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)
 
Node labels in YARN
Node labels in YARNNode labels in YARN
Node labels in YARN
 
Evolving HDFS to Generalized Storage Subsystem
Evolving HDFS to Generalized Storage SubsystemEvolving HDFS to Generalized Storage Subsystem
Evolving HDFS to Generalized Storage Subsystem
 
A New "Sparkitecture" for modernizing your data warehouse
A New "Sparkitecture" for modernizing your data warehouseA New "Sparkitecture" for modernizing your data warehouse
A New "Sparkitecture" for modernizing your data warehouse
 
Hive2.0 sql speed-scale--hadoop-summit-dublin-apr-2016
Hive2.0 sql speed-scale--hadoop-summit-dublin-apr-2016Hive2.0 sql speed-scale--hadoop-summit-dublin-apr-2016
Hive2.0 sql speed-scale--hadoop-summit-dublin-apr-2016
 
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, ScaleApache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
 
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
 

Andere mochten auch

gsoc_mentor for Shivram Mani
gsoc_mentor for Shivram Manigsoc_mentor for Shivram Mani
gsoc_mentor for Shivram Mani
Shivram Mani
 

Andere mochten auch (20)

Build & test Apache Hawq
Build & test Apache Hawq Build & test Apache Hawq
Build & test Apache Hawq
 
Pivotal HAWQ and Hortonworks Data Platform: Modern Data Architecture for IT T...
Pivotal HAWQ and Hortonworks Data Platform: Modern Data Architecture for IT T...Pivotal HAWQ and Hortonworks Data Platform: Modern Data Architecture for IT T...
Pivotal HAWQ and Hortonworks Data Platform: Modern Data Architecture for IT T...
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_final
 
Apache HAWQ Architecture
Apache HAWQ ArchitectureApache HAWQ Architecture
Apache HAWQ Architecture
 
The Power of your Data Achieved - Next Gen Modernization
The Power of your Data Achieved - Next Gen ModernizationThe Power of your Data Achieved - Next Gen Modernization
The Power of your Data Achieved - Next Gen Modernization
 
Apache Ambari: Past, Present, Future
Apache Ambari: Past, Present, FutureApache Ambari: Past, Present, Future
Apache Ambari: Past, Present, Future
 
How to Use Apache Zeppelin with HWX HDB
How to Use Apache Zeppelin with HWX HDBHow to Use Apache Zeppelin with HWX HDB
How to Use Apache Zeppelin with HWX HDB
 
Hortonworks Data In Motion Series Part 4
Hortonworks Data In Motion Series Part 4Hortonworks Data In Motion Series Part 4
Hortonworks Data In Motion Series Part 4
 
Top 5 Strategies for Retail Data Analytics
Top 5 Strategies for Retail Data AnalyticsTop 5 Strategies for Retail Data Analytics
Top 5 Strategies for Retail Data Analytics
 
Dynamic Column Masking and Row-Level Filtering in HDP
Dynamic Column Masking and Row-Level Filtering in HDPDynamic Column Masking and Row-Level Filtering in HDP
Dynamic Column Masking and Row-Level Filtering in HDP
 
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
 
gsoc_mentor for Shivram Mani
gsoc_mentor for Shivram Manigsoc_mentor for Shivram Mani
gsoc_mentor for Shivram Mani
 
PXF BDAM 2016
PXF BDAM 2016PXF BDAM 2016
PXF BDAM 2016
 
Managing Apache HAWQ with Apache AMBARI
Managing Apache HAWQ with Apache AMBARIManaging Apache HAWQ with Apache AMBARI
Managing Apache HAWQ with Apache AMBARI
 
Hawq Hcatalog Integration
Hawq Hcatalog IntegrationHawq Hcatalog Integration
Hawq Hcatalog Integration
 
Apache Zeppelin Meetup Christian Tzolov 1/21/16
Apache Zeppelin Meetup Christian Tzolov 1/21/16 Apache Zeppelin Meetup Christian Tzolov 1/21/16
Apache Zeppelin Meetup Christian Tzolov 1/21/16
 
PXF HAWQ Unmanaged Data
PXF HAWQ Unmanaged DataPXF HAWQ Unmanaged Data
PXF HAWQ Unmanaged Data
 
Apache HAWQ : An Introduction
Apache HAWQ : An IntroductionApache HAWQ : An Introduction
Apache HAWQ : An Introduction
 
HAWQ: a massively parallel processing SQL engine in hadoop
HAWQ: a massively parallel processing SQL engine in hadoopHAWQ: a massively parallel processing SQL engine in hadoop
HAWQ: a massively parallel processing SQL engine in hadoop
 
Pivotal HAWQ - High Availability (2014)
Pivotal HAWQ - High Availability (2014)Pivotal HAWQ - High Availability (2014)
Pivotal HAWQ - High Availability (2014)
 

Ähnlich wie How to manage Hortonworks HDB Resources with YARN

Combine SAS High-Performance Capabilities with Hadoop YARN
Combine SAS High-Performance Capabilities with Hadoop YARNCombine SAS High-Performance Capabilities with Hadoop YARN
Combine SAS High-Performance Capabilities with Hadoop YARN
Hortonworks
 
Hawq wp 042313_final
Hawq wp 042313_finalHawq wp 042313_final
Hawq wp 042313_final
EMC
 
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarnBikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
hdhappy001
 

Ähnlich wie How to manage Hortonworks HDB Resources with YARN (20)

Combine SAS High-Performance Capabilities with Hadoop YARN
Combine SAS High-Performance Capabilities with Hadoop YARNCombine SAS High-Performance Capabilities with Hadoop YARN
Combine SAS High-Performance Capabilities with Hadoop YARN
 
Bdm hadoop ecosystem
Bdm hadoop ecosystemBdm hadoop ecosystem
Bdm hadoop ecosystem
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
 
Jun 2017 HUG: YARN Scheduling – A Step Beyond
Jun 2017 HUG: YARN Scheduling – A Step BeyondJun 2017 HUG: YARN Scheduling – A Step Beyond
Jun 2017 HUG: YARN Scheduling – A Step Beyond
 
Running Services on YARN
Running Services on YARNRunning Services on YARN
Running Services on YARN
 
Changes Expected in Hadoop 3 | Getting to Know Hadoop 3 Alpha | Upcoming Hado...
Changes Expected in Hadoop 3 | Getting to Know Hadoop 3 Alpha | Upcoming Hado...Changes Expected in Hadoop 3 | Getting to Know Hadoop 3 Alpha | Upcoming Hado...
Changes Expected in Hadoop 3 | Getting to Know Hadoop 3 Alpha | Upcoming Hado...
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Scaling Spark Workloads on YARN - Boulder/Denver July 2015
Scaling Spark Workloads on YARN - Boulder/Denver July 2015Scaling Spark Workloads on YARN - Boulder/Denver July 2015
Scaling Spark Workloads on YARN - Boulder/Denver July 2015
 
YARN
YARNYARN
YARN
 
Hawq wp 042313_final
Hawq wp 042313_finalHawq wp 042313_final
Hawq wp 042313_final
 
Caching and JCache with Greg Luck 18.02.16
Caching and JCache with Greg Luck 18.02.16Caching and JCache with Greg Luck 18.02.16
Caching and JCache with Greg Luck 18.02.16
 
Data Analytics and IoT, how to analyze data from IoT
Data Analytics and IoT, how to analyze data from IoTData Analytics and IoT, how to analyze data from IoT
Data Analytics and IoT, how to analyze data from IoT
 
Put the ‘Auto’ in Autoscaling – Make Kubernetes VPA and HPA work together for...
Put the ‘Auto’ in Autoscaling – Make Kubernetes VPA and HPA work together for...Put the ‘Auto’ in Autoscaling – Make Kubernetes VPA and HPA work together for...
Put the ‘Auto’ in Autoscaling – Make Kubernetes VPA and HPA work together for...
 
Apache Spark: Usage and Roadmap in Hadoop
Apache Spark: Usage and Roadmap in HadoopApache Spark: Usage and Roadmap in Hadoop
Apache Spark: Usage and Roadmap in Hadoop
 
Die pacman nomaden opnfv summit 2016 berlin
Die pacman nomaden opnfv summit 2016 berlinDie pacman nomaden opnfv summit 2016 berlin
Die pacman nomaden opnfv summit 2016 berlin
 
SQL and Machine Learning on Hadoop
SQL and Machine Learning on HadoopSQL and Machine Learning on Hadoop
SQL and Machine Learning on Hadoop
 
xPatterns ... beyond Hadoop (Spark, Shark, Mesos, Tachyon)
xPatterns ... beyond Hadoop (Spark, Shark, Mesos, Tachyon)xPatterns ... beyond Hadoop (Spark, Shark, Mesos, Tachyon)
xPatterns ... beyond Hadoop (Spark, Shark, Mesos, Tachyon)
 
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarnBikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
 
YARN - Hadoop Next Generation Compute Platform
YARN - Hadoop Next Generation Compute PlatformYARN - Hadoop Next Generation Compute Platform
YARN - Hadoop Next Generation Compute Platform
 
Apache Hadoop YARN: Understanding the Data Operating System of Hadoop
Apache Hadoop YARN: Understanding the Data Operating System of HadoopApache Hadoop YARN: Understanding the Data Operating System of Hadoop
Apache Hadoop YARN: Understanding the Data Operating System of Hadoop
 

Mehr von Hortonworks

Mehr von Hortonworks (20)

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with Cloudbreak
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log Events
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's New
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data Landscape
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache Druid
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at Scale
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with Ease
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data Management
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDC
 

Kürzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Kürzlich hochgeladen (20)

DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 

How to manage Hortonworks HDB Resources with YARN

  • 1. 1© 2016 Pivotal Software, Inc. All rights reserved. 1© 2016 Pivotal Software, Inc. All rights reserved. Understanding YARN and HAWQ Resource Mgmt Dan Baskette Director of Technical Marketing Apache HAWQ
  • 2. 2© 2016 Pivotal Software, Inc. All rights reserved. Agenda Basics of Hadoop YARN HAWQ & YARN integration Demo Q&A
  • 3. 3© 2016 Pivotal Software, Inc. All rights reserved. What is YARN ● YARN Basics ● Why do you care? ● YARN Components and Architecture
  • 4. 4© 2016 Pivotal Software, Inc. All rights reserved. YARN 101 ● Why YARN? (limitation of mapreduce 1) ○ Multi-tenancy (spark, hbase,tez,accumulo,hawq...) ○ Resource utilization (SLA) ○ Scalability (throttling resource dynamically) ○ Compatibility ● Cluster resource management ○ Global resource allocation ○ Local resource assignment and reporting ● What resources (resource model) ○ Heterogeneous servers/racks ○ CPUs ○ Memory
  • 5. 5© 2016 Pivotal Software, Inc. All rights reserved. How Yarn works ● Yarn components ○ Resource manager: master for managing global resources ○ Node managers: master for local resource allocations. ○ Application master: per application, manages application life cycle and task scheduling. ○ Container: abstracted set of resources (cpu core/memory) granted by resource manager to run a task of application. ● Yarn application life cycle
  • 6. 6© 2016 Pivotal Software, Inc. All rights reserved. Capacity scheduler and queues ● Capacity scheduler ○ Elasticity with Multi-tenancy ○ Security ○ Resource awareness ○ Granular scheduling ○ Locality ○ Scheduling policies ● Queues ○ Hierarchical ■ Parent-queues don't accept applications. ■ Leaf-queues have no child queue and accept applications. ○ Access controls
  • 7. © Copyright 2013 Pivotal. All rights reserved. Apache HAWQ Resource Management and Integration with YARN
  • 8. 8© 2016 Pivotal Software, Inc. All rights reserved. Cluster level Global (YARN) HAWQ (Resource Qs) Query (Internal) Cluster-Admin defined Hardware efficiency Share with MR/Hive/+ Defined in XML HAWQ Internal HAWQ-Admin defined Multi-tenancy Workload prioritization Defined in DDL Query level System defined Query Optimization Operator prioritization Dynamic Resource Management in HAWQ 3-Tier Resource Management
  • 9. © Copyright 2013 Pivotal. All rights reserved. Apache HAWQ High-Level Architecture YARN NameNode Resource Broker libYARN Resource Manager Fault Tolerance Service Optimizer Parser/ Analyzer Dispatcher Catalog Service HDFS Catalog Cache Client Client Client YARN Node Manager HDFS DataNode Segment Virtual SegmentVirtual SegmentVirtual SegmentVirtual Segment YARN Node Manager HDFS DataNode Segment Virtual SegmentVirtual SegmentVirtual SegmentVirtual Segment YARN Node Manager HDFS DataNode Segment Virtual SegmentVirtual SegmentVirtual SegmentVirtual Segment External System
  • 10. © Copyright 2013 Pivotal. All rights reserved. Apache HAWQ Resource Concepts Global resource manager (GRM) containers are assigned to one HAWQ segment. Resource granularity is usually as large as GBs per container.YARN is considered as one GRM. Host HAWQ Segment HAWQ Segment manages the resource consumable for QEs. One host has at most one segmentdeployed for one HAWQ cluster. TotalCapacity AvailCapacity Used One Virtual Segments (VSEG) are allocated containing query resource for a group of QEs. Resource granularity is usually small, 256MB for example. VSEG VSEG VSEG GRM Container GRM Container GRM Container GRM Container Large granularity,Low frequency resource negotiation between Apache HAWQ and GRM server. Small granularity,High frequency resource negotiation between QD and Apache HAWQ RM.
  • 11. © Copyright 2013 Pivotal. All rights reserved. High-Level Flow of Query Execution Parser DispatcherPlanner query Parse tree plan slice and resource quota Executor result QD QE Resource Manager query resource YARN containers In case running in YARN mode
  • 12. © Copyright 2013 Pivotal. All rights reserved. ▪ Apache HAWQ forks resource broker process to register itself as a YARN application to YARN Resource Manager. ▪ Resource broker process acts as an unmanaged application master.An unmanaged application master is managed outside ofYARN resource manager.There isn’ta container launched for application master. ▪ A thread in resource broker process is created sending heartbeatto YARN resource manager on behalfof Apache HAWQ to keep registered application active and alive. ▪ QEs are running on segments with Node Manager running.If a HAWQ segmenthas no Node Manager running,it will be marked as DOWN by HAWQ and will not be used by resource pool. ▪ Apache HAWQ activates YARN containers by dummy processes once getting the allocated container list. ▪ Apache HAWQ enforces resource usage in all segments by itself. Apache HAWQ in YARN Mode
  • 13. © Copyright 2013 Pivotal. All rights reserved. Configure to Integrate Apache HAWQ with YARN ▪ YARN configuration required by Apache HAWQ ▪ Capacity Scheduler is required ▪ Assign one queue exclusively to Apache HAWQ ▪ Configure hawq-site.xml to enable yarn mode and let Apache HAWQ connect to YARNProperty Definition hawq_global_rm_type Apache HAWQ global resource manager type that indicates Apache HAWQ runs in corresponding modes. ‘yarn’ means enabling yarn mode. hawq_rm_yarn_address YARN resource manager address including. hawq_rm_yarn_scheduler_address YARN scheduler address. hawq_rm_yarn_queue_name YARN queue name that used by Apache HAWQ resource manager. hawq_rm_yarn_app_name Apache HAWQ YARN application name.
  • 14. © Copyright 2013 Pivotal. All rights reserved. Friendly Apache HAWQ YARN Resource Return Apache HAWQ resource manager friendly uses YARN allocated resource,which makes it flexibly co-exist with another YARN applications ▪ Only acquire resource on-demand ▪ Configurable minimum water-level ofholding resource ▪ Pause allocating query resource and return YARN containers when resource manager overuses YARN queue and it finds the YARN queue is busy ▪ Return resource when resource in-use water mark becomes lower Time(s) In-use resource Configurable window size Resource in-use water mark Current
  • 15. © Copyright 2013 Pivotal. All rights reserved. Even Resource Consumption among Segments ▪ Apache HAWQ resource manager tries its best to negotiate with global resource manager ( YARN for example) to get evenly allocated resource ▪ Apache HAWQ resource manager tries its best to allocate VSEGs among segments to make all available segments evenly utilized ▪ Preferred hosts are passed to global resource manager to acquire containers ▪ When Apache HAWQ resource manager decides to return containers, the segments having most allocated containers are considered preferentially ▪ Combined workload ordered index is dynamically maintained in resource pool so that the segments having lower workload and more available resource are considered in priority ▪ Round-robin strategy is adopted generally to ensure that for one query execution, all possible available segments are utilized, and the segments have almost the same number of VSEGs ▪ Data-locality based VSEG allocation strategy is restricted in some cases
  • 16. © Copyright 2013 Pivotal. All rights reserved. Allocating VSEGs According to HDFS Data Locality ▪ Apache HAWQ holds target table’s data locality information that is passed to resource manager when acquiring query resource. ▪ Apache HAWQ resource manager allocates VSEGs among segments if it is appropriate to follow data locality information. ▪ It works for small scale queries requiring a few VSEGs only ▪ If a lot of VSEGs are required for a large scale query execution, this strategy is not adopted ▪ If the VSEG allocation intensifies the balanceof resource usageamong segments too much, this strategy is not adopted
  • 17. © Copyright 2013 Pivotal. All rights reserved. Apache HAWQ Resource Queues DDL for manipulating resource queues CREATE RESOURCE QUEUE queue_name WITH (queue_attr_list); ALTER RESOURCE QUEUE queue_name WITH (queue_attr_list); DROP RESOURCE QUEUE queue_name; queue_attr_list ::= queue_attr=value | queue_attr=value,queue_attr_listAttribute Value PARENT* The parent queue’s name. ACTIVE_STATEMENTS The limit of number of parallel active statements. Resource queue’s concurrency control. MEMORY_LIMIT_CLUSTER* The percentage of memory resource consumable from its parent queue’s capacity. CORE_LIMIT_CLUSTER* The percentage of virtual core resource consumable from its parent queue’s capacity. Currently, this value must be identical with MEMORY_LIMIT_CLUSTER. ALLOCATION_POLICY The resource allocation policy, currently, only ‘even’ policy is supported. VSEG_RESOURCE_QUOTA The resource quota of one virtual segment (VSEG). RESOURCE_OVERCOMMIT_FACTOR The factor of overusing resource from the other resource queues. NVSEG_UPPER_LIMIT The absolute maximum number of VSEGs for one statement. Default value 0. NVSEG_LOWER_LIMIT The absolute minimum number of VSEGs for one statement. Default value 0. NVSEG_UPPER_LIMIT_PERSEG NVSEG_UPPER_LIMIT_PERSEG times available segment count derives the absolute maximum number of VSEGs for one statement. Default value 0.0. NVSEG_LOWER_LIMIT_PERSEG NVSEG_LOWER_LIMIT_PERSEG times available segment count derives the absolute minimum number of VSEGs for one statement. Default value 0.0. * The attributes must be specified when creating a resource queue.
  • 18. © Copyright 2013 Pivotal. All rights reserved. Leaf queue pg_root Tree-structured Resource Queues memory_limit_cluster 100% core_limit_cluster 100% pg_default memory_limit_cluster 50% core_limit_cluster 50% active_statements 20 allocation_policy even vseg_resource_quota 256M overcommit_factor 2 pg_dep1 memory_limit_cluster 50% core_limit_cluster 50% active_statements 100 allocation_policy even vseg_resource_quota 512M overcommit_factor 1 CREATE RESOURCE QUEUE pg_dep1 WITH( PARENT=‘pg_root’, ACTIVE_STATEMENTS=100, VSEG_RESOURCE_QUOTA=‘mem:512mb’, OVERCOMMIT_FACTOR=1 ); ALTER RESOURCE QUEUE pg_default WITH( MEMORY_LIMIT_CLUSTER=20%, CORE_LIMIT_CLUSTER=20% ); memory_limit_cluster 20% core_limit_cluster 20% active_statements 20 allocation_policy even vseg_resource_quota 256M overcommit_factor 2 CREATE ROLE demorole RESOURCE QUEUE pg_dep1; ALTER RESOURCE QUEUE pg_dep1 WITH( MEMORY_LIMIT_CLUSTER=80%, CORE_LIMIT_CLUSTER=80%, OVERCOMMIT_FACTOR=2 ); SET ROLE demorole; SELECT COUNT(*) FROM foo; memory_limit_cluster 80% core_limit_cluster 80% active_statements 100 allocation_policy even vseg_resource_quota 512M overcommit_factor 2 80GB 20 CORE 40GB 10 CORE MAX 80GB 20 CORE 40GB 10 CORE MAX 40GB 10 CORE 16GB 4 CORE MAX 32GB 8 CORE 64GB 16 CORE MAX 80GB 20 CORE Resource queues’ capacities are automatically estimated based on dynamic cluster resource capacity and resource queue DDL manipulations. Statements consume resource from leaf queues. Branch queue Leaf queue
  • 19. © Copyright 2013 Pivotal. All rights reserved. DEMO
  • 20. © Copyright 2013 Pivotal. All rights reserved. Thank you