SlideShare ist ein Scribd-Unternehmen logo
1 von 36
한국해양과학기술진흥원
Cluster and Grid Computing
2013.10.6
Sayed Chhattan Shah, PhD
Senior Researcher
Electronics and Telecommunications Research Institute, Korea
한국해양과학기술진흥원
Outline
 Cluster Computing
 Architecture
 Key Components
 Grid Computing
 Architecture
 Key Components
 Resource Management
• Discovery
• QoS Support
• Scheduling
Cluster Computing
한국해양과학기술진흥원
Cluster
A type of distributed system
A collection of workstations of PCs that are
interconnected by a high-speed network
Work as an integrated collection of resources
Have a single system image spanning all its nodes
한국해양과학기술진흥원
Sequential Applications
Parallel Applications
Parallel Programming Environment
Cluster Middleware
(Single System Image and Availability Infrastructure)
Cluster Interconnection Network/Switch
PC/Workstation
Network Interface
Hardware
Communications
Software
PC/Workstation
Network Interface
Hardware
Communications
Software
PC/Workstation
Network Interface
Hardware
Communications
Software
PC/Workstation
Network Interface
Hardware
Communications
Software
Sequential Applications
Sequential Applications
Parallel Applications
Parallel Applications
Cluster Computer Architecture
한국해양과학기술진흥원
Prominent Components of Cluster Computers
Multiple High Performance Computers
 PCs
 Workstations
State of the art Operating Systems
 Linux (MOSIX, Beowulf, and many more)
 Microsoft NT (Illinois HPVM, Cornell Velocity)
 SUN Solaris (Berkeley NOW, C-DAC PARAM)
 IBM AIX (IBM SP2)
한국해양과학기술진흥원
Prominent Components of Cluster Computers
High Performance Networks
 Ethernet (10Mbps),
 Fast Ethernet (100Mbps),
 Gigabit Ethernet (1Gbps)
 SCI (Scalable Coherent Interface- MPI- 12µsec latency)
 ATM (Asynchronous Transfer Mode)
 Myrinet (1.2Gbps)
 Digital Memory Channel
 FDDI (fiber distributed data interface)
 InfiniBand
한국해양과학기술진흥원
Fast Communication Protocols and Services
 Active Messages (Berkeley)
 Fast Messages (Illinois)
 U-net (Cornell)
 XTP (Virginia)
 Virtual Interface Architecture (VIA)
Prominent Components of Cluster Computers
한국해양과학기술진흥원
Myrinet QSnet Giganet ServerNet2
SCI Gigabit
Ethernet
Bandwidth
(MBytes/s)
140 – 33MHz
215 – 66 Mhz 208 ~105 165 ~80 30 - 50
MPI
Latency (µs)
16.5 – 33Nhz
11 – 66 Mhz
5 ~20 - 40 20.2 6 100 - 200
List price/port $1.5K $6.5K $1.5K ~$1.5K
Hardware
Availability
Now Now Now Q2‘00 Now Now
Linux Support Now Late‘00 Now Q2‘00 Now Now
Maximum
#nodes
1000’s 1000’s 1000’s 64K 1000’s
Protocol
Implementation
Firmware on
adapter
Firmware
on adapter
Firmware on
adapter
Implemented in h
ardware
Implemented
in hardware
VIA support Soon None NT/Linux Done in hardware Software
TCP/IP, VIA
NT/Linux
MPI support 3rd party Quadrics/
Compaq
3rd Party Compaq/3rd party MPICH – TCP/IP
1000’s
Firmware
on adapter
~$1.5K
3rd Party
~$1.5K
Prominent Components of Cluster Computers
한국해양과학기술진흥원
Cluster Middleware
 Resource management and scheduling
 Fault handling
 Migration
 Load balancing
Prominent Components of Cluster Computers
Grid Computing
한국해양과학기술진흥원
Overview: Clusters x GridsCluster - How can we use local networked resources
to achieve better performance for large scale
applications?
 High speed networks
 Centralized resource and task management
How can we put together geographically distributed
resources to achieve even better results?
 Distributed resource and task management
 No high speed connections
Grid Computing
Information
Generators
Information Distributed
Over the Grid
Customer
Access to
Information
Grid
 Computing power should be available on demand, for a fee
 Just like the electrical power grid.
Basic Idea
Grid and Cluster
한국해양과학기술진흥원
Grid Computing 15
Core networking technology now accelerates at a much
faster rate than advances in microprocessor speeds
Exploiting under utilized resources
Parallel CPU capacity
Access to additional resources
Why Grid Computing?
한국해양과학기술진흥원
Grid Computing
 Several clusters in Grid
 May include super computers, desktops, laptops, mobile devices
한국해양과학기술진흥원
1800 Physicists, 150 Institutes, 32 Countries
100 PB of data by 2010; 50,000 CPUs?
CERNs Large Hadron Collider
한국해양과학기술진흥원
Data Grids for High Energy Physics
Tier2 Centre
~1 TIPS
Online System
Offline Processor Farm
~20 TIPS
CERN Computer Centre
FermiLab ~4 TIPSFrance Regional
Centre
Italy Regional
Centre
Germany Regional
Centre
InstituteInstituteInstitute
Institute
~0.25TIPS
Physicist workstations
~100 MBytes/sec
~100 MBytes/sec
~622 Mbit/sec
~1 MBytes/sec
There is a “bunch crossing” every 25 nsecs.
There are 100 “triggers” per second
Each triggered event is ~1 MByte in size
Physicists work on analysis “channels”.
Each institute will have ~10 physicists working on one or more
channels; data for these channels should be cached by the
institute server
Physics data cache
~PBytes/sec
~622 Mbits/sec
or Air Freight (deprecated)
Tier2 Centre
~1 TIPS
Tier2 Centre
~1 TIPS
Tier2 Centre
~1 TIPS
Caltech
~1 TIPS
~622 Mbits/sec
Tier 0
Tier 1
Tier 2
Tier 4
1 TIPS is approximately 25,000
SpecInt95 equivalents
한국해양과학기술진흥원
Grid
Fabric
Grid
Apps.
Grid
Middleware
Grid
Tools
Networked Resources across Organisations
Computers Clusters Data Sources Scientific InstrumentsStorage Systems
Local Resource Managers
Operating Systems Queuing Systems TCP/IP & UDP
…
Libraries & App Kernels …
Distributed Resources Coupling Services
Security Information … QoSProcess
Development Environments and Tools
Languages Libraries Debuggers … Web toolsResource BrokersMonitoring
Applications and Portals
Prob. Solving Env.Scientific …CollaborationEngineering Web enabled Apps
Resource Trading
Grid Components
Market Info
한국해양과학기술진흥원
Overview: Clusters x GridsA large proportion of personal computer’s
computational power is left unused
A desktop grid takes this unused capacity
 Local Desktop Grid
• Comprised mainly of a set of computers at one location
 Volunteer Desktop Grid
• Resources in a volunteer desktop grid are provided by citizens
all over the world
Desktop Grid
한국해양과학기술진흥원
Types of Grids
 Computational Grid
 Processing power is the main computing resource shared
amongst nodes
 Distributed Supercomputing
• Executes the application in parallel on multiple machines to reduce
the completion time
 High throughput
• Increases the completion rate of a stream of jobs
 Data Grid
 Data storage capacity as the main shared resource amongst
nodes
Resource Management
한국해양과학기술진흥원
Overview: Clusters x GridsManages the pool of resources available to Grid
 Processors
 Network bandwidth
 Disk storage
The pool includes resources from different providers
 RMS should maintain the required level of trust
• Without affecting performance
 RMS should adhere to different policies
 RMS should meet QoS requirements
Resource Management System
한국해양과학기술진흥원
Overview: Clusters x Grids
Core Functions of Resource Management System
한국해양과학기술진흥원
Overview: Clusters x GridsResource Dissemination and Discovery Protocols
 Used to determine the state of the resources
• Resource Dissemination Protocol
• Provides information about the resources
• Discovery Protocol
• Provides a mechanism by which resource information can be found
Resource resolution and co-allocation protocols
 To schedule the job at the remote resource
 Simultaneously acquire multiple resources
Core Functions of Resource Management System
한국해양과학기술진흥원
Overview: Clusters x GridsMachine Organization
 Organization of the machines in the Grid affects the
communication patterns and thus
• determines the scalability
Resource Management System
한국해양과학기술진흥원
Overview: Clusters x Grids Centralized Organization
• a single controller or designated set of controllers performs the
scheduling for all machines
• suffer from scalability issues
 Decentralized Organization
• Roles are distributed among machines
• Sender initiated
• Receiver initiated
Resource Management System
한국해양과학기술진흥원
Overview: Clusters x Grids
 Flat Organization
• All machines can directly communicate with each other without going
through
 Hierarchical Organization
• Machines in the same level can directly communicate with the
machines directly above them or below them
 Cell or Group Organization
• Machines within the cell communicate between themselves using flat
organization
• Designated machines within the cell function acts as boundary elements
that are responsible for all communication outside the cell
• Flat cell structure has only one level of cells
• Hierarchical cell structure can have cells that contain other cells
Resource Management System
한국해양과학기술진흥원
Overview: Clusters x GridsQoS Support
 QoS is not limited to network bandwidth but extends to the
processing and storage capabilities of the nodes
 Resource reservation is one of the ways of providing guaranteed
QoS
 Key components of QoS
• Admission control determines if requested level of service can be given
• Policing ensures that job does not violate agreed upon level of service
Resource Management System
한국해양과학기술진흥원
Overview: Clusters x GridsResource Discovery and Dissemination
 Discovery is initiated by applications to find suitable resources
 Dissemination is initiated by resources to find suitable application
Resource Management System
한국해양과학기술진흥원
Overview: Clusters x GridsScheduling
 Determining when and where the jobs are executed and how
many resources are allocated
 Time-shared job-scheduling approaches
• Multiple jobs share the same resources
 Space-shared job-scheduling approaches
• Multiple jobs can run at any point of time by the available nodes
 Gang or Synchronous Scheduling
• Scheduling all tasks of application at the same time
 Loosely coordinated co-scheduling
• Schedule communicating tasks of application at the same time
Resource Management System
한국해양과학기술진흥원
Overview: Clusters x GridsScheduling Objectives
 Minimize response time and
 Maximize system utilization
 Trade-off
• Maximizing system utilization may increase response time
Resource Management System
한국해양과학기술진흥원
Overview: Clusters x GridsJob Requirements
 Independent jobs
 Dependent jobs
• Precedence dependency
• Parallel Dependency
Resource Management System
한국해양과학기술진흥원
Overview: Clusters x GridsScheduling
Resource Management System
한국해양과학기술진흥원
Overview: Clusters x GridsState Estimation
 Predictive state estimation uses current and historical job and
resource status information
 Non-predictive state estimation uses only the current job and
resource status information
Resource Management System
한국해양과학기술진흥원
Overview: Clusters x GridsRescheduling
 To improve utilization, balance load, etc
 Periodic or batch rescheduling approaches group resource
requests and system events which are then processed at
intervals
 Event driven online rescheduling performs rescheduling as soon
the RMS receives the resource request or system event
Resource Management System

Weitere ähnliche Inhalte

Was ist angesagt?

Distributed & parallel system
Distributed & parallel systemDistributed & parallel system
Distributed & parallel system
Manish Singh
 
CS8791 Cloud Computing - Question Bank
CS8791 Cloud Computing - Question BankCS8791 Cloud Computing - Question Bank
CS8791 Cloud Computing - Question Bank
pkaviya
 
distributed shared memory
 distributed shared memory distributed shared memory
distributed shared memory
Ashish Kumar
 
remote procedure calls
  remote procedure calls  remote procedure calls
remote procedure calls
Ashish Kumar
 

Was ist angesagt? (20)

Characteristics of cloud computing
Characteristics of cloud computingCharacteristics of cloud computing
Characteristics of cloud computing
 
Cluster Computing
Cluster ComputingCluster Computing
Cluster Computing
 
Distributed Mutual exclusion algorithms
Distributed Mutual exclusion algorithmsDistributed Mutual exclusion algorithms
Distributed Mutual exclusion algorithms
 
Aneka platform
Aneka platformAneka platform
Aneka platform
 
Communication in Distributed Systems
Communication in Distributed SystemsCommunication in Distributed Systems
Communication in Distributed Systems
 
Distributed & parallel system
Distributed & parallel systemDistributed & parallel system
Distributed & parallel system
 
Presentation on Shared Memory Parallel Programming
Presentation on Shared Memory Parallel ProgrammingPresentation on Shared Memory Parallel Programming
Presentation on Shared Memory Parallel Programming
 
Cloud computing architecture
Cloud computing architectureCloud computing architecture
Cloud computing architecture
 
CS8791 Cloud Computing - Question Bank
CS8791 Cloud Computing - Question BankCS8791 Cloud Computing - Question Bank
CS8791 Cloud Computing - Question Bank
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Cloud Security, Standards and Applications
Cloud Security, Standards and ApplicationsCloud Security, Standards and Applications
Cloud Security, Standards and Applications
 
distributed shared memory
 distributed shared memory distributed shared memory
distributed shared memory
 
Introduction to Parallel Computing
Introduction to Parallel ComputingIntroduction to Parallel Computing
Introduction to Parallel Computing
 
Distributed Systems
Distributed SystemsDistributed Systems
Distributed Systems
 
remote procedure calls
  remote procedure calls  remote procedure calls
remote procedure calls
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Distributed file system
Distributed file systemDistributed file system
Distributed file system
 
The CAP Theorem
The CAP Theorem The CAP Theorem
The CAP Theorem
 
Historical development of cloud computing
Historical development of cloud computingHistorical development of cloud computing
Historical development of cloud computing
 
Cloud Computing Using OpenStack
Cloud Computing Using OpenStack Cloud Computing Using OpenStack
Cloud Computing Using OpenStack
 

Ähnlich wie Cluster and Grid Computing

Microx - A Unix like kernel for Embedded Systems written from scratch.
Microx - A Unix like kernel for Embedded Systems written from scratch.Microx - A Unix like kernel for Embedded Systems written from scratch.
Microx - A Unix like kernel for Embedded Systems written from scratch.
Waqar Sheikh
 
CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...
CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...
CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...
Farley Lai
 
Unit i introduction to grid computing
Unit i   introduction to grid computingUnit i   introduction to grid computing
Unit i introduction to grid computing
sudha kar
 
SDN Security Talk - (ISC)2_3
SDN Security Talk - (ISC)2_3SDN Security Talk - (ISC)2_3
SDN Security Talk - (ISC)2_3
Wen-Pai Lu
 

Ähnlich wie Cluster and Grid Computing (20)

Grid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applicationsGrid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applications
 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
 
Distributed Clouds and Software Defined Networking
Distributed Clouds and Software Defined NetworkingDistributed Clouds and Software Defined Networking
Distributed Clouds and Software Defined Networking
 
Inroduction to grid computing by gargi shankar verma
Inroduction to grid computing by gargi shankar vermaInroduction to grid computing by gargi shankar verma
Inroduction to grid computing by gargi shankar verma
 
IEEE HPSR 2017 Keynote: Softwarized Dataplanes and the P^3 trade-offs: Progra...
IEEE HPSR 2017 Keynote: Softwarized Dataplanes and the P^3 trade-offs: Progra...IEEE HPSR 2017 Keynote: Softwarized Dataplanes and the P^3 trade-offs: Progra...
IEEE HPSR 2017 Keynote: Softwarized Dataplanes and the P^3 trade-offs: Progra...
 
Microx - A Unix like kernel for Embedded Systems written from scratch.
Microx - A Unix like kernel for Embedded Systems written from scratch.Microx - A Unix like kernel for Embedded Systems written from scratch.
Microx - A Unix like kernel for Embedded Systems written from scratch.
 
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18
 
MetaCloud Computing Environment
MetaCloud Computing EnvironmentMetaCloud Computing Environment
MetaCloud Computing Environment
 
Grid computing
Grid computingGrid computing
Grid computing
 
CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...
CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...
CSense: A Stream-Processing Toolkit for Robust and High-Rate Mobile Sensing A...
 
Software defined network-- SDN
Software defined network-- SDNSoftware defined network-- SDN
Software defined network-- SDN
 
Manage Microservices & Fast Data Systems on One Platform w/ DC/OS
Manage Microservices & Fast Data Systems on One Platform w/ DC/OSManage Microservices & Fast Data Systems on One Platform w/ DC/OS
Manage Microservices & Fast Data Systems on One Platform w/ DC/OS
 
Service Fabric and Azure Service Fabric Mesh introduction
Service Fabric and Azure Service Fabric Mesh introductionService Fabric and Azure Service Fabric Mesh introduction
Service Fabric and Azure Service Fabric Mesh introduction
 
Grid computing
Grid computingGrid computing
Grid computing
 
Software Defined Networking - Huawei, June 2017
Software Defined Networking - Huawei, June 2017Software Defined Networking - Huawei, June 2017
Software Defined Networking - Huawei, June 2017
 
SDN 101: Software Defined Networking Course - Sameh Zaghloul/IBM - 2014
SDN 101: Software Defined Networking Course - Sameh Zaghloul/IBM - 2014SDN 101: Software Defined Networking Course - Sameh Zaghloul/IBM - 2014
SDN 101: Software Defined Networking Course - Sameh Zaghloul/IBM - 2014
 
Unit i introduction to grid computing
Unit i   introduction to grid computingUnit i   introduction to grid computing
Unit i introduction to grid computing
 
SDN Security Talk - (ISC)2_3
SDN Security Talk - (ISC)2_3SDN Security Talk - (ISC)2_3
SDN Security Talk - (ISC)2_3
 
Hannover Messe 2017 - Systems Federation in industrie 4.0
Hannover Messe 2017 - Systems Federation in industrie 4.0Hannover Messe 2017 - Systems Federation in industrie 4.0
Hannover Messe 2017 - Systems Federation in industrie 4.0
 
Proactive ops for container orchestration environments
Proactive ops for container orchestration environmentsProactive ops for container orchestration environments
Proactive ops for container orchestration environments
 

Mehr von Sayed Chhattan Shah

Mehr von Sayed Chhattan Shah (17)

Introduction to System Programming
Introduction to System ProgrammingIntroduction to System Programming
Introduction to System Programming
 
Introduction to Differential Equations
Introduction to Differential EquationsIntroduction to Differential Equations
Introduction to Differential Equations
 
Algorithm Design and Analysis
Algorithm Design and AnalysisAlgorithm Design and Analysis
Algorithm Design and Analysis
 
Cloud and Edge Computing Systems
Cloud and Edge Computing SystemsCloud and Edge Computing Systems
Cloud and Edge Computing Systems
 
Introduction to Internet of Things
Introduction to Internet of ThingsIntroduction to Internet of Things
Introduction to Internet of Things
 
IoT Network Technologies
IoT Network TechnologiesIoT Network Technologies
IoT Network Technologies
 
5G Network: Requirements, Design Principles, Architectures, and Enabling Tech...
5G Network: Requirements, Design Principles, Architectures, and Enabling Tech...5G Network: Requirements, Design Principles, Architectures, and Enabling Tech...
5G Network: Requirements, Design Principles, Architectures, and Enabling Tech...
 
Data Center Networks
Data Center NetworksData Center Networks
Data Center Networks
 
IEEE 802.11 Architecture and Services
IEEE 802.11 Architecture and ServicesIEEE 802.11 Architecture and Services
IEEE 802.11 Architecture and Services
 
Routing in Mobile Ad hoc Networks
Routing in Mobile Ad hoc NetworksRouting in Mobile Ad hoc Networks
Routing in Mobile Ad hoc Networks
 
Keynote Talk on Recent Advances in Mobile Grid and Cloud Computing
Keynote Talk on Recent Advances in Mobile Grid and Cloud ComputingKeynote Talk on Recent Advances in Mobile Grid and Cloud Computing
Keynote Talk on Recent Advances in Mobile Grid and Cloud Computing
 
Keynote on Mobile Grid and Cloud Computing
Keynote on Mobile Grid and Cloud ComputingKeynote on Mobile Grid and Cloud Computing
Keynote on Mobile Grid and Cloud Computing
 
Introduction to Mobile Ad hoc Networks
Introduction to Mobile Ad hoc NetworksIntroduction to Mobile Ad hoc Networks
Introduction to Mobile Ad hoc Networks
 
Cloud Robotics
Cloud RoboticsCloud Robotics
Cloud Robotics
 
Introduction to Cloud Computing
Introduction to Cloud ComputingIntroduction to Cloud Computing
Introduction to Cloud Computing
 
Tips on Applying for a Scholarship
Tips on Applying for a ScholarshipTips on Applying for a Scholarship
Tips on Applying for a Scholarship
 
Introduction to Parallel and Distributed Computing
Introduction to Parallel and Distributed ComputingIntroduction to Parallel and Distributed Computing
Introduction to Parallel and Distributed Computing
 

Kürzlich hochgeladen

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 

Kürzlich hochgeladen (20)

TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 

Cluster and Grid Computing

  • 1. 한국해양과학기술진흥원 Cluster and Grid Computing 2013.10.6 Sayed Chhattan Shah, PhD Senior Researcher Electronics and Telecommunications Research Institute, Korea
  • 2. 한국해양과학기술진흥원 Outline  Cluster Computing  Architecture  Key Components  Grid Computing  Architecture  Key Components  Resource Management • Discovery • QoS Support • Scheduling
  • 4. 한국해양과학기술진흥원 Cluster A type of distributed system A collection of workstations of PCs that are interconnected by a high-speed network Work as an integrated collection of resources Have a single system image spanning all its nodes
  • 5. 한국해양과학기술진흥원 Sequential Applications Parallel Applications Parallel Programming Environment Cluster Middleware (Single System Image and Availability Infrastructure) Cluster Interconnection Network/Switch PC/Workstation Network Interface Hardware Communications Software PC/Workstation Network Interface Hardware Communications Software PC/Workstation Network Interface Hardware Communications Software PC/Workstation Network Interface Hardware Communications Software Sequential Applications Sequential Applications Parallel Applications Parallel Applications Cluster Computer Architecture
  • 6. 한국해양과학기술진흥원 Prominent Components of Cluster Computers Multiple High Performance Computers  PCs  Workstations State of the art Operating Systems  Linux (MOSIX, Beowulf, and many more)  Microsoft NT (Illinois HPVM, Cornell Velocity)  SUN Solaris (Berkeley NOW, C-DAC PARAM)  IBM AIX (IBM SP2)
  • 7. 한국해양과학기술진흥원 Prominent Components of Cluster Computers High Performance Networks  Ethernet (10Mbps),  Fast Ethernet (100Mbps),  Gigabit Ethernet (1Gbps)  SCI (Scalable Coherent Interface- MPI- 12µsec latency)  ATM (Asynchronous Transfer Mode)  Myrinet (1.2Gbps)  Digital Memory Channel  FDDI (fiber distributed data interface)  InfiniBand
  • 8. 한국해양과학기술진흥원 Fast Communication Protocols and Services  Active Messages (Berkeley)  Fast Messages (Illinois)  U-net (Cornell)  XTP (Virginia)  Virtual Interface Architecture (VIA) Prominent Components of Cluster Computers
  • 9. 한국해양과학기술진흥원 Myrinet QSnet Giganet ServerNet2 SCI Gigabit Ethernet Bandwidth (MBytes/s) 140 – 33MHz 215 – 66 Mhz 208 ~105 165 ~80 30 - 50 MPI Latency (µs) 16.5 – 33Nhz 11 – 66 Mhz 5 ~20 - 40 20.2 6 100 - 200 List price/port $1.5K $6.5K $1.5K ~$1.5K Hardware Availability Now Now Now Q2‘00 Now Now Linux Support Now Late‘00 Now Q2‘00 Now Now Maximum #nodes 1000’s 1000’s 1000’s 64K 1000’s Protocol Implementation Firmware on adapter Firmware on adapter Firmware on adapter Implemented in h ardware Implemented in hardware VIA support Soon None NT/Linux Done in hardware Software TCP/IP, VIA NT/Linux MPI support 3rd party Quadrics/ Compaq 3rd Party Compaq/3rd party MPICH – TCP/IP 1000’s Firmware on adapter ~$1.5K 3rd Party ~$1.5K Prominent Components of Cluster Computers
  • 10. 한국해양과학기술진흥원 Cluster Middleware  Resource management and scheduling  Fault handling  Migration  Load balancing Prominent Components of Cluster Computers
  • 12. 한국해양과학기술진흥원 Overview: Clusters x GridsCluster - How can we use local networked resources to achieve better performance for large scale applications?  High speed networks  Centralized resource and task management How can we put together geographically distributed resources to achieve even better results?  Distributed resource and task management  No high speed connections Grid Computing
  • 13. Information Generators Information Distributed Over the Grid Customer Access to Information Grid  Computing power should be available on demand, for a fee  Just like the electrical power grid. Basic Idea
  • 15. 한국해양과학기술진흥원 Grid Computing 15 Core networking technology now accelerates at a much faster rate than advances in microprocessor speeds Exploiting under utilized resources Parallel CPU capacity Access to additional resources Why Grid Computing?
  • 16. 한국해양과학기술진흥원 Grid Computing  Several clusters in Grid  May include super computers, desktops, laptops, mobile devices
  • 17. 한국해양과학기술진흥원 1800 Physicists, 150 Institutes, 32 Countries 100 PB of data by 2010; 50,000 CPUs? CERNs Large Hadron Collider
  • 18. 한국해양과학기술진흥원 Data Grids for High Energy Physics Tier2 Centre ~1 TIPS Online System Offline Processor Farm ~20 TIPS CERN Computer Centre FermiLab ~4 TIPSFrance Regional Centre Italy Regional Centre Germany Regional Centre InstituteInstituteInstitute Institute ~0.25TIPS Physicist workstations ~100 MBytes/sec ~100 MBytes/sec ~622 Mbit/sec ~1 MBytes/sec There is a “bunch crossing” every 25 nsecs. There are 100 “triggers” per second Each triggered event is ~1 MByte in size Physicists work on analysis “channels”. Each institute will have ~10 physicists working on one or more channels; data for these channels should be cached by the institute server Physics data cache ~PBytes/sec ~622 Mbits/sec or Air Freight (deprecated) Tier2 Centre ~1 TIPS Tier2 Centre ~1 TIPS Tier2 Centre ~1 TIPS Caltech ~1 TIPS ~622 Mbits/sec Tier 0 Tier 1 Tier 2 Tier 4 1 TIPS is approximately 25,000 SpecInt95 equivalents
  • 19. 한국해양과학기술진흥원 Grid Fabric Grid Apps. Grid Middleware Grid Tools Networked Resources across Organisations Computers Clusters Data Sources Scientific InstrumentsStorage Systems Local Resource Managers Operating Systems Queuing Systems TCP/IP & UDP … Libraries & App Kernels … Distributed Resources Coupling Services Security Information … QoSProcess Development Environments and Tools Languages Libraries Debuggers … Web toolsResource BrokersMonitoring Applications and Portals Prob. Solving Env.Scientific …CollaborationEngineering Web enabled Apps Resource Trading Grid Components Market Info
  • 20. 한국해양과학기술진흥원 Overview: Clusters x GridsA large proportion of personal computer’s computational power is left unused A desktop grid takes this unused capacity  Local Desktop Grid • Comprised mainly of a set of computers at one location  Volunteer Desktop Grid • Resources in a volunteer desktop grid are provided by citizens all over the world Desktop Grid
  • 21. 한국해양과학기술진흥원 Types of Grids  Computational Grid  Processing power is the main computing resource shared amongst nodes  Distributed Supercomputing • Executes the application in parallel on multiple machines to reduce the completion time  High throughput • Increases the completion rate of a stream of jobs  Data Grid  Data storage capacity as the main shared resource amongst nodes
  • 23. 한국해양과학기술진흥원 Overview: Clusters x GridsManages the pool of resources available to Grid  Processors  Network bandwidth  Disk storage The pool includes resources from different providers  RMS should maintain the required level of trust • Without affecting performance  RMS should adhere to different policies  RMS should meet QoS requirements Resource Management System
  • 24. 한국해양과학기술진흥원 Overview: Clusters x Grids Core Functions of Resource Management System
  • 25. 한국해양과학기술진흥원 Overview: Clusters x GridsResource Dissemination and Discovery Protocols  Used to determine the state of the resources • Resource Dissemination Protocol • Provides information about the resources • Discovery Protocol • Provides a mechanism by which resource information can be found Resource resolution and co-allocation protocols  To schedule the job at the remote resource  Simultaneously acquire multiple resources Core Functions of Resource Management System
  • 26. 한국해양과학기술진흥원 Overview: Clusters x GridsMachine Organization  Organization of the machines in the Grid affects the communication patterns and thus • determines the scalability Resource Management System
  • 27. 한국해양과학기술진흥원 Overview: Clusters x Grids Centralized Organization • a single controller or designated set of controllers performs the scheduling for all machines • suffer from scalability issues  Decentralized Organization • Roles are distributed among machines • Sender initiated • Receiver initiated Resource Management System
  • 28. 한국해양과학기술진흥원 Overview: Clusters x Grids  Flat Organization • All machines can directly communicate with each other without going through  Hierarchical Organization • Machines in the same level can directly communicate with the machines directly above them or below them  Cell or Group Organization • Machines within the cell communicate between themselves using flat organization • Designated machines within the cell function acts as boundary elements that are responsible for all communication outside the cell • Flat cell structure has only one level of cells • Hierarchical cell structure can have cells that contain other cells Resource Management System
  • 29. 한국해양과학기술진흥원 Overview: Clusters x GridsQoS Support  QoS is not limited to network bandwidth but extends to the processing and storage capabilities of the nodes  Resource reservation is one of the ways of providing guaranteed QoS  Key components of QoS • Admission control determines if requested level of service can be given • Policing ensures that job does not violate agreed upon level of service Resource Management System
  • 30. 한국해양과학기술진흥원 Overview: Clusters x GridsResource Discovery and Dissemination  Discovery is initiated by applications to find suitable resources  Dissemination is initiated by resources to find suitable application Resource Management System
  • 31. 한국해양과학기술진흥원 Overview: Clusters x GridsScheduling  Determining when and where the jobs are executed and how many resources are allocated  Time-shared job-scheduling approaches • Multiple jobs share the same resources  Space-shared job-scheduling approaches • Multiple jobs can run at any point of time by the available nodes  Gang or Synchronous Scheduling • Scheduling all tasks of application at the same time  Loosely coordinated co-scheduling • Schedule communicating tasks of application at the same time Resource Management System
  • 32. 한국해양과학기술진흥원 Overview: Clusters x GridsScheduling Objectives  Minimize response time and  Maximize system utilization  Trade-off • Maximizing system utilization may increase response time Resource Management System
  • 33. 한국해양과학기술진흥원 Overview: Clusters x GridsJob Requirements  Independent jobs  Dependent jobs • Precedence dependency • Parallel Dependency Resource Management System
  • 34. 한국해양과학기술진흥원 Overview: Clusters x GridsScheduling Resource Management System
  • 35. 한국해양과학기술진흥원 Overview: Clusters x GridsState Estimation  Predictive state estimation uses current and historical job and resource status information  Non-predictive state estimation uses only the current job and resource status information Resource Management System
  • 36. 한국해양과학기술진흥원 Overview: Clusters x GridsRescheduling  To improve utilization, balance load, etc  Periodic or batch rescheduling approaches group resource requests and system events which are then processed at intervals  Event driven online rescheduling performs rescheduling as soon the RMS receives the resource request or system event Resource Management System