SlideShare ist ein Scribd-Unternehmen logo
1 von 8
Downloaden Sie, um offline zu lesen
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Introduction :
Big Data and Hadoop training course is designed to provide knowledge and skills to
become a successful Hadoop Developer. In-depth knowledge of concepts such as
Hadoop Distributed File System, Hadoop Cluster, Map-Reduce, Hbase Zookeeper etc.
will be covered in the course.
Reason To Attend :
After the completion of the Big Data and Hadoop Course at KPM, you
should be able to:
• Master the concepts of Hadoop Distributed File System and
MapReduce framework
• Setup a Hadoop Cluster
• Understand Data Loading Techniques using Sqoop and Flume
• Program in MapReduce (Both MRv1 and MRv2)
• Learn to write Complex MapReduce programs
• Program in YARN (MRv2)
• Perform Data Analytics using Pig and Hive
• Implement HBase, MapReduce Integration, Advanced Usage
and Advanced Indexing
• Have a good understanding of ZooKeeper service
• New features in Hadoop 2.0 -- YARN, HDFS Federation,
NameNode High Availability
• Implement best Practices for Hadoop Development and
Debugging
• Implement a Hadoop Project
• Work on a Real Life Project on Big Data Analytics and gain
Hands on Project Experience
Who should attend :
This course is designed for
professionals aspiring to make a
career in Big Data Analytics
using Hadoop Framework.
Software Professionals,
Analytics Professionals, ETL
developers, Project Managers,
Testing Professionals are the
key beneficiaries of this course.
Other professionals who are
looking forward to acquire a
solid foundation of Hadoop
Architecture can also opt for this
course.
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Course Content :
Big Data Economy …………………………………………………………… 1.5 Hrs.
• What is Big Data
• Characteristics of Big Data
• How did data become so Big
• Why should you care about Big Data
• Uses Cases of Big Data Analysis
• What are possible options for analyzing big data
• Traditional Distributed Systems
• Problem with traditional Distributed systems
Hadoop Introduction………………………………………………………… 1.5 Hrs.
• What is Hadoop
• History of Hadoop
• How does Hadoop solve Big Data Problem
• Components of Hadoop
• Hadoop Flavours
Hadoop Distributed File System Part 1…...……………………………… 2 Hrs
• HDFS Architecture
• HDFS Internals
• HDFS Use Cases
• HDFS Daemons
• Files and Blocks
• Namenode Memory Concerns
• Secondary Namenode
• HDFS Access Options
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Installing Hadoop (Single Node)…......……..……….…………………… 1 Hrs
• Installation Overview
• Hadoop Installation
• Hadoop Daemons Stuff
Advanced Hadoop Distributed File System Concepts………….…… 2 Hrs.
• HDFS Workshops
• HDFS API
• How to use Configuration class
• Using HDFS in MapReduce
• Using HDFS Programmatically
• HDFS Permission and Security
• Additional HDFS Tasks
• Rebalancing Blocks
• Copying Large Sets of Files
• Decommissioning Nodes
• Verifying File System Health
• Rack Awareness
• HDFS Web Interface
Map-Reduce Workshops………...…..……………………………………....… 5 Hrs
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Introduction to MapReduce ……….…………………………………..…… 3 Hrs
• MapReduce Basics
• Functional Programming Concepts
• List Processing
• Mapping Lists
• Reducing Lists
• Putting them Together in MapReduce
• An Example Application: Word Count
• Understanding the Driver
• Understanding the Mapper
• Understanding the Reducer
• MapReduce Data Flow
• A Closer look
• Additional MapReduce Functionality
• Fault Tolerance
Advanced MapReduce Concepts…..……………………………………..…. 2 Hrs
• Understanding Combiners
• Understanding Partitioners
• Understanding input formats
• Understanding output formats
• Distributed Cache
• Understanding Counters
• More Tips
• Chaining Jobs
• Listing and Killing Jobs
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Cloud Computing Overview………..…………………………...…….....…… 1 Hrs
• Cloud Computing Introduction
• SaaS/PaaS/IaaS
• Characteristics
Installing Hadoop (Multi Node)………..………………………..............…… 1 Hrs
• Cluster Configurations
• Configuring Masters
• Configuring Slaves
• Cluster Stuff
Hadoop Ecosystem Pig ….………………………………………………………. 1 Hrs
• Pig Programs structure and Execution Process
• Joins
• Filtering
• Group and Co-Group
• Schema merging and redefining schema
• Pig functions
Hadoop Ecosystem Hive…………………………………………………………. 2 Hrs
• Motivation and Understanding Hive
• Using Hive Command line interface
• Data types and File Formats
• Basic DDL operations
• Schema Design
• An Example of Pig and Hive
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Hadoop Ecosystem HBase and Zookeeper………….………………………. 1 Hrs
• HBase Overview
• HBase Architecture
• HBase Installation
• HBase Admin : Test
• HBase Client: Client Loading Overview
• Fully Distributed HBase Configuration
• Loading HBase
• HBase Data Access
Hadoop Ecosystem Sqoop …………………………………………………. 1 Hrs
• Sqoop Overview
• Sqoop Installation
• Importing Data
• Exporting Data
Hadoop Ecosystem Oozie………………………………………………..…. 1 Hrs
• Oozie overview
• Oozie Features
• Bundle
• Scalability
• Usability
• Oozie challenges
Hadoop Ecosystem Apache Flume……………….…………………..……. 1 Hrs
• Apache Flume Overview
• How it Works
• Flume Connection with HDFS
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Hadoop Version 2 Concepts …………………….………………………….. 2 Hrs
• Yarn
• Hadoop Federation
• Authentication in Hadoop
• High Availability
Administration Refresher……………………………………………………… 1 Hrs
• Setting up Hadoop Cluster – Considerations
• Most Important Configurations
• Installation Options
• Scheduling in Hadoop
• FIFO Scheduler
• FAIR Scheduler
Building a Web Log Analysis POC using MapReduce..…….……….…... 2 Hrs
• Designing Structures for POC
• With MapReduce develop code
• Push data using Flume into HDFS
• Run MapReduce Code
• Analyse the Output
Real Life Project and POC…………………………………….……….....……….... 6 Hrs
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Training Methodlogy :
- 80% training is practical
- The duration of course is 36 - 40 Hrs
- Individual attention is provided to all candidates
- Training involves multiple workshops to explain the practical concepts
- Regular assignments will be given to the candidates
- Study material, PPTs, Project and POC codes, etc. will be given to the candidates
- Course involves 3 Proof Of Concepts
- Course involves a Real Life Project
- Trainer will assist you for interview preparation
About The Organizer :
KPM Learning Solutions – Shaping your Future
KPI is one-stop learning solutions that offer a wide portfolio of learning and consulting services. We
provide tailored, practical, in-house and open house learning solutions in sync with the recent industrial
and technological trends.
We design, develop and deliver world-class academic and highly innovative learning programs in IT
and Mobility, Leadership & Management and other related areas world across.
“KPM” denotes the success factors and performance measurement which is directed towards the
strategic goals of any organization and few sets of key skills.
Our aim is to upgrade and set those key skills that are result oriented and bring organizational
excellence by all means.
You can log on to – www.kpmlearnings.com

Weitere ähnliche Inhalte

Was ist angesagt?

Bikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarnBikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
hdhappy001
 
Hadoop applicationarchitectures
Hadoop applicationarchitecturesHadoop applicationarchitectures
Hadoop applicationarchitectures
Doug Chang
 

Was ist angesagt? (20)

Bikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarnBikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
 
Back to School - St. Louis Hadoop Meetup September 2016
Back to School - St. Louis Hadoop Meetup September 2016Back to School - St. Louis Hadoop Meetup September 2016
Back to School - St. Louis Hadoop Meetup September 2016
 
hadoop_module6
hadoop_module6hadoop_module6
hadoop_module6
 
Hadoop_Its_Not_Just_Internal_Storage_V14
Hadoop_Its_Not_Just_Internal_Storage_V14Hadoop_Its_Not_Just_Internal_Storage_V14
Hadoop_Its_Not_Just_Internal_Storage_V14
 
Best hadoop-online-training
Best hadoop-online-trainingBest hadoop-online-training
Best hadoop-online-training
 
Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...
Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...
Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...
 
Big Data and Hadoop in Cloud - Leveraging Amazon EMR
Big Data and Hadoop in Cloud - Leveraging Amazon EMRBig Data and Hadoop in Cloud - Leveraging Amazon EMR
Big Data and Hadoop in Cloud - Leveraging Amazon EMR
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
 
Philly DB MapR Overview
Philly DB MapR OverviewPhilly DB MapR Overview
Philly DB MapR Overview
 
Hadoop 31-frequently-asked-interview-questions
Hadoop 31-frequently-asked-interview-questionsHadoop 31-frequently-asked-interview-questions
Hadoop 31-frequently-asked-interview-questions
 
Drill dchug-29 nov2012
Drill dchug-29 nov2012Drill dchug-29 nov2012
Drill dchug-29 nov2012
 
2015 GHC Presentation - High Availability and High Frequency Big Data Analytics
2015 GHC Presentation - High Availability and High Frequency Big Data Analytics2015 GHC Presentation - High Availability and High Frequency Big Data Analytics
2015 GHC Presentation - High Availability and High Frequency Big Data Analytics
 
Apache Spark & Hadoop
Apache Spark & HadoopApache Spark & Hadoop
Apache Spark & Hadoop
 
Hadoop applicationarchitectures
Hadoop applicationarchitecturesHadoop applicationarchitectures
Hadoop applicationarchitectures
 
Advanced Hadoop Tuning and Optimization - Hadoop Consulting
Advanced Hadoop Tuning and Optimization - Hadoop ConsultingAdvanced Hadoop Tuning and Optimization - Hadoop Consulting
Advanced Hadoop Tuning and Optimization - Hadoop Consulting
 
HUG slides on NFS and ODBC
HUG slides on NFS and ODBCHUG slides on NFS and ODBC
HUG slides on NFS and ODBC
 
Challenges & Capabilites in Managing a MapR Cluster by David Tucker
Challenges & Capabilites in Managing a MapR Cluster by David TuckerChallenges & Capabilites in Managing a MapR Cluster by David Tucker
Challenges & Capabilites in Managing a MapR Cluster by David Tucker
 
Training
TrainingTraining
Training
 
Hadoop Interview Questions and Answers
Hadoop Interview Questions and AnswersHadoop Interview Questions and Answers
Hadoop Interview Questions and Answers
 
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
 

Ähnlich wie Learn Hadoop at your Leisure time

Hadoop_Architect__eVenkat
Hadoop_Architect__eVenkatHadoop_Architect__eVenkat
Hadoop_Architect__eVenkat
Venkat Krishnan
 
Hadoop online training in india
Hadoop online training  in indiaHadoop online training  in india
Hadoop online training in india
Madhu Trainer
 
project--2 nd review_2
project--2 nd review_2project--2 nd review_2
project--2 nd review_2
Aswini Ashu
 
project--2 nd review_2
project--2 nd review_2project--2 nd review_2
project--2 nd review_2
aswini pilli
 
Cloudera hadoop developer training
Cloudera hadoop developer trainingCloudera hadoop developer training
Cloudera hadoop developer training
Magnific Trainings
 
Cloudera hadoop developer training
Cloudera hadoop developer trainingCloudera hadoop developer training
Cloudera hadoop developer training
Magnific Trainings
 
Cloudera hadoop developer training
Cloudera hadoop developer trainingCloudera hadoop developer training
Cloudera hadoop developer training
Magnific Trainings
 

Ähnlich wie Learn Hadoop at your Leisure time (20)

Hadoop_Architect__eVenkat
Hadoop_Architect__eVenkatHadoop_Architect__eVenkat
Hadoop_Architect__eVenkat
 
Big data analytics_using_hadoop
Big data analytics_using_hadoopBig data analytics_using_hadoop
Big data analytics_using_hadoop
 
List of Engineering Colleges in Uttarakhand
List of Engineering Colleges in UttarakhandList of Engineering Colleges in Uttarakhand
List of Engineering Colleges in Uttarakhand
 
Hadoop.pptx
Hadoop.pptxHadoop.pptx
Hadoop.pptx
 
Hadoop.pptx
Hadoop.pptxHadoop.pptx
Hadoop.pptx
 
Hadoop online training in india
Hadoop online training  in indiaHadoop online training  in india
Hadoop online training in india
 
Hadoop ppt1
Hadoop ppt1Hadoop ppt1
Hadoop ppt1
 
HadoopCon- Trend Micro SPN Hadoop Overview
HadoopCon- Trend Micro SPN Hadoop OverviewHadoopCon- Trend Micro SPN Hadoop Overview
HadoopCon- Trend Micro SPN Hadoop Overview
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop_Admin_eVenkat
Hadoop_Admin_eVenkatHadoop_Admin_eVenkat
Hadoop_Admin_eVenkat
 
project--2 nd review_2
project--2 nd review_2project--2 nd review_2
project--2 nd review_2
 
project--2 nd review_2
project--2 nd review_2project--2 nd review_2
project--2 nd review_2
 
Managing growth in Production Hadoop Deployments
Managing growth in Production Hadoop DeploymentsManaging growth in Production Hadoop Deployments
Managing growth in Production Hadoop Deployments
 
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
 
Cloudera hadoop developer training
Cloudera hadoop developer trainingCloudera hadoop developer training
Cloudera hadoop developer training
 
Cloudera hadoop developer training
Cloudera hadoop developer trainingCloudera hadoop developer training
Cloudera hadoop developer training
 
Manoj CV
Manoj CVManoj CV
Manoj CV
 
Cloudera hadoop developer training
Cloudera hadoop developer trainingCloudera hadoop developer training
Cloudera hadoop developer training
 
Hadoop
HadoopHadoop
Hadoop
 
Strata NY 2014 - Architectural considerations for Hadoop applications tutorial
Strata NY 2014 - Architectural considerations for Hadoop applications tutorialStrata NY 2014 - Architectural considerations for Hadoop applications tutorial
Strata NY 2014 - Architectural considerations for Hadoop applications tutorial
 

Kürzlich hochgeladen

Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 

Kürzlich hochgeladen (20)

Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
OSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsOSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & Systems
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 

Learn Hadoop at your Leisure time

  • 1.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Introduction : Big Data and Hadoop training course is designed to provide knowledge and skills to become a successful Hadoop Developer. In-depth knowledge of concepts such as Hadoop Distributed File System, Hadoop Cluster, Map-Reduce, Hbase Zookeeper etc. will be covered in the course. Reason To Attend : After the completion of the Big Data and Hadoop Course at KPM, you should be able to: • Master the concepts of Hadoop Distributed File System and MapReduce framework • Setup a Hadoop Cluster • Understand Data Loading Techniques using Sqoop and Flume • Program in MapReduce (Both MRv1 and MRv2) • Learn to write Complex MapReduce programs • Program in YARN (MRv2) • Perform Data Analytics using Pig and Hive • Implement HBase, MapReduce Integration, Advanced Usage and Advanced Indexing • Have a good understanding of ZooKeeper service • New features in Hadoop 2.0 -- YARN, HDFS Federation, NameNode High Availability • Implement best Practices for Hadoop Development and Debugging • Implement a Hadoop Project • Work on a Real Life Project on Big Data Analytics and gain Hands on Project Experience Who should attend : This course is designed for professionals aspiring to make a career in Big Data Analytics using Hadoop Framework. Software Professionals, Analytics Professionals, ETL developers, Project Managers, Testing Professionals are the key beneficiaries of this course. Other professionals who are looking forward to acquire a solid foundation of Hadoop Architecture can also opt for this course.
  • 2.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Course Content : Big Data Economy …………………………………………………………… 1.5 Hrs. • What is Big Data • Characteristics of Big Data • How did data become so Big • Why should you care about Big Data • Uses Cases of Big Data Analysis • What are possible options for analyzing big data • Traditional Distributed Systems • Problem with traditional Distributed systems Hadoop Introduction………………………………………………………… 1.5 Hrs. • What is Hadoop • History of Hadoop • How does Hadoop solve Big Data Problem • Components of Hadoop • Hadoop Flavours Hadoop Distributed File System Part 1…...……………………………… 2 Hrs • HDFS Architecture • HDFS Internals • HDFS Use Cases • HDFS Daemons • Files and Blocks • Namenode Memory Concerns • Secondary Namenode • HDFS Access Options
  • 3.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Installing Hadoop (Single Node)…......……..……….…………………… 1 Hrs • Installation Overview • Hadoop Installation • Hadoop Daemons Stuff Advanced Hadoop Distributed File System Concepts………….…… 2 Hrs. • HDFS Workshops • HDFS API • How to use Configuration class • Using HDFS in MapReduce • Using HDFS Programmatically • HDFS Permission and Security • Additional HDFS Tasks • Rebalancing Blocks • Copying Large Sets of Files • Decommissioning Nodes • Verifying File System Health • Rack Awareness • HDFS Web Interface Map-Reduce Workshops………...…..……………………………………....… 5 Hrs
  • 4.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Introduction to MapReduce ……….…………………………………..…… 3 Hrs • MapReduce Basics • Functional Programming Concepts • List Processing • Mapping Lists • Reducing Lists • Putting them Together in MapReduce • An Example Application: Word Count • Understanding the Driver • Understanding the Mapper • Understanding the Reducer • MapReduce Data Flow • A Closer look • Additional MapReduce Functionality • Fault Tolerance Advanced MapReduce Concepts…..……………………………………..…. 2 Hrs • Understanding Combiners • Understanding Partitioners • Understanding input formats • Understanding output formats • Distributed Cache • Understanding Counters • More Tips • Chaining Jobs • Listing and Killing Jobs
  • 5.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Cloud Computing Overview………..…………………………...…….....…… 1 Hrs • Cloud Computing Introduction • SaaS/PaaS/IaaS • Characteristics Installing Hadoop (Multi Node)………..………………………..............…… 1 Hrs • Cluster Configurations • Configuring Masters • Configuring Slaves • Cluster Stuff Hadoop Ecosystem Pig ….………………………………………………………. 1 Hrs • Pig Programs structure and Execution Process • Joins • Filtering • Group and Co-Group • Schema merging and redefining schema • Pig functions Hadoop Ecosystem Hive…………………………………………………………. 2 Hrs • Motivation and Understanding Hive • Using Hive Command line interface • Data types and File Formats • Basic DDL operations • Schema Design • An Example of Pig and Hive
  • 6.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Hadoop Ecosystem HBase and Zookeeper………….………………………. 1 Hrs • HBase Overview • HBase Architecture • HBase Installation • HBase Admin : Test • HBase Client: Client Loading Overview • Fully Distributed HBase Configuration • Loading HBase • HBase Data Access Hadoop Ecosystem Sqoop …………………………………………………. 1 Hrs • Sqoop Overview • Sqoop Installation • Importing Data • Exporting Data Hadoop Ecosystem Oozie………………………………………………..…. 1 Hrs • Oozie overview • Oozie Features • Bundle • Scalability • Usability • Oozie challenges Hadoop Ecosystem Apache Flume……………….…………………..……. 1 Hrs • Apache Flume Overview • How it Works • Flume Connection with HDFS
  • 7.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Hadoop Version 2 Concepts …………………….………………………….. 2 Hrs • Yarn • Hadoop Federation • Authentication in Hadoop • High Availability Administration Refresher……………………………………………………… 1 Hrs • Setting up Hadoop Cluster – Considerations • Most Important Configurations • Installation Options • Scheduling in Hadoop • FIFO Scheduler • FAIR Scheduler Building a Web Log Analysis POC using MapReduce..…….……….…... 2 Hrs • Designing Structures for POC • With MapReduce develop code • Push data using Flume into HDFS • Run MapReduce Code • Analyse the Output Real Life Project and POC…………………………………….……….....……….... 6 Hrs
  • 8.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Training Methodlogy : - 80% training is practical - The duration of course is 36 - 40 Hrs - Individual attention is provided to all candidates - Training involves multiple workshops to explain the practical concepts - Regular assignments will be given to the candidates - Study material, PPTs, Project and POC codes, etc. will be given to the candidates - Course involves 3 Proof Of Concepts - Course involves a Real Life Project - Trainer will assist you for interview preparation About The Organizer : KPM Learning Solutions – Shaping your Future KPI is one-stop learning solutions that offer a wide portfolio of learning and consulting services. We provide tailored, practical, in-house and open house learning solutions in sync with the recent industrial and technological trends. We design, develop and deliver world-class academic and highly innovative learning programs in IT and Mobility, Leadership & Management and other related areas world across. “KPM” denotes the success factors and performance measurement which is directed towards the strategic goals of any organization and few sets of key skills. Our aim is to upgrade and set those key skills that are result oriented and bring organizational excellence by all means. You can log on to – www.kpmlearnings.com