SlideShare ist ein Scribd-Unternehmen logo
1 von 13
Downloaden Sie, um offline zu lesen
@RMSSoftwareTech training@rmssoftwaretech.com
http://www.rmssoftwaretech.com
Apache Hadoop
(Big Data)
Big Data Training
2
© 2014 RMS Software Tech (rmssoftwaretech.com)
Logo’s & Trademarks
• Note: Any logos used in this presentation are owned by their
respective companies and are only used in this slide deck for
educational purposes. No other companies are responsible for or
provide attribution for any of the material in these slides.
• This slide deck is released under a Creative Commons License
and can be reused in your own presentations, however please
research the specific meanings of these symbols:
• You may be able to use the slide deck for purposes beyond the
CC license if you email me with the special request.
•  All third party trademark rights acknowledged
3
© 2014 RMS Software Tech (rmssoftwaretech.com)
Profile : RMS Software Technologies
•  About us :
◦  Leading provider of Software solutions, System integration Services and
Professional Training
◦  Based in San Jose, CA & Mumbai, India started in 2012.
•  Professional Courses :
◦  Expert Training team of 10 people who provide training in various leading
technologies like iPhone iOS, Android, Java, Big Data Hadoop, QA & Agile Scrum
Methodologies.
◦  Team of developers using AngularJS at the client projects.
◦  Focused on Interns and engineers looking to learn new exciting technologies.
•  Track Record:
◦  We work with emerging technologies to create mobile applications, rich-client desktop
software, and large-scale systems (CRM, ERP).
◦  We create high quality solutions for hard problems, to help our customers thrive
◦  We can share this expertise with you developers, in the form of workshop style, hands-on
training classes.
4
© 2014 RMS Software Tech (rmssoftwaretech.com)
Course : Prerequisites and Equipment
•  Prerequisites :
◦  Students should have experience with Database (DBMS) like Oracle,
Informix, Sybase. No prior experience of Big Data or NOSQL and Hadoop
is required for the course.
•  Equipment :
◦  Please use - Laptop (Windows, Linux, or Mac).
5
© 2014 RMS Software Tech (rmssoftwaretech.com)
Training Agenda
Training Schedule & Agenda
Week 1 : Hadoop Overview
Week 2 : HDFS Deep Dive
Week 3 : MapReduce and Pig
Week 4 : Hive and HBase
Week 5 : Zookeeper, Oozie, Flume, Talend
Week 6 : Practice Questions, Q & A with Final Project
We believe this curriculum covers the basics well, and positions students to use
Hadoop effectively and efficiently. It provides a good overview on Hadoop and Big
Data
6
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 1 : Hadoop Overview
•  Brief History of Hadoop
•  RDBMS/SQL vs. Hadoop
•  Structured vs. Unstructured data
•  Introduction to Hadoop Ecosystem (HDFS, MapReduce, Pig, Hive, HBase)
•  HDFS Overview (NameNode vs. DataNode)
•  MapReduce overview (JobTracker vs. TaskTracker)
•  Hadoop XML files for configuration
•  Hadoop Ecosystem (Hive, Pig, Hbase, Zookeeper, Mahout, Oozie, Talend,
Scoop, Flume)
•  Lab #1 Virtual Machine Setup
7
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 2 : HDFS Deep Dive
•  NameNode Architecture
•  DataNode Architecture
•  Write Pipeline
•  Read Pipeline
•  HDFS Disk space quotas and number of file quotas
•  Quick Intro to Java API interface
•  Lab #2.
8
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 3 : MapReduce and Pig
•  MapReduce Architecture
•  Combiner, Partitioner
•  JobTracker & TaskTracker
•  Job Scheduling
•  Distributed Cache
•  Counters
•  MapReduce configuration files
•  Simple MapReduce example : WordCount
•  Next Gen MapReduce : YARN.
•  Lab #3 : MapReduce
•  Lab #4 : Pig
9
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 4: Hive and HBase
•  Hive architecture.
•  Hive vs. RDBMS.
•  HiveQL and Hive. Shell
•  Managing Tables
•  Querying Data
•  Data Types and Schemas
•  Introduction to UDF (User Defined Functions)
•  HBase Architecture
•  HBase vs. Cassandra
10
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 4: HBase
•  Bloom Filters and Block indexes
•  Table Scans and Filters
•  Lab # Intro to HBase command line.
11
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 5: Zookeeper, Oozie, Flume, Sqoop, Talend
•  Flume overview
•  Flume usage
•  Sqoop overview
•  Sqoop usage
•  Hadoop workflow
•  Jobcontrol
•  Oozie
•  Talend
•  Sqoop
12
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 6: Project and Practice Questions
•  Sample Project
•  Practice Questions
•  Q & A
Foundation for tomorrow
@RMSSoftwareTech
training@rmssoftwaretech.com

Weitere ähnliche Inhalte

Andere mochten auch (7)

Your Big Data Stack is Too Big!: Presented by Timothy Potter, Lucidworks
Your Big Data Stack is Too Big!: Presented by Timothy Potter, LucidworksYour Big Data Stack is Too Big!: Presented by Timothy Potter, Lucidworks
Your Big Data Stack is Too Big!: Presented by Timothy Potter, Lucidworks
 
07 Using Oracle-Supported Package in Application Development
07 Using Oracle-Supported Package in Application Development07 Using Oracle-Supported Package in Application Development
07 Using Oracle-Supported Package in Application Development
 
06 Using More Package Concepts
06 Using More Package Concepts06 Using More Package Concepts
06 Using More Package Concepts
 
03 Writing Control Structures, Writing with Compatible Data Types Using Expli...
03 Writing Control Structures, Writing with Compatible Data Types Using Expli...03 Writing Control Structures, Writing with Compatible Data Types Using Expli...
03 Writing Control Structures, Writing with Compatible Data Types Using Expli...
 
Big Data Tech Stack
Big Data Tech StackBig Data Tech Stack
Big Data Tech Stack
 
Big Data Technology Stack : Nutshell
Big Data Technology Stack : NutshellBig Data Technology Stack : Nutshell
Big Data Technology Stack : Nutshell
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 

Ähnlich wie Big Data Hadoop Training Course

Hadoop online training in india
Hadoop online training  in indiaHadoop online training  in india
Hadoop online training in india
Madhu Trainer
 
Hadoop training kit from lcc infotech
Hadoop   training kit from lcc infotechHadoop   training kit from lcc infotech
Hadoop training kit from lcc infotech
lccinfotech
 
Hadoop course content
Hadoop course contentHadoop course content
Hadoop course content
RS Trainings
 
Sasmita bigdata resume
Sasmita bigdata resumeSasmita bigdata resume
Sasmita bigdata resume
Sasmita Swain
 

Ähnlich wie Big Data Hadoop Training Course (20)

Salesforce.com Training Course Agenda
Salesforce.com Training Course AgendaSalesforce.com Training Course Agenda
Salesforce.com Training Course Agenda
 
Spring Framework Training Course
Spring Framework Training Course Spring Framework Training Course
Spring Framework Training Course
 
DeepeshRehi
DeepeshRehiDeepeshRehi
DeepeshRehi
 
hadoop exp
hadoop exphadoop exp
hadoop exp
 
Hadoop 2.0-development
Hadoop 2.0-developmentHadoop 2.0-development
Hadoop 2.0-development
 
Hadoop online training in india
Hadoop online training  in indiaHadoop online training  in india
Hadoop online training in india
 
Sudhanshu kumar hadoop
Sudhanshu kumar hadoopSudhanshu kumar hadoop
Sudhanshu kumar hadoop
 
Hadoop training kit from lcc infotech
Hadoop   training kit from lcc infotechHadoop   training kit from lcc infotech
Hadoop training kit from lcc infotech
 
Hadoop course content
Hadoop course contentHadoop course content
Hadoop course content
 
Android Mobile Development Course
Android Mobile Development Course Android Mobile Development Course
Android Mobile Development Course
 
Resume
ResumeResume
Resume
 
Learn hadoop and big data technologies
Learn hadoop and big data technologiesLearn hadoop and big data technologies
Learn hadoop and big data technologies
 
Applications on Hadoop
Applications on HadoopApplications on Hadoop
Applications on Hadoop
 
Big data analytics_using_hadoop
Big data analytics_using_hadoopBig data analytics_using_hadoop
Big data analytics_using_hadoop
 
Angular JS Training Agenda
Angular JS Training AgendaAngular JS Training Agenda
Angular JS Training Agenda
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseInnovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data Warehouse
 
Sasmita bigdata resume
Sasmita bigdata resumeSasmita bigdata resume
Sasmita bigdata resume
 
What it takes to bring Hadoop to a production-ready state
What it takes to bring Hadoop to a production-ready stateWhat it takes to bring Hadoop to a production-ready state
What it takes to bring Hadoop to a production-ready state
 
HimaBindu
HimaBinduHimaBindu
HimaBindu
 
Apache hadoop-administrator-training
Apache hadoop-administrator-trainingApache hadoop-administrator-training
Apache hadoop-administrator-training
 

Kürzlich hochgeladen

Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
MateoGardella
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
SanaAli374401
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 

Kürzlich hochgeladen (20)

Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 

Big Data Hadoop Training Course

  • 2. 2 © 2014 RMS Software Tech (rmssoftwaretech.com) Logo’s & Trademarks • Note: Any logos used in this presentation are owned by their respective companies and are only used in this slide deck for educational purposes. No other companies are responsible for or provide attribution for any of the material in these slides. • This slide deck is released under a Creative Commons License and can be reused in your own presentations, however please research the specific meanings of these symbols: • You may be able to use the slide deck for purposes beyond the CC license if you email me with the special request. •  All third party trademark rights acknowledged
  • 3. 3 © 2014 RMS Software Tech (rmssoftwaretech.com) Profile : RMS Software Technologies •  About us : ◦  Leading provider of Software solutions, System integration Services and Professional Training ◦  Based in San Jose, CA & Mumbai, India started in 2012. •  Professional Courses : ◦  Expert Training team of 10 people who provide training in various leading technologies like iPhone iOS, Android, Java, Big Data Hadoop, QA & Agile Scrum Methodologies. ◦  Team of developers using AngularJS at the client projects. ◦  Focused on Interns and engineers looking to learn new exciting technologies. •  Track Record: ◦  We work with emerging technologies to create mobile applications, rich-client desktop software, and large-scale systems (CRM, ERP). ◦  We create high quality solutions for hard problems, to help our customers thrive ◦  We can share this expertise with you developers, in the form of workshop style, hands-on training classes.
  • 4. 4 © 2014 RMS Software Tech (rmssoftwaretech.com) Course : Prerequisites and Equipment •  Prerequisites : ◦  Students should have experience with Database (DBMS) like Oracle, Informix, Sybase. No prior experience of Big Data or NOSQL and Hadoop is required for the course. •  Equipment : ◦  Please use - Laptop (Windows, Linux, or Mac).
  • 5. 5 © 2014 RMS Software Tech (rmssoftwaretech.com) Training Agenda Training Schedule & Agenda Week 1 : Hadoop Overview Week 2 : HDFS Deep Dive Week 3 : MapReduce and Pig Week 4 : Hive and HBase Week 5 : Zookeeper, Oozie, Flume, Talend Week 6 : Practice Questions, Q & A with Final Project We believe this curriculum covers the basics well, and positions students to use Hadoop effectively and efficiently. It provides a good overview on Hadoop and Big Data
  • 6. 6 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 1 : Hadoop Overview •  Brief History of Hadoop •  RDBMS/SQL vs. Hadoop •  Structured vs. Unstructured data •  Introduction to Hadoop Ecosystem (HDFS, MapReduce, Pig, Hive, HBase) •  HDFS Overview (NameNode vs. DataNode) •  MapReduce overview (JobTracker vs. TaskTracker) •  Hadoop XML files for configuration •  Hadoop Ecosystem (Hive, Pig, Hbase, Zookeeper, Mahout, Oozie, Talend, Scoop, Flume) •  Lab #1 Virtual Machine Setup
  • 7. 7 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 2 : HDFS Deep Dive •  NameNode Architecture •  DataNode Architecture •  Write Pipeline •  Read Pipeline •  HDFS Disk space quotas and number of file quotas •  Quick Intro to Java API interface •  Lab #2.
  • 8. 8 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 3 : MapReduce and Pig •  MapReduce Architecture •  Combiner, Partitioner •  JobTracker & TaskTracker •  Job Scheduling •  Distributed Cache •  Counters •  MapReduce configuration files •  Simple MapReduce example : WordCount •  Next Gen MapReduce : YARN. •  Lab #3 : MapReduce •  Lab #4 : Pig
  • 9. 9 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 4: Hive and HBase •  Hive architecture. •  Hive vs. RDBMS. •  HiveQL and Hive. Shell •  Managing Tables •  Querying Data •  Data Types and Schemas •  Introduction to UDF (User Defined Functions) •  HBase Architecture •  HBase vs. Cassandra
  • 10. 10 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 4: HBase •  Bloom Filters and Block indexes •  Table Scans and Filters •  Lab # Intro to HBase command line.
  • 11. 11 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 5: Zookeeper, Oozie, Flume, Sqoop, Talend •  Flume overview •  Flume usage •  Sqoop overview •  Sqoop usage •  Hadoop workflow •  Jobcontrol •  Oozie •  Talend •  Sqoop
  • 12. 12 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 6: Project and Practice Questions •  Sample Project •  Practice Questions •  Q & A