SlideShare ist ein Scribd-Unternehmen logo
1 von 24
BIG DATA TRAINING
IN AMRITSAR
E2MATRIX
Contact : +91 9041262727
HISTORY OF HADOOP
Hadoop was created by Doug Cutting, the creator
of Apache Lucene, the widely used text search
library. Hadoop has its origins in Apache Nutch, an
open source web search engine, itself a part of the
Lucene project.
 The name Hadoop is not an acronym; it’s a made-up name. The project’s
creator, Doug Cutting, explains how the name came about:
The name my kid gave a stuffed yellow elephant. Short,
relatively easy to spell and pronounce, meaningless, and not used
elsewhere: those are my naming criteria. Kids are good at generating
such. Googol is a kid’s term.
 Subprojects and “contrib” modules in Hadoop also tend to have names
that are unre-lated to their function, often with an elephant or other animal
theme (“Pig,” for example). Smaller components are given more
descriptive (and therefore more mun-dane) names. This is a good
principle, as it means you can generally work out what something does
from its name. For example, the jobtracker9 keeps track of MapReduce
jobs.
INTRODUCTION TO HADOOP
• Hadoop is an open source software framework that supports data- intensive distributed
applications. It is licensed under the Apache v2 license, and generally known as Apache
Hadoop.
• Hadoop has been developed based on a paper originally written by Google on MapReduce
system and applies concepts of functional programming; It is written in Java programming
language and is the highest-level Apache project being constructed and used by a global
community of contributors.
INTRODUCTION TO HADOOP
Big giants like Yahoo and Facebook are using Hadoop as an
integral part of their functioning – in 2008, Yahoo! Inc. established the
world’s largest Hadoop production application. Also, the Yahoo! Search
Webmap is a Hadoop application that runs on over 10,000 core Linux
clusters, generating data that is now widely used in every Yahoo! Web
search query. On the other hand, Facebook uses Apache Hadoop to
keep track of its billions of user profiles as well as all the data related to
them like their images, posts, comments, videos, etc.
HADOOP IS NOT A DATABASE:
Hadoop an efficient distributed file system and not a
database. It is designed specifically for information that comes
in many forms, such as server log files or personal productivity
documents. Anything that can be stored as a file can be placed
in a Hadoop repository.
HADOOP IS USED FOR:
 Search - Yahoo, Amazon, Zvents
 Log processing - Facebook, Yahoo
 Data Warehouse - Facebook, AOL
 Video and Image Analysis - New York Times, Eyealike
WHY HADOOP ?
Hadoop is a free, Java-based programming framework
that supports the processing of large data sets in a distributed
computing environment.
Because Hadoop is open source and can run on commodity
hardware, the initial cost savings are dramatic and continue to
grow as your organizational data grows.
It is part of the Apache project sponsored by the Apache
Software Foundation.
WHY HADOOP ?
Single Source of Truth:-
With the enterprise data warehouse approach,
organizations find their data scattered across many systems and
silos. This decentralized environment can result in slow processing
and inefficient data analysis. Hadoop makes it possible to
consolidate your data and business intelligence capabilities within
an Enterprise Data Hub. The ability to save all organizational data
at its lowest level of granularity and bring all archive data into an
Enterprise Data Hub gives business users greater and faster
access to data.
WHY HADOOP ?
WHY HADOOP ?
Faster Data Processing:-
In legacy environments, traditional ETL and batch
processes can take hours, days, or even weeks, in a world where
businesses require access to data in minutes or seconds or even
sub-seconds. Hadoop excels at high-volume batch processing.
Because of its parallel processing, Hadoop can perform batch
processes 10 times faster than on a single thread server or on the
mainframe.
WHY HADOOP ?
Get More for Less:-
The true beauty of Hadoop is its ability to cost-effectively scale to
rapidly growing data demands. With its distributed computing power,
Hadoop configures across a cluster of commodity servers, or nodes. By
augmenting its EDW environment with Hadoop, the enterprise can
decrease its cost per terabyte of storage. With cheaper storage,
organizations can keep more data that was previously too expensive to
warehouse. This allows for the capture and storage of data from any
source within the organization while decreasing the amount of data that
is “thrown away” during data cleansing.
HADOOP INTERNAL SOFTWARE ARCHITECTURE
COMPONENTS OF HADOOP
The current Apache Hadoop ecosystem consists of
the Hadoop kernel, MapReduce, the Hadoop distributed file
system (HDFS) and a number of related projects such as
Apache Hive, HBase and Zookeeper. MapReduce and
Hadoop distributed file system (HDFS) are the main
component of Hadoop.
MapReduce:
The framework that understands and assigns work to
the nodes in a cluster
COMPONENTS OF HADOOP
Hadoop distributed file system (HDFS):
HDFS is the file system that spans all the nodes in a Hadoop
cluster for data storage. It links together the file systems on many
local nodes to make them into one big file system. HDFS assumes
nodes will fail, so it achieves reliability by replicating data across
multiple nodes.
HADOOP ECOSYSTEM
ADVANTAGE OF HADOOP
 Hadoop is Scalable
 Hadoop is Cost effective
 Hadoop is Flexible
 Hadoop is Fault tolerant
PREREQUISITE TO LEARN HADOOP ?
There is no strict prerequisite to start learning
Hadoop.
However, if you want to become an expert in
Hadoop and make an excellent career, you should
have at least basic knowledge of Java and Linux
IS JAVAREQUIRED TO LEARN HADOOP?
Knowing Java is an added advantage, but Java is not
strictly a prerequisite for working with Hadoop.
Why Java is not strictly a prerequisite:
Tools like Hive and Pig that are built on top of Hadoop offer
their own high-level languages for working with data on your cluster.If
you want to write your own MapReduce code, you can do so in any
language (e.g. Perl, Python, Ruby, C, etc.) that supports reading from
standard input and writing to standard output with Hadoop Streaming
IS JAVAREQUIRED TO LEARN HADOOP?
Added advantage of Java in Hadoop:
Although you can use Streaming to write your map
and reduce functions in the language of your choice, there
are some advanced features that are (at present) only
available via the Java API.
LINUX IS EXTRA BENEFIT WHILE LEARNING HADOOP?
Hadoop can run on Windows, it was built initially on Linux
and Linux is the preferred method for both installing and managing
Hadoop.
Having a solid understanding of getting around in a Linux
shell will also help you tremendously in digesting Hadoop,
especially with regards to many of the HDFS command line
parameters
COURSE CONTENT
Hadoop Introduction and Overview:
• What is Hadoop?
• History of Hadoop
• Building Blocks – Hadoop Eco-System
• Who is behind Hadoop?
• What Hadoop is good for and what it is not
Hadoop Distributed File System (HDFS):
• HDFS Overview and Architecture
• HDFS Installation
• Hadoop File System Shell
• File System Java API
COURSE CONTENT
Map/Reduce:
• Map/Reduce Overview and Architecture
• Installation
• Developing Map/Red Jobs
• Input and Output Formats
• Job Configuration
• Job Submission
• HDFS as a Source and Sink
• HBase as a Source and Sink
• Hadoop Streaming
E2MATRIX TRAINING AND RESEARCH INSTITUTE
Shop No. 12 Above Smartways Overseas
Sutlej Market, Near Bus Stand
Jalandhar, Punjab ( INDIA ).
Contact : +91 9041262727, 9779363902
Website : www.e2matrix.com
Email : support@e2matrix.com

Weitere ähnliche Inhalte

Was ist angesagt?

What are Hadoop Components? Hadoop Ecosystem and Architecture | Edureka
What are Hadoop Components? Hadoop Ecosystem and Architecture | EdurekaWhat are Hadoop Components? Hadoop Ecosystem and Architecture | Edureka
What are Hadoop Components? Hadoop Ecosystem and Architecture | EdurekaEdureka!
 
Hadoop in Practice (SDN Conference, Dec 2014)
Hadoop in Practice (SDN Conference, Dec 2014)Hadoop in Practice (SDN Conference, Dec 2014)
Hadoop in Practice (SDN Conference, Dec 2014)Marcel Krcah
 
The Hadoop Ecosystem
The Hadoop EcosystemThe Hadoop Ecosystem
The Hadoop EcosystemJ Singh
 
Hadoop distributions - ecosystem
Hadoop distributions - ecosystemHadoop distributions - ecosystem
Hadoop distributions - ecosystemJakub Stransky
 
20131205 hadoop-hdfs-map reduce-introduction
20131205 hadoop-hdfs-map reduce-introduction20131205 hadoop-hdfs-map reduce-introduction
20131205 hadoop-hdfs-map reduce-introductionXuan-Chao Huang
 
Apache Hadoop 1.1
Apache Hadoop 1.1Apache Hadoop 1.1
Apache Hadoop 1.1Sperasoft
 
Hadoop installation, Configuration, and Mapreduce program
Hadoop installation, Configuration, and Mapreduce programHadoop installation, Configuration, and Mapreduce program
Hadoop installation, Configuration, and Mapreduce programPraveen Kumar Donta
 
Hadoop hive presentation
Hadoop hive presentationHadoop hive presentation
Hadoop hive presentationArvind Kumar
 
Hadoop 101
Hadoop 101Hadoop 101
Hadoop 101EMC
 
Overview of Big data, Hadoop and Microsoft BI - version1
Overview of Big data, Hadoop and Microsoft BI - version1Overview of Big data, Hadoop and Microsoft BI - version1
Overview of Big data, Hadoop and Microsoft BI - version1Thanh Nguyen
 
Facebooks Petabyte Scale Data Warehouse using Hive and Hadoop
Facebooks Petabyte Scale Data Warehouse using Hive and HadoopFacebooks Petabyte Scale Data Warehouse using Hive and Hadoop
Facebooks Petabyte Scale Data Warehouse using Hive and Hadooproyans
 
Hadoop And Their Ecosystem
 Hadoop And Their Ecosystem Hadoop And Their Ecosystem
Hadoop And Their Ecosystemsunera pathan
 
Seminar Presentation Hadoop
Seminar Presentation HadoopSeminar Presentation Hadoop
Seminar Presentation HadoopVarun Narang
 

Was ist angesagt? (20)

Hadoop
HadoopHadoop
Hadoop
 
What are Hadoop Components? Hadoop Ecosystem and Architecture | Edureka
What are Hadoop Components? Hadoop Ecosystem and Architecture | EdurekaWhat are Hadoop Components? Hadoop Ecosystem and Architecture | Edureka
What are Hadoop Components? Hadoop Ecosystem and Architecture | Edureka
 
Hadoop in Practice (SDN Conference, Dec 2014)
Hadoop in Practice (SDN Conference, Dec 2014)Hadoop in Practice (SDN Conference, Dec 2014)
Hadoop in Practice (SDN Conference, Dec 2014)
 
The Hadoop Ecosystem
The Hadoop EcosystemThe Hadoop Ecosystem
The Hadoop Ecosystem
 
Hadoop distributions - ecosystem
Hadoop distributions - ecosystemHadoop distributions - ecosystem
Hadoop distributions - ecosystem
 
20131205 hadoop-hdfs-map reduce-introduction
20131205 hadoop-hdfs-map reduce-introduction20131205 hadoop-hdfs-map reduce-introduction
20131205 hadoop-hdfs-map reduce-introduction
 
Hadoop Technologies
Hadoop TechnologiesHadoop Technologies
Hadoop Technologies
 
Apache Hadoop 1.1
Apache Hadoop 1.1Apache Hadoop 1.1
Apache Hadoop 1.1
 
Hadoop installation, Configuration, and Mapreduce program
Hadoop installation, Configuration, and Mapreduce programHadoop installation, Configuration, and Mapreduce program
Hadoop installation, Configuration, and Mapreduce program
 
Hadoop hive presentation
Hadoop hive presentationHadoop hive presentation
Hadoop hive presentation
 
Hadoop 101
Hadoop 101Hadoop 101
Hadoop 101
 
Overview of Big data, Hadoop and Microsoft BI - version1
Overview of Big data, Hadoop and Microsoft BI - version1Overview of Big data, Hadoop and Microsoft BI - version1
Overview of Big data, Hadoop and Microsoft BI - version1
 
Facebooks Petabyte Scale Data Warehouse using Hive and Hadoop
Facebooks Petabyte Scale Data Warehouse using Hive and HadoopFacebooks Petabyte Scale Data Warehouse using Hive and Hadoop
Facebooks Petabyte Scale Data Warehouse using Hive and Hadoop
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop ecosystem
Hadoop ecosystemHadoop ecosystem
Hadoop ecosystem
 
HDFS
HDFSHDFS
HDFS
 
Hadoop And Their Ecosystem
 Hadoop And Their Ecosystem Hadoop And Their Ecosystem
Hadoop And Their Ecosystem
 
Hadoop - Introduction to Hadoop
Hadoop - Introduction to HadoopHadoop - Introduction to Hadoop
Hadoop - Introduction to Hadoop
 
Seminar Presentation Hadoop
Seminar Presentation HadoopSeminar Presentation Hadoop
Seminar Presentation Hadoop
 
Hadoop overview
Hadoop overviewHadoop overview
Hadoop overview
 

Ähnlich wie Big Data Training in Amritsar

Hadoop online training
Hadoop online training Hadoop online training
Hadoop online training Keylabs
 
Overview of big data & hadoop version 1 - Tony Nguyen
Overview of big data & hadoop   version 1 - Tony NguyenOverview of big data & hadoop   version 1 - Tony Nguyen
Overview of big data & hadoop version 1 - Tony NguyenThanh Nguyen
 
Apache hadoop introduction and architecture
Apache hadoop  introduction and architectureApache hadoop  introduction and architecture
Apache hadoop introduction and architectureHarikrishnan K
 
Introduction to Apache hadoop
Introduction to Apache hadoopIntroduction to Apache hadoop
Introduction to Apache hadoopOmar Jaber
 
What does the future of Big data look like?How to get a fresher job in data a...
What does the future of Big data look like?How to get a fresher job in data a...What does the future of Big data look like?How to get a fresher job in data a...
What does the future of Big data look like?How to get a fresher job in data a...Acutesoft Solutions India Pvt Ltd
 
Hadoop tutorial-pdf.pdf
Hadoop tutorial-pdf.pdfHadoop tutorial-pdf.pdf
Hadoop tutorial-pdf.pdfSheetal Jain
 
Hadoop essentials by shiva achari - sample chapter
Hadoop essentials by shiva achari - sample chapterHadoop essentials by shiva achari - sample chapter
Hadoop essentials by shiva achari - sample chapterShiva Achari
 
Hadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log ProcessingHadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log ProcessingHitendra Kumar
 

Ähnlich wie Big Data Training in Amritsar (20)

Hadoop online training
Hadoop online training Hadoop online training
Hadoop online training
 
Hadoop Tutorial for Beginners
Hadoop Tutorial for BeginnersHadoop Tutorial for Beginners
Hadoop Tutorial for Beginners
 
Bigdata ppt
Bigdata pptBigdata ppt
Bigdata ppt
 
Bigdata
BigdataBigdata
Bigdata
 
Overview of big data & hadoop version 1 - Tony Nguyen
Overview of big data & hadoop   version 1 - Tony NguyenOverview of big data & hadoop   version 1 - Tony Nguyen
Overview of big data & hadoop version 1 - Tony Nguyen
 
Apache hadoop introduction and architecture
Apache hadoop  introduction and architectureApache hadoop  introduction and architecture
Apache hadoop introduction and architecture
 
Hadoop .pdf
Hadoop .pdfHadoop .pdf
Hadoop .pdf
 
Introduction to Apache hadoop
Introduction to Apache hadoopIntroduction to Apache hadoop
Introduction to Apache hadoop
 
Lecture 2 Hadoop.pptx
Lecture 2 Hadoop.pptxLecture 2 Hadoop.pptx
Lecture 2 Hadoop.pptx
 
What does the future of Big data look like?How to get a fresher job in data a...
What does the future of Big data look like?How to get a fresher job in data a...What does the future of Big data look like?How to get a fresher job in data a...
What does the future of Big data look like?How to get a fresher job in data a...
 
HDFS
HDFSHDFS
HDFS
 
Hadoop in action
Hadoop in actionHadoop in action
Hadoop in action
 
Hadoop tutorial-pdf.pdf
Hadoop tutorial-pdf.pdfHadoop tutorial-pdf.pdf
Hadoop tutorial-pdf.pdf
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop content
Hadoop contentHadoop content
Hadoop content
 
Hadoop essentials by shiva achari - sample chapter
Hadoop essentials by shiva achari - sample chapterHadoop essentials by shiva achari - sample chapter
Hadoop essentials by shiva achari - sample chapter
 
Intro to Hadoop
Intro to HadoopIntro to Hadoop
Intro to Hadoop
 
Hadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log ProcessingHadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log Processing
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop hdfs
Hadoop hdfsHadoop hdfs
Hadoop hdfs
 

Mehr von E2MATRIX

Electrical Training in Phagwara
Electrical Training in PhagwaraElectrical Training in Phagwara
Electrical Training in PhagwaraE2MATRIX
 
Electrical Training in Mohali
Electrical Training in MohaliElectrical Training in Mohali
Electrical Training in MohaliE2MATRIX
 
Electrical Training in Ludhiana
Electrical Training in LudhianaElectrical Training in Ludhiana
Electrical Training in LudhianaE2MATRIX
 
Electrical Training in Jalandhar
Electrical Training in JalandharElectrical Training in Jalandhar
Electrical Training in JalandharE2MATRIX
 
Electrical Training in Chandigarh
Electrical Training in ChandigarhElectrical Training in Chandigarh
Electrical Training in ChandigarhE2MATRIX
 
Electrical Training in Amritsar
Electrical Training in AmritsarElectrical Training in Amritsar
Electrical Training in AmritsarE2MATRIX
 
Machine Learning Training in Phagwara
Machine Learning Training in PhagwaraMachine Learning Training in Phagwara
Machine Learning Training in PhagwaraE2MATRIX
 
Machine Learning Training in Ludhiana
Machine Learning Training in LudhianaMachine Learning Training in Ludhiana
Machine Learning Training in LudhianaE2MATRIX
 
Machine Learning Training in Amritsar
Machine Learning Training in AmritsarMachine Learning Training in Amritsar
Machine Learning Training in AmritsarE2MATRIX
 
Machine Learning Training in Mohali
Machine Learning Training in MohaliMachine Learning Training in Mohali
Machine Learning Training in MohaliE2MATRIX
 
Machine Learning Training in Jalandhar
Machine Learning Training in JalandharMachine Learning Training in Jalandhar
Machine Learning Training in JalandharE2MATRIX
 
Machine Learning Training in Chandigarh
Machine Learning Training in ChandigarhMachine Learning Training in Chandigarh
Machine Learning Training in ChandigarhE2MATRIX
 
Raspberry Pi training in Ludhiana
Raspberry Pi training in LudhianaRaspberry Pi training in Ludhiana
Raspberry Pi training in LudhianaE2MATRIX
 
Raspberry Pi Training in Phagwara
Raspberry Pi Training in PhagwaraRaspberry Pi Training in Phagwara
Raspberry Pi Training in PhagwaraE2MATRIX
 
Raspberry Pi Training in Mohali
Raspberry Pi Training in MohaliRaspberry Pi Training in Mohali
Raspberry Pi Training in MohaliE2MATRIX
 
Raspberry Pi Training in Chandigarh
Raspberry Pi Training in ChandigarhRaspberry Pi Training in Chandigarh
Raspberry Pi Training in ChandigarhE2MATRIX
 
Raspberry Pi Training in Amritsar
Raspberry Pi Training in AmritsarRaspberry Pi Training in Amritsar
Raspberry Pi Training in AmritsarE2MATRIX
 
Raspberry Pi Training in Jalandhar
Raspberry Pi Training in JalandharRaspberry Pi Training in Jalandhar
Raspberry Pi Training in JalandharE2MATRIX
 
Selenium Training in Phagwara
Selenium Training in PhagwaraSelenium Training in Phagwara
Selenium Training in PhagwaraE2MATRIX
 
Selenium Training in Mohali
Selenium Training in MohaliSelenium Training in Mohali
Selenium Training in MohaliE2MATRIX
 

Mehr von E2MATRIX (20)

Electrical Training in Phagwara
Electrical Training in PhagwaraElectrical Training in Phagwara
Electrical Training in Phagwara
 
Electrical Training in Mohali
Electrical Training in MohaliElectrical Training in Mohali
Electrical Training in Mohali
 
Electrical Training in Ludhiana
Electrical Training in LudhianaElectrical Training in Ludhiana
Electrical Training in Ludhiana
 
Electrical Training in Jalandhar
Electrical Training in JalandharElectrical Training in Jalandhar
Electrical Training in Jalandhar
 
Electrical Training in Chandigarh
Electrical Training in ChandigarhElectrical Training in Chandigarh
Electrical Training in Chandigarh
 
Electrical Training in Amritsar
Electrical Training in AmritsarElectrical Training in Amritsar
Electrical Training in Amritsar
 
Machine Learning Training in Phagwara
Machine Learning Training in PhagwaraMachine Learning Training in Phagwara
Machine Learning Training in Phagwara
 
Machine Learning Training in Ludhiana
Machine Learning Training in LudhianaMachine Learning Training in Ludhiana
Machine Learning Training in Ludhiana
 
Machine Learning Training in Amritsar
Machine Learning Training in AmritsarMachine Learning Training in Amritsar
Machine Learning Training in Amritsar
 
Machine Learning Training in Mohali
Machine Learning Training in MohaliMachine Learning Training in Mohali
Machine Learning Training in Mohali
 
Machine Learning Training in Jalandhar
Machine Learning Training in JalandharMachine Learning Training in Jalandhar
Machine Learning Training in Jalandhar
 
Machine Learning Training in Chandigarh
Machine Learning Training in ChandigarhMachine Learning Training in Chandigarh
Machine Learning Training in Chandigarh
 
Raspberry Pi training in Ludhiana
Raspberry Pi training in LudhianaRaspberry Pi training in Ludhiana
Raspberry Pi training in Ludhiana
 
Raspberry Pi Training in Phagwara
Raspberry Pi Training in PhagwaraRaspberry Pi Training in Phagwara
Raspberry Pi Training in Phagwara
 
Raspberry Pi Training in Mohali
Raspberry Pi Training in MohaliRaspberry Pi Training in Mohali
Raspberry Pi Training in Mohali
 
Raspberry Pi Training in Chandigarh
Raspberry Pi Training in ChandigarhRaspberry Pi Training in Chandigarh
Raspberry Pi Training in Chandigarh
 
Raspberry Pi Training in Amritsar
Raspberry Pi Training in AmritsarRaspberry Pi Training in Amritsar
Raspberry Pi Training in Amritsar
 
Raspberry Pi Training in Jalandhar
Raspberry Pi Training in JalandharRaspberry Pi Training in Jalandhar
Raspberry Pi Training in Jalandhar
 
Selenium Training in Phagwara
Selenium Training in PhagwaraSelenium Training in Phagwara
Selenium Training in Phagwara
 
Selenium Training in Mohali
Selenium Training in MohaliSelenium Training in Mohali
Selenium Training in Mohali
 

Kürzlich hochgeladen

How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17Celine George
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsKarakKing
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...Nguyen Thanh Tu Collection
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxPooja Bhuva
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Pooja Bhuva
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 

Kürzlich hochgeladen (20)

How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 

Big Data Training in Amritsar

  • 1. BIG DATA TRAINING IN AMRITSAR E2MATRIX Contact : +91 9041262727
  • 2. HISTORY OF HADOOP Hadoop was created by Doug Cutting, the creator of Apache Lucene, the widely used text search library. Hadoop has its origins in Apache Nutch, an open source web search engine, itself a part of the Lucene project.
  • 3.  The name Hadoop is not an acronym; it’s a made-up name. The project’s creator, Doug Cutting, explains how the name came about: The name my kid gave a stuffed yellow elephant. Short, relatively easy to spell and pronounce, meaningless, and not used elsewhere: those are my naming criteria. Kids are good at generating such. Googol is a kid’s term.  Subprojects and “contrib” modules in Hadoop also tend to have names that are unre-lated to their function, often with an elephant or other animal theme (“Pig,” for example). Smaller components are given more descriptive (and therefore more mun-dane) names. This is a good principle, as it means you can generally work out what something does from its name. For example, the jobtracker9 keeps track of MapReduce jobs.
  • 4. INTRODUCTION TO HADOOP • Hadoop is an open source software framework that supports data- intensive distributed applications. It is licensed under the Apache v2 license, and generally known as Apache Hadoop. • Hadoop has been developed based on a paper originally written by Google on MapReduce system and applies concepts of functional programming; It is written in Java programming language and is the highest-level Apache project being constructed and used by a global community of contributors.
  • 5. INTRODUCTION TO HADOOP Big giants like Yahoo and Facebook are using Hadoop as an integral part of their functioning – in 2008, Yahoo! Inc. established the world’s largest Hadoop production application. Also, the Yahoo! Search Webmap is a Hadoop application that runs on over 10,000 core Linux clusters, generating data that is now widely used in every Yahoo! Web search query. On the other hand, Facebook uses Apache Hadoop to keep track of its billions of user profiles as well as all the data related to them like their images, posts, comments, videos, etc.
  • 6. HADOOP IS NOT A DATABASE: Hadoop an efficient distributed file system and not a database. It is designed specifically for information that comes in many forms, such as server log files or personal productivity documents. Anything that can be stored as a file can be placed in a Hadoop repository.
  • 7. HADOOP IS USED FOR:  Search - Yahoo, Amazon, Zvents  Log processing - Facebook, Yahoo  Data Warehouse - Facebook, AOL  Video and Image Analysis - New York Times, Eyealike
  • 8. WHY HADOOP ? Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. Because Hadoop is open source and can run on commodity hardware, the initial cost savings are dramatic and continue to grow as your organizational data grows. It is part of the Apache project sponsored by the Apache Software Foundation.
  • 9. WHY HADOOP ? Single Source of Truth:- With the enterprise data warehouse approach, organizations find their data scattered across many systems and silos. This decentralized environment can result in slow processing and inefficient data analysis. Hadoop makes it possible to consolidate your data and business intelligence capabilities within an Enterprise Data Hub. The ability to save all organizational data at its lowest level of granularity and bring all archive data into an Enterprise Data Hub gives business users greater and faster access to data.
  • 11. WHY HADOOP ? Faster Data Processing:- In legacy environments, traditional ETL and batch processes can take hours, days, or even weeks, in a world where businesses require access to data in minutes or seconds or even sub-seconds. Hadoop excels at high-volume batch processing. Because of its parallel processing, Hadoop can perform batch processes 10 times faster than on a single thread server or on the mainframe.
  • 12. WHY HADOOP ? Get More for Less:- The true beauty of Hadoop is its ability to cost-effectively scale to rapidly growing data demands. With its distributed computing power, Hadoop configures across a cluster of commodity servers, or nodes. By augmenting its EDW environment with Hadoop, the enterprise can decrease its cost per terabyte of storage. With cheaper storage, organizations can keep more data that was previously too expensive to warehouse. This allows for the capture and storage of data from any source within the organization while decreasing the amount of data that is “thrown away” during data cleansing.
  • 13. HADOOP INTERNAL SOFTWARE ARCHITECTURE
  • 14. COMPONENTS OF HADOOP The current Apache Hadoop ecosystem consists of the Hadoop kernel, MapReduce, the Hadoop distributed file system (HDFS) and a number of related projects such as Apache Hive, HBase and Zookeeper. MapReduce and Hadoop distributed file system (HDFS) are the main component of Hadoop. MapReduce: The framework that understands and assigns work to the nodes in a cluster
  • 15. COMPONENTS OF HADOOP Hadoop distributed file system (HDFS): HDFS is the file system that spans all the nodes in a Hadoop cluster for data storage. It links together the file systems on many local nodes to make them into one big file system. HDFS assumes nodes will fail, so it achieves reliability by replicating data across multiple nodes.
  • 17. ADVANTAGE OF HADOOP  Hadoop is Scalable  Hadoop is Cost effective  Hadoop is Flexible  Hadoop is Fault tolerant
  • 18. PREREQUISITE TO LEARN HADOOP ? There is no strict prerequisite to start learning Hadoop. However, if you want to become an expert in Hadoop and make an excellent career, you should have at least basic knowledge of Java and Linux
  • 19. IS JAVAREQUIRED TO LEARN HADOOP? Knowing Java is an added advantage, but Java is not strictly a prerequisite for working with Hadoop. Why Java is not strictly a prerequisite: Tools like Hive and Pig that are built on top of Hadoop offer their own high-level languages for working with data on your cluster.If you want to write your own MapReduce code, you can do so in any language (e.g. Perl, Python, Ruby, C, etc.) that supports reading from standard input and writing to standard output with Hadoop Streaming
  • 20. IS JAVAREQUIRED TO LEARN HADOOP? Added advantage of Java in Hadoop: Although you can use Streaming to write your map and reduce functions in the language of your choice, there are some advanced features that are (at present) only available via the Java API.
  • 21. LINUX IS EXTRA BENEFIT WHILE LEARNING HADOOP? Hadoop can run on Windows, it was built initially on Linux and Linux is the preferred method for both installing and managing Hadoop. Having a solid understanding of getting around in a Linux shell will also help you tremendously in digesting Hadoop, especially with regards to many of the HDFS command line parameters
  • 22. COURSE CONTENT Hadoop Introduction and Overview: • What is Hadoop? • History of Hadoop • Building Blocks – Hadoop Eco-System • Who is behind Hadoop? • What Hadoop is good for and what it is not Hadoop Distributed File System (HDFS): • HDFS Overview and Architecture • HDFS Installation • Hadoop File System Shell • File System Java API
  • 23. COURSE CONTENT Map/Reduce: • Map/Reduce Overview and Architecture • Installation • Developing Map/Red Jobs • Input and Output Formats • Job Configuration • Job Submission • HDFS as a Source and Sink • HBase as a Source and Sink • Hadoop Streaming
  • 24. E2MATRIX TRAINING AND RESEARCH INSTITUTE Shop No. 12 Above Smartways Overseas Sutlej Market, Near Bus Stand Jalandhar, Punjab ( INDIA ). Contact : +91 9041262727, 9779363902 Website : www.e2matrix.com Email : support@e2matrix.com