• Need for Hadoop
o Introduction to Big Data
o Problems with existing traditional systems
o Requirements for a new approach
o Comparing SQL databases and NoSQL (Hadoop)
• Hadoop Basic Concepts
o An Overview of Hadoop
o Configuring Hadoop on Ubuntu
o First example in Hadoop
• MapReduce
o What is MapReduce?
o Data flow in MapReduce
o Map operation
o Reduce operation
o Real-world "MapReduce" problems
o Execution strategies for MapReduce
• The Hadoop Distributed Filesystem
o NameNodes
o DataNodes
o The Command-Line Interface
o Reading and writing data using Java
o Hadoop Archives
• Delving Deeper Into the Hadoop API
o Using Combiners
o Reducing Intermediate Data with Combiners
o Writing Partitioners for Better Load Balancing
o Directly Accessing HDFS
o Hands-On Exercise
• Common MapReduce Algorithms
o Sorting
o Searching
o Indexing
• Hadoop Optimizations
• Hadoop Best Practices
• Introduction to HBase
o What is HBase?
o HBase Architecture
o HBase API
o Managing large data sets with HBase
o Using HBase in Hadoop applications
• Introduction to ZooKeeper
• Summary
o Sample Applications
o References
Big Data Hadoop Online Training Institute
