Workshop1

•Als PPT, PDF herunterladen•

0 gefällt mir•36 views

This document provides an overview of a course on data science and big data analytics. The course aims to build a fundamental understanding of big data problems and Hadoop as a solution. It covers topics like understanding big data and Hadoop, the Hadoop architecture and components, using Pig, Hive, Hbase, and Oozie with Hadoop, and performing data manipulation and machine learning techniques using R. The course structure includes VM installation, lectures on various Hadoop topics, and hands-on practice with Hadoop and R. Assessments include quizzes, a case study, and installing Hadoop clusters. The target audience are students who have completed courses in C, Java, data structures, operating systems, and computer networks

Ingenieurwesen

Data Science and Big Data Analytics
N Chandra Shekar
Assistant Professor
Department of CSE
RGUKT RK Valley
1NChandu, CSE, RKV
Big Data by India is licensed under a Creative Commons Attribution 4.0 International License.

NChandu, CSE, RKV 2
Course Description –
This course builds a essential fundamental understanding of Big Data problems and
Hadoop as a solution. This course takes you through:
1.Understanding of Big Data problems with easy to understand examples.
2.History and advent of Hadoop right from when Hadoop wasn’t even named Hadoop.
3.What is Hadoop Magic which makes it so unique and powerful.
4.Understanding the difference between Data science and data engineering, which is one
of the big confusions in selecting a carrier or understanding a job role.
5.And most importantly, demystifying Hadoop vendors like Cloudera, MapR and
Hortonworks by understanding about them.

NChandu, CSE, RKV 3
Learning Outcomes–
* Describe the Big Data landscape including examples of real world big data problems
including the three key sources of Big Data: people, organizations, and sensors.
* Explain the V’s of Big Data (volume, velocity, variety, veracity, valence, and value)
and why each impacts data collection, monitoring, storage, analysis and reporting.
* Get value out of Big Data by using a 5-step process to structure your analysis.
* Identify what are and what are not big data problems and be able to recast big data
problems as data science questions.
* Provide an explanation of the architectural components and programming models used
for scalable big data analysis.
* Summarize the features and value of core Hadoop stack components including the
YARN resource and job management system, the HDFS file system and the Map Reduce
programming model.
* Install and run a program using Hadoop!

NChandu, CSE, RKV 4
Course Structure –
1.VM Installation
2.Understanding Big Data and Hadoop
3.Hadoop Architecture and HDFS
4.Hadoop MapReduce Framework
5.Pig, Hive, Hbase, Oozie
6.Data Manipulation Using R
7.Machine Learning Techniques using R

NChandu, CSE, RKV 5
Delivery Format –
Will be combination of blended and Online.

NChandu, CSE, RKV 6
Learning Activities–
1. A discussion forum will be created in the Moodle course page, where students can post
there doubts, which can be clarified by teacher / any other fellow student.
2.Students will participate in a activity where they will be evaluation the works of fellow
students, which might include evaluating quizzes etc.

NChandu, CSE, RKV 7
Assessement–
1.Quiz 1 – Introduction to Big Data
2.Case Study – Cloudera Cluster
3.Installation of Single Node and Multi Node cluster
4.Installation of R Studio and R in Ubuntu.

NChandu, CSE, RKV 8
Expected Participation –
1.Ideally students from E3 and E4 who have completed, courses such as C, Java, DS, OS
and CN will be preferable to enroll into the course.
2.Students from E1 and E2 and Enroll into the course only for learning “ R Programming
Language”.

Weitere ähnliche Inhalte

Ähnlich wie Workshop1

Hadoop essentials by shiva achari - sample chapterShiva Achari

Big Data and Hadoop Training in Bangalore by myTectramyTectra Learning Solutions Private Ltd

Big data analytics_using_hadoopKnowledgehut

17CS008.pdfSiva453615

BDAModule-1.pptxbharathmadival8055

Big data processing using - Hadoop TechnologyShital Kat

Hadoop framework thesis (3)JonySaini2

HareeshHareesh Ravulapati

Hadoop_Admin_eVenkatVenkat Krishnan

Hadoop Based Data DiscoveryBenjamin Ashkar

Hadoop - Architectural road map for Hadoop Ecosystemnallagangus

Hadoop Administration Certification Training in BangaloremyTectra Learning Solutions Private Ltd

Hadoop ReportNishant Gandhi

IJSRED-V2I3P43IJSRED

2. Develop a MapReduce program to calculate the frequency of a given word in ...Prof. Maulik Trivedi

Tlep rdbms iigursharan786

Big DataSridhar Mamella

Hadoop training kit from lcc infotechlccinfotech

hadoop expVenkata Ramakumar Maturu

Social Media Market Trender with Dache Manager Using Hadoop and Visualization...IRJET Journal

Ähnlich wie Workshop1 (20)

Hadoop essentials by shiva achari - sample chapter

Big Data and Hadoop Training in Bangalore by myTectra

Big data analytics_using_hadoop

17CS008.pdf

BDAModule-1.pptx

Big data processing using - Hadoop Technology

Hadoop framework thesis (3)

Hareesh

Hadoop_Admin_eVenkat

Hadoop Based Data Discovery

Hadoop - Architectural road map for Hadoop Ecosystem

Hadoop Administration Certification Training in Bangalore

Hadoop Report

IJSRED-V2I3P43

2. Develop a MapReduce program to calculate the frequency of a given word in ...

Tlep rdbms ii

Big Data

Hadoop training kit from lcc infotech

hadoop exp

Social Media Market Trender with Dache Manager Using Hadoop and Visualization...

Kürzlich hochgeladen

Moment Distribution Method For Btech CivilVinayVitekari

Computer Networks Basics of Network DevicesChandrakantDivate1

PE 459 LECTURE 2- natural gas basic concepts and propertiessarkmank1

Hospital management system project report.pdfKamal Acharya

FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsArindam Chakraborty, Ph.D., P.E. (CA, TX)

Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxMuhammadAsimMuhammad6

Engineering Drawing focus on projection of planesRAJNEESHKUMAR341697

COST-EFFETIVE and Energy Efficient BUILDINGS ptxJIT KUMAR GUPTA

HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxSCMS School of Architecture

Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X79953056974 Low Rate Call Girls In Saket, Delhi NCR

Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Servicemeghakumariji156

Thermal Engineering-R & A / C - unit - VDineshKumar4165

Verification of thevenin's theorem for BEEE Lab (1).pptxchumtiyababu

Work-Permit-Receiver-in-Saudi-Aramco.pptxJuliansyahHarahap1

NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...Amil baba

HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARKOUSTAV SARKAR

Block diagram reduction techniques in control systems.pptNANDHAKUMARA10

"Lesotho Leaps Forward: A Chronicle of Transformative Developments"mphochane1998

School management system project Report.pdfKamal Acharya

Design For Accessibility: Getting it right from the startQuintin Balsdon

Kürzlich hochgeladen (20)

Moment Distribution Method For Btech Civil

Computer Networks Basics of Network Devices

PE 459 LECTURE 2- natural gas basic concepts and properties

Hospital management system project report.pdf

FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads

Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx

Engineering Drawing focus on projection of planes

COST-EFFETIVE and Energy Efficient BUILDINGS ptx

HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx

Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7

Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service

Thermal Engineering-R & A / C - unit - V

Verification of thevenin's theorem for BEEE Lab (1).pptx

Work-Permit-Receiver-in-Saudi-Aramco.pptx

NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...

HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR

Block diagram reduction techniques in control systems.ppt

"Lesotho Leaps Forward: A Chronicle of Transformative Developments"

School management system project Report.pdf

Design For Accessibility: Getting it right from the start

Workshop1

1. Data Science and Big Data Analytics N Chandra Shekar Assistant Professor Department of CSE RGUKT RK Valley 1NChandu, CSE, RKV Big Data by India is licensed under a Creative Commons Attribution 4.0 International License.

2. NChandu, CSE, RKV 2 Course Description – This course builds a essential fundamental understanding of Big Data problems and Hadoop as a solution. This course takes you through: 1.Understanding of Big Data problems with easy to understand examples. 2.History and advent of Hadoop right from when Hadoop wasn’t even named Hadoop. 3.What is Hadoop Magic which makes it so unique and powerful. 4.Understanding the difference between Data science and data engineering, which is one of the big confusions in selecting a carrier or understanding a job role. 5.And most importantly, demystifying Hadoop vendors like Cloudera, MapR and Hortonworks by understanding about them.

3. NChandu, CSE, RKV 3 Learning Outcomes– * Describe the Big Data landscape including examples of real world big data problems including the three key sources of Big Data: people, organizations, and sensors. * Explain the V’s of Big Data (volume, velocity, variety, veracity, valence, and value) and why each impacts data collection, monitoring, storage, analysis and reporting. * Get value out of Big Data by using a 5-step process to structure your analysis. * Identify what are and what are not big data problems and be able to recast big data problems as data science questions. * Provide an explanation of the architectural components and programming models used for scalable big data analysis. * Summarize the features and value of core Hadoop stack components including the YARN resource and job management system, the HDFS file system and the Map Reduce programming model. * Install and run a program using Hadoop!

4. NChandu, CSE, RKV 4 Course Structure – 1.VM Installation 2.Understanding Big Data and Hadoop 3.Hadoop Architecture and HDFS 4.Hadoop MapReduce Framework 5.Pig, Hive, Hbase, Oozie 6.Data Manipulation Using R 7.Machine Learning Techniques using R

5. NChandu, CSE, RKV 5 Delivery Format – Will be combination of blended and Online.

6. NChandu, CSE, RKV 6 Learning Activities– 1. A discussion forum will be created in the Moodle course page, where students can post there doubts, which can be clarified by teacher / any other fellow student. 2.Students will participate in a activity where they will be evaluation the works of fellow students, which might include evaluating quizzes etc.

7. NChandu, CSE, RKV 7 Assessement– 1.Quiz 1 – Introduction to Big Data 2.Case Study – Cloudera Cluster 3.Installation of Single Node and Multi Node cluster 4.Installation of R Studio and R in Ubuntu.

8. NChandu, CSE, RKV 8 Expected Participation – 1.Ideally students from E3 and E4 who have completed, courses such as C, Java, DS, OS and CN will be preferable to enroll into the course. 2.Students from E1 and E2 and Enroll into the course only for learning “ R Programming Language”.

Workshop1

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Ähnlich wie Workshop1

Ähnlich wie Workshop1 (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Workshop1