Suche senden
Hochladen
Map Reduce introduction
•
2 gefällt mir
•
1,533 views
M
Muralidharan Deenathayalan
Folgen
Introduction about Map Reduce program
Weniger lesen
Mehr lesen
Technologie
Melden
Teilen
Melden
Teilen
1 von 19
Empfohlen
Spark at-hackthon8jan2014
Spark at-hackthon8jan2014
Madhukara Phatak
Hadoop Map Reduce
Hadoop Map Reduce
VNIT-ACM Student Chapter
Introduction to Map Reduce
Introduction to Map Reduce
Apache Apex
Map Reduce
Map Reduce
Michel Bruley
MapReduce basic
MapReduce basic
Chirag Ahuja
Mapreduce by examples
Mapreduce by examples
Andrea Iacono
Introduction to Map-Reduce
Introduction to Map-Reduce
Brendan Tierney
Map Reduce
Map Reduce
Rahul Agarwal
Empfohlen
Spark at-hackthon8jan2014
Spark at-hackthon8jan2014
Madhukara Phatak
Hadoop Map Reduce
Hadoop Map Reduce
VNIT-ACM Student Chapter
Introduction to Map Reduce
Introduction to Map Reduce
Apache Apex
Map Reduce
Map Reduce
Michel Bruley
MapReduce basic
MapReduce basic
Chirag Ahuja
Mapreduce by examples
Mapreduce by examples
Andrea Iacono
Introduction to Map-Reduce
Introduction to Map-Reduce
Brendan Tierney
Map Reduce
Map Reduce
Rahul Agarwal
Map reduce presentation
Map reduce presentation
ateeq ateeq
Introduction to MapReduce
Introduction to MapReduce
Hassan A-j
MapReduce Paradigm
MapReduce Paradigm
Dilip Reddy
Map Reduce
Map Reduce
Vigen Sahakyan
Map Reduce
Map Reduce
schapht
An Introduction to MapReduce
An Introduction to MapReduce
Frane Bandov
Map Reduce
Map Reduce
Sri Prasanna
Analysing of big data using map reduce
Analysing of big data using map reduce
Paladion Networks
Mapreduce Algorithms
Mapreduce Algorithms
Amund Tveit
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
scottcrespo
Large Scale Data Analysis with Map/Reduce, part I
Large Scale Data Analysis with Map/Reduce, part I
Marin Dimitrov
An Introduction To Map-Reduce
An Introduction To Map-Reduce
Francisco Pérez-Sorrosal
Map reduce paradigm explained
Map reduce paradigm explained
Dmytro Sandu
MapReduce Algorithm Design
MapReduce Algorithm Design
Gabriela Agustini
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
soujavajug
Introduction to MapReduce
Introduction to MapReduce
Chicago Hadoop Users Group
Hadoop/MapReduce/HDFS
Hadoop/MapReduce/HDFS
praveen bhat
Introduction To Map Reduce
Introduction To Map Reduce
rantav
Topic 6: MapReduce Applications
Topic 6: MapReduce Applications
Zubair Nabi
Hadoop MapReduce Fundamentals
Hadoop MapReduce Fundamentals
Lynn Langit
Behm Shah Pagerank
Behm Shah Pagerank
gothicane
Hadoop ecosystem
Hadoop ecosystem
Ran Silberman
Weitere ähnliche Inhalte
Was ist angesagt?
Map reduce presentation
Map reduce presentation
ateeq ateeq
Introduction to MapReduce
Introduction to MapReduce
Hassan A-j
MapReduce Paradigm
MapReduce Paradigm
Dilip Reddy
Map Reduce
Map Reduce
Vigen Sahakyan
Map Reduce
Map Reduce
schapht
An Introduction to MapReduce
An Introduction to MapReduce
Frane Bandov
Map Reduce
Map Reduce
Sri Prasanna
Analysing of big data using map reduce
Analysing of big data using map reduce
Paladion Networks
Mapreduce Algorithms
Mapreduce Algorithms
Amund Tveit
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
scottcrespo
Large Scale Data Analysis with Map/Reduce, part I
Large Scale Data Analysis with Map/Reduce, part I
Marin Dimitrov
An Introduction To Map-Reduce
An Introduction To Map-Reduce
Francisco Pérez-Sorrosal
Map reduce paradigm explained
Map reduce paradigm explained
Dmytro Sandu
MapReduce Algorithm Design
MapReduce Algorithm Design
Gabriela Agustini
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
soujavajug
Introduction to MapReduce
Introduction to MapReduce
Chicago Hadoop Users Group
Hadoop/MapReduce/HDFS
Hadoop/MapReduce/HDFS
praveen bhat
Introduction To Map Reduce
Introduction To Map Reduce
rantav
Topic 6: MapReduce Applications
Topic 6: MapReduce Applications
Zubair Nabi
Hadoop MapReduce Fundamentals
Hadoop MapReduce Fundamentals
Lynn Langit
Was ist angesagt?
(20)
Map reduce presentation
Map reduce presentation
Introduction to MapReduce
Introduction to MapReduce
MapReduce Paradigm
MapReduce Paradigm
Map Reduce
Map Reduce
Map Reduce
Map Reduce
An Introduction to MapReduce
An Introduction to MapReduce
Map Reduce
Map Reduce
Analysing of big data using map reduce
Analysing of big data using map reduce
Mapreduce Algorithms
Mapreduce Algorithms
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
Large Scale Data Analysis with Map/Reduce, part I
Large Scale Data Analysis with Map/Reduce, part I
An Introduction To Map-Reduce
An Introduction To Map-Reduce
Map reduce paradigm explained
Map reduce paradigm explained
MapReduce Algorithm Design
MapReduce Algorithm Design
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
Introduction to MapReduce
Introduction to MapReduce
Hadoop/MapReduce/HDFS
Hadoop/MapReduce/HDFS
Introduction To Map Reduce
Introduction To Map Reduce
Topic 6: MapReduce Applications
Topic 6: MapReduce Applications
Hadoop MapReduce Fundamentals
Hadoop MapReduce Fundamentals
Ähnlich wie Map Reduce introduction
Behm Shah Pagerank
Behm Shah Pagerank
gothicane
Hadoop ecosystem
Hadoop ecosystem
Ran Silberman
MAP REDUCE IN DATA SCIENCE.pptx
MAP REDUCE IN DATA SCIENCE.pptx
HARIKRISHNANU13
Map Reduce
Map Reduce
Prashant Gupta
Hadoop ecosystem
Hadoop ecosystem
Ran Silberman
Introduction to Map-Reduce Programming with Hadoop
Introduction to Map-Reduce Programming with Hadoop
Dilum Bandara
Big Data & Analytics MapReduce/Hadoop – A programmer’s perspective
Big Data & Analytics MapReduce/Hadoop – A programmer’s perspective
EMC
MapReduce basics
MapReduce basics
Harisankar H
MapReduce wordcount program
MapReduce wordcount program
Sarwan Singh
Spark what's new what's coming
Spark what's new what's coming
Databricks
Big-data-analysis-training-in-mumbai
Big-data-analysis-training-in-mumbai
Unmesh Baile
Intro to Apache Spark by Marco Vasquez
Intro to Apache Spark by Marco Vasquez
MapR Technologies
Advance Map reduce - Apache hadoop Bigdata training by Design Pathshala
Advance Map reduce - Apache hadoop Bigdata training by Design Pathshala
Desing Pathshala
A performance analysis of OpenStack Cloud vs Real System on Hadoop Clusters
A performance analysis of OpenStack Cloud vs Real System on Hadoop Clusters
Kumari Surabhi
Taste Java In The Clouds
Taste Java In The Clouds
Jacky Chu
Mapredtutorial
Mapredtutorial
Anup Mohta
Running Cognos on Hadoop
Running Cognos on Hadoop
Senturus
Cs267 hadoop programming
Cs267 hadoop programming
Kuldeep Dhole
Dart and Flutter Basics.pptx
Dart and Flutter Basics.pptx
DSCVSSUT
Intermachine Parallelism
Intermachine Parallelism
Sri Prasanna
Ähnlich wie Map Reduce introduction
(20)
Behm Shah Pagerank
Behm Shah Pagerank
Hadoop ecosystem
Hadoop ecosystem
MAP REDUCE IN DATA SCIENCE.pptx
MAP REDUCE IN DATA SCIENCE.pptx
Map Reduce
Map Reduce
Hadoop ecosystem
Hadoop ecosystem
Introduction to Map-Reduce Programming with Hadoop
Introduction to Map-Reduce Programming with Hadoop
Big Data & Analytics MapReduce/Hadoop – A programmer’s perspective
Big Data & Analytics MapReduce/Hadoop – A programmer’s perspective
MapReduce basics
MapReduce basics
MapReduce wordcount program
MapReduce wordcount program
Spark what's new what's coming
Spark what's new what's coming
Big-data-analysis-training-in-mumbai
Big-data-analysis-training-in-mumbai
Intro to Apache Spark by Marco Vasquez
Intro to Apache Spark by Marco Vasquez
Advance Map reduce - Apache hadoop Bigdata training by Design Pathshala
Advance Map reduce - Apache hadoop Bigdata training by Design Pathshala
A performance analysis of OpenStack Cloud vs Real System on Hadoop Clusters
A performance analysis of OpenStack Cloud vs Real System on Hadoop Clusters
Taste Java In The Clouds
Taste Java In The Clouds
Mapredtutorial
Mapredtutorial
Running Cognos on Hadoop
Running Cognos on Hadoop
Cs267 hadoop programming
Cs267 hadoop programming
Dart and Flutter Basics.pptx
Dart and Flutter Basics.pptx
Intermachine Parallelism
Intermachine Parallelism
Mehr von Muralidharan Deenathayalan
What's new in C# 8.0 (beta)
What's new in C# 8.0 (beta)
Muralidharan Deenathayalan
Introduction to Jupyter notebook and MS Azure Machine Learning Studio
Introduction to Jupyter notebook and MS Azure Machine Learning Studio
Muralidharan Deenathayalan
Alfresco 5.0 features
Alfresco 5.0 features
Muralidharan Deenathayalan
Test drive on driven development process
Test drive on driven development process
Muralidharan Deenathayalan
Apache Hive - Introduction
Apache Hive - Introduction
Muralidharan Deenathayalan
Apache cassandra
Apache cassandra
Muralidharan Deenathayalan
Alfresco share 4.1 to 4.2 customisation
Alfresco share 4.1 to 4.2 customisation
Muralidharan Deenathayalan
Introduction about Alfresco webscript
Introduction about Alfresco webscript
Muralidharan Deenathayalan
Alfresco activiti workflows
Alfresco activiti workflows
Muralidharan Deenathayalan
Alfresco content model
Alfresco content model
Muralidharan Deenathayalan
Mehr von Muralidharan Deenathayalan
(10)
What's new in C# 8.0 (beta)
What's new in C# 8.0 (beta)
Introduction to Jupyter notebook and MS Azure Machine Learning Studio
Introduction to Jupyter notebook and MS Azure Machine Learning Studio
Alfresco 5.0 features
Alfresco 5.0 features
Test drive on driven development process
Test drive on driven development process
Apache Hive - Introduction
Apache Hive - Introduction
Apache cassandra
Apache cassandra
Alfresco share 4.1 to 4.2 customisation
Alfresco share 4.1 to 4.2 customisation
Introduction about Alfresco webscript
Introduction about Alfresco webscript
Alfresco activiti workflows
Alfresco activiti workflows
Alfresco content model
Alfresco content model
Kürzlich hochgeladen
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
carlostorres15106
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
Delhi Call girls
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
Slibray Presentation
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
shyamraj55
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
Rizwan Syed
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
ThousandEyes
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
OnBoard
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
naman860154
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
Memoori
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Safe Software
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
2toLead Limited
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
Fwdays
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
Ridwan Fadjar
How to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
naman860154
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
Florian Wilhelm
The transition to renewables in India.pdf
The transition to renewables in India.pdf
Competition Advisory Services (India) LLP
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
null - The Open Security Community
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
Delhi Call girls
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
Kalema Edgar
Kürzlich hochgeladen
(20)
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
How to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
The transition to renewables in India.pdf
The transition to renewables in India.pdf
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
Map Reduce introduction
1.
Confidential, Copyright ©
Quanticate Introduction to Map - Reduce Muralidharan Deenathayalan Technical Lead Muralidharan.deenathayalan@quanticate.com Apache logo are trademarks of The Apache Software Foundation. All other marks mentioned may be trademarks or registered trademarks of their respective owners.
2.
Confidential, Copyright ©
Quanticate Agenda What is Map-Reduce? Map-Reduce architecture Advantages of Map-Reduce Frameworks available for writing Map-Reduce? WordCount – Map-Reduce Program explained How to compile Map-Reduce program using Eclipse? How to deploy Map-Reduce program? How to run Map-Reduce program? Q & A
3.
Confidential, Copyright ©
Quanticate Who Am I ? 7+ years of experience in Microsoft technologies like Asp.net, C#, SQL server and SharePoint 2+ years of experience in open source technologies like Java, Alfresco and Apache Cassandra Author of Apache Cassandra Cookbook (In writing ) Csharpcorner MVP Frequent blogger
4.
Confidential, Copyright ©
Quanticate What is Map-Reduce? Generally called as Map-R program MapReduce Map() + Reduce() MapReduce is a programming approach to process large datasets in parallel, distributed on a cluster ( Divide and conquer). Map
5.
Confidential, Copyright ©
Quanticate What is Map-Reduce? • Map: – Receives input key/value pair – Outputs intermediate key/value pair • Reduce : – Receives intermediate key/value pair – Outputs key/value pair Input Data Map Reduce Reduce Map Map Input Data
6.
Confidential, Copyright ©
Quanticate Map-Reduce Architecture overview Job trackerJob tracker Task tracker Task tracker Task tracker Master node Slave node 1 Slave node 2 Slave node N Workers user Workers Workers
7.
Confidential, Copyright ©
Quanticate Advantages of Map-Reduce Distributed pattern-based searching Distributed sorting Web access logs Machine Learning
8.
Confidential, Copyright ©
Quanticate Framework available for writing Map-Reduce Courtesy & ©: http://blog.matthewrathbone.com/2013/01/05/a-quick-guide-to-hadoop-map-reduce-frameworks.html JAVA Cascading Crunch CLOJURE Cascalog SCALA Scrunch Scalding Scoobi R Rhadoop MICROSOFT .Net (C# / VB.net) SPECIAL (HIGH-LEVEL) Apache Hive Apache Pig RUBY Wukong Cascading Jruby PYTHON MR Job Dumbo Hadooppy Pydoop Luigi
9.
Confidential, Copyright ©
Quanticate WordCount – Map-Reduce Program public static class Map extends MapReduceBase implements Mapper<LongWritable, Text, Text, IntWritable> { private final static IntWritable one = new IntWritable(1); private Text word = new Text(); public void map(LongWritable key, Text value, OutputCollector<Text, IntWritable> output, Reporter reporter) throws IOException { String line = value.toString(); StringTokenizer tokenizer = new StringTokenizer(line); while (tokenizer.hasMoreTokens()) { word.set(tokenizer.nextToken()); output.collect(word, one); } } }
10.
Confidential, Copyright ©
Quanticate WordCount – Map-Reduce Program public static class Reduce extends MapReduceBase implements Reducer<Text, IntWritable, Text, IntWritable> { public void reduce(Text key, Iterator<IntWritable> values, OutputCollector<Text, IntWritable> output, Reporter reporter) throws IOException { int sum = 0; while (values.hasNext()) { sum += values.next().get(); } output.collect(key, new IntWritable(sum)); } }
11.
Confidential, Copyright ©
Quanticate WordCount – Map-Reduce Program public static void main(String[] args) throws Exception { JobConf conf = new JobConf(WordCount.class); conf.setJobName("wordcount"); conf.setOutputKeyClass(Text.class); conf.setOutputValueClass(IntWritable.class); conf.setMapperClass(Map.class); conf.setCombinerClass(Reduce.class); conf.setReducerClass(Reduce.class); conf.setInputFormat(TextInputFormat.class); conf.setOutputFormat(TextOutputFormat.class); FileInputFormat.setInputPaths(conf, new Path(args[0])); FileOutputFormat.setOutputPath(conf, new Path(args[1])); JobClient.runJob(conf); }
12.
Confidential, Copyright ©
Quanticate How to compile Map-Reduce program using Eclipse? Refer Hadoop jar file from your disk Maven is simple to use Eclipse Project Build Project No errors in the eclipse console
13.
Confidential, Copyright ©
Quanticate How to deploy Map-Reduce program?
14.
Confidential, Copyright ©
Quanticate How to run Map-Reduce program?
15.
Confidential, Copyright ©
Quanticate Summary What is Map-Reduce? Architecture of Map-Reduce? Advantages of Map-Reduce Frameworks available for Map-Reduce? WordCount – Map-Reduce Program explained Compiling WordCount Map-Reduce program using Eclipse Deploying Map-Reduce program Executing a Map-Reduce program
16.
Confidential, Copyright ©
Quanticate Q & A
17.
Confidential, Copyright ©
Quanticate References http://en.wikipedia.org/wiki/MapReduce http://hortonworks.com http://hadoop.apache.org
18.
Confidential, Copyright ©
Quanticate Coding-Freaks.Net www.codingfreaks.net Quanticate OPDev Twitter https://twitter.com/quanticateopdev Twitter www.Twitter.com/muralidharand
19.
Confidential, Copyright ©
Quanticate