This document summarizes books and resources related to Hadoop and big data projects. It covers eight titles, including Hadoop in Practice, Programming Hive, Hadoop: The Definitive Guide, Apache Solr 3.1 Cookbook, and How to Develop Big Data Applications for Hadoop. For each item, it lists the title, author(s), publisher, publication date, and ISBN, and most entries include a brief description of what the book or resource covers.
2. Hadoop & Related Projects
• Hadoop in Practice
• Programming Hive
• Hadoop: The Definitive Guide, Third Edition
• Apache Solr 3.1 Cookbook
• HBase Administration Cookbook
• Hadoop in Action
• Pro Hadoop
• How to Develop Big Data Applications for Hadoop
qa.zariga.com
3. Hadoop in Practice
• Hadoop in Practice
• By Alex Holmes
• Manning Publications, October 2012
• ISBN: 9781617290237
• 536 pages, $49.99
Hadoop in Practice collects 85 Hadoop examples and presents
them in a problem/solution format. Each technique addresses a
specific task you'll face, like querying big data using Pig or
writing a log file loader. You'll explore each problem step by
step, learning both how to build and deploy that specific
solution along with the thinking that went into its design. As you
work through the tasks, you'll find yourself growing more
comfortable with Hadoop and at home in the world of big data.
4. Programming Hive
• Programming Hive
By Edward Capriolo; Dean Wampler; Jason Rutherglen
O’Reilly Media, Inc., September 2012
ISBN: 9781449326944
• This example-driven guide shows you how to set up and configure Hive in your
environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates
how Hive works within the Hadoop ecosystem. You’ll also find real-world case studies that
describe how companies have used Hive to solve unique problems involving petabytes of
data.
• Use Hive to create, alter, and drop databases, tables, views, functions, and indexes
• Customize data formats and storage options, from files to external databases
• Load and extract data from tables—and use queries, grouping, filtering, joining, and other
conventional query methods
• Gain best practices for creating user defined functions (UDFs)
• Learn Hive patterns you should use and anti-patterns you should avoid
• Integrate Hive with other data processing programs
• Use storage handlers for NoSQL databases and other datastores
• Learn the pros and cons of running Hive on Amazon’s Elastic MapReduce
5. Hadoop: The Definitive Guide, Third Edition
• Hadoop: The Definitive Guide, Third Edition
By Tom White
O’Reilly Media, October 2010
ISBN: 9781449398644
• Apache Hadoop is ideal for organizations with a growing need
to store and process massive application datasets. With this
book developers will find details for analyzing large datasets
with Hadoop, and administrators will learn how to set up and
run Hadoop clusters. The book includes case studies that
illustrate how Hadoop is used to solve specific problems.
6. Apache Solr 3.1 Cookbook
• Apache Solr 3.1 Cookbook
By Rafał Kuć
Packt Publishing, July 2011
ISBN: 9781849512183
• This book is part of Packt's Cookbook series; each chapter looks at a
different aspect of working with Apache Solr. The recipes deal with
common problems of working with Solr by using easy-to-
understand, real-life examples. The book is not in any way a
complete Apache Solr reference and you should see it as a helping
hand when things get rough on your journey with Apache
Solr. Developers who are working with Apache Solr and would like to
know how to combat common problems will find this book of great
use. Knowledge of Apache Lucene would be a bonus but is not
required.
7. HBase Administration Cookbook
• HBase Administration Cookbook
• By Yifeng Jiang
• Packt Publishing, August 2012
• ISBN: 9781849517140
• As part of Packt's cookbook series, each recipe offers a
practical, step-by-step solution to common problems found in
HBase administration. This book is for HBase
administrators and developers, and will even help Hadoop
administrators. You are not required to have HBase
experience, but are expected to have a basic understanding of
Hadoop and MapReduce.
8. Hadoop in Action
• Hadoop in Action
By Chuck Lam
Manning Publications, December 2010
ISBN: 9781935182191
• The book begins by making the basic idea of Hadoop and MapReduce
easier to grasp by applying the default Hadoop installation to a few
easy-to-follow tasks, such as analyzing changes in word frequency across
a body of documents. The book continues through the basic concepts of
MapReduce applications developed using Hadoop, including a close look
at framework components, use of Hadoop for a variety of data analysis
tasks, and numerous examples of Hadoop in action.
• Hadoop in Action will explain how to use Hadoop and present design
patterns and practices of programming MapReduce. MapReduce is a
complex idea both conceptually and in its implementation, and Hadoop
users are challenged to learn all the knobs and levers for running
Hadoop. This book takes you beyond the mechanics of running
Hadoop, teaching you to write meaningful programs in a MapReduce
framework.
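The word-frequency task mentioned in the Hadoop in Action description can be sketched in plain Python, without a Hadoop installation, to show how the map, shuffle, and reduce phases fit together. This is only an illustrative sketch and is not taken from the book; the function names (`map_words`, `shuffle`, `reduce_counts`, `word_count`) are hypothetical, and everything runs in a single process rather than on a cluster.

```python
from collections import defaultdict


def map_words(document):
    """Map phase: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle phase: group emitted values by key, as a framework would."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_counts(word, counts):
    """Reduce phase: sum the counts collected for one word."""
    return word, sum(counts)


def word_count(documents):
    """Run the three phases in-process over a list of document strings."""
    pairs = [pair for doc in documents for pair in map_words(doc)]
    return dict(reduce_counts(w, c) for w, c in shuffle(pairs).items())


if __name__ == "__main__":
    # Hypothetical sample input; real jobs would read files from HDFS.
    print(word_count(["big data", "Big clusters"]))
```

On a real cluster the framework performs the shuffle and runs many mappers and reducers in parallel; the sketch only mirrors the data flow.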
9. Pro Hadoop
• Pro Hadoop
By Jason Venner
Apress, June 2009
ISBN: 9781430219422
• From Apress, the name you’ve come to trust for hands-on
technical knowledge, Pro Hadoop brings you up to speed on
Hadoop. You learn the ins and outs of MapReduce; how to
structure a cluster, design, and implement the Hadoop file
system; and how to build your first cloud-computing tasks
using Hadoop. Learn how to let Hadoop take care of
distributing and parallelizing your software: you just focus on
the code, and Hadoop takes care of the rest.
10. How to Develop Big Data Applications for Hadoop
• How to Develop Big Data Applications for Hadoop
By Ken Krugler; Shevek M; Chris Wensel; Abe Taha
O’Reilly Media, February 2011
ISBN: 9781449305796