Hadoop For Dummies

  • Length: 408 pages
  • Edition: 1
  • Publisher:
  • Publication Date: 2014-04-14
  • ISBN-10: 1118607554
  • ISBN-13: 9781118607558
  • Sales Rank: #345253

Description

Let Hadoop For Dummies help harness the power of your data and rein in the information overload

Big data has become big business, and companies and organizations of all sizes are struggling to find ways to retrieve valuable information from their massive data sets without becoming overwhelmed. Enter Hadoop and this easy-to-understand For Dummies guide. Hadoop For Dummies helps readers understand the value of big data, make a business case for using Hadoop, navigate the Hadoop ecosystem, and build and manage Hadoop applications and clusters.

  • Explains the origins of Hadoop, its economic benefits, and its functionality and practical applications
  • Helps you find your way around the Hadoop ecosystem, program MapReduce, utilize design patterns, and get your Hadoop cluster up and running quickly and easily
  • Details how to use Hadoop applications for data mining, web analytics and personalization, large-scale text processing, data science, and problem-solving
  • Shows you how to improve the value of your Hadoop cluster, maximize your investment in Hadoop, and avoid common pitfalls when building your Hadoop cluster

From programmers challenged with building and maintaining affordable, scalable data systems to administrators who must deal with huge volumes of information effectively and efficiently, this how-to has something to help you with Hadoop.

Table of Contents

Part I: Getting Started with Hadoop
Chapter 1: Introducing Hadoop and Seeing What It’s Good For
Chapter 2: Common Use Cases for Big Data in Hadoop
Chapter 3: Setting Up Your Hadoop Environment

Part II: How Hadoop Works
Chapter 4: Storing Data in Hadoop: The Hadoop Distributed File System
Chapter 5: Reading and Writing Data
Chapter 6: MapReduce Programming
Chapter 7: Frameworks for Processing Data in Hadoop: YARN and MapReduce
Chapter 8: Pig: Hadoop Programming Made Easier
Chapter 9: Statistical Analysis in Hadoop
Chapter 10: Developing and Scheduling Application Workflows with Oozie

Part III: Hadoop and Structured Data
Chapter 11: Hadoop and the Data Warehouse: Friends or Foes?
Chapter 12: Extremely Big Tables: Storing Data in HBase
Chapter 13: Applying Structure to Hadoop Data with Hive
Chapter 14: Integrating Hadoop with Relational Databases Using Sqoop
Chapter 15: The Holy Grail: Native SQL Access to Hadoop Data

Part IV: Administering and Configuring Hadoop
Chapter 16: Deploying Hadoop
Chapter 17: Administering Your Hadoop Cluster

Part V: The Part of Tens
Chapter 18: Ten Hadoop Resources Worthy of a Bookmark
Chapter 19: Ten Reasons to Adopt Hadoop
