Mastering Hadoop

by Sandeep Karanth

Length: 398 pages
Edition: 1
Language: English
Publisher: Packt Publishing
Publication Date: 2015-02-23
ISBN-10: 1783983647
ISBN-13: 9781783983643

Description

Go beyond the basics and master the next generation of Hadoop data processing platforms

About This Book

Learn how to optimize Hadoop MapReduce, Pig, and Hive
Dive into YARN and learn how it can integrate Storm with Hadoop
Understand how Hadoop can be deployed on the cloud and gain insights into analytics with Hadoop

Who This Book Is For

Do you want to broaden your Hadoop skill set and take your knowledge to the next level? Do you wish to enhance your knowledge of Hadoop to solve challenging data processing problems? Are your Hadoop jobs, Pig scripts, or Hive queries not working as fast as you intend? Are you looking to understand the benefits of upgrading Hadoop? If the answer is yes to any of these, this book is for you. It assumes novice-level familiarity with Hadoop.

In Detail

Hadoop is synonymous with Big Data processing. Its simple programming model, “code once and deploy at any scale” paradigm, and ever-growing ecosystem make Hadoop an all-encompassing platform for programmers with different levels of expertise.

This book explores the industry guidelines for optimizing MapReduce jobs and higher-level abstractions such as Pig and Hive in Hadoop 2.0. It then dives deep into Hadoop 2.0-specific features such as YARN and HDFS Federation.

This book is a step-by-step guide that focuses on advanced Hadoop concepts and aims to take your Hadoop knowledge and skill set to the next level. The data processing flow dictates the order of the concepts in each chapter, and each chapter is illustrated with code fragments or schematic diagrams.

Table of Contents

Chapter 1: Hadoop 2.X
Chapter 2: Advanced MapReduce
Chapter 3: Advanced Pig
Chapter 4: Advanced Hive
Chapter 5: Serialization and Hadoop I/O
Chapter 6: YARN – Bringing Other Paradigms to Hadoop
Chapter 7: Storm on YARN – Low Latency Processing in Hadoop
Chapter 8: Hadoop on the Cloud
Chapter 9: HDFS Replacements
Chapter 10: HDFS Federation
Chapter 11: Hadoop Security
Chapter 12: Analytics Using Hadoop
Appendix: Hadoop for Microsoft Windows
