Apache Hive Essentials, 2nd Edition

  • Length: 210 pages
  • Edition: 2nd Revised edition
  • Publisher:
  • Publication Date: 2018-06-30
  • ISBN-10: 1788995090
  • ISBN-13: 9781788995092
  • Sales Rank: #973502
Description

Apache Hive Essentials: Essential techniques to help you process, and get unique insights from, big data, 2nd Edition

This book takes you on a fantastic journey to discover the attributes of big data using Apache Hive.

Key Features

  • Grasp the skills needed to write efficient Hive queries to analyze big data
  • Discover how Hive can coexist and work with other tools within the Hadoop ecosystem
  • Use practical, example-oriented scenarios that cover all the newly released features of Apache Hive 2.3.3

Book Description

In this book, we prepare you for your journey into big data by first introducing you to the background of the big data domain, along with the process of setting up and getting familiar with your Hive working environment.

Next, the book guides you through discovering and transforming the values of big data with the help of examples. It also hones your skills in using the Hive language in an efficient manner. Toward the end, the book focuses on advanced topics, such as performance, security, and extensions in Hive, which will guide you on exciting adventures on this worthwhile big data journey.

By the end of the book, you will be familiar with Hive and able to work efficiently to find solutions to big data problems.

What you will learn

  • Create and set up the Hive environment
  • Discover how to use Hive’s definition language to describe data
  • Discover interesting data by joining and filtering datasets in Hive
  • Transform data by using Hive sorting, ordering, and functions
  • Aggregate and sample data in different ways
  • Boost Hive query performance and enhance data security in Hive
  • Customize Hive to your needs by using user-defined functions and integrate it with other tools

Who This Book Is For

If you are a data analyst, developer, or simply someone who wants to quickly get started with Hive to explore and analyze big data in Hadoop, this is the book for you. Since Hive uses an SQL-like query language, some previous experience with SQL will be useful to get the most out of this book.

Table of Contents

Chapter 1. Overview of Big Data and Hive
Chapter 2. Setting Up the Hive Environment
Chapter 3. Data Definition and Description
Chapter 4. Data Correlation and Scope
Chapter 5. Data Manipulation
Chapter 6. Data Aggregation and Sampling
Chapter 7. Extensibility Considerations
Chapter 8. Working with Other Tools
Chapter 9. Performance Considerations
Chapter 10. Security Considerations
