Up until recently, Hadoop deployments have existed on hardware owned and run by organizations, often alongside legacy "big-iron" hardware. Today, cloud service providers allow customers to effectively rent hardware and associated network connectivity, along with a variety of other features like databases and bulk storage.
But installing a Hadoop cluster on a public cloud service is not as straightforward as it may appear. This practical book shows you how to install these clusters in a way that harmonizes with public cloud service features, and examine ways to use and manage them efficiently.
You’ll learn how to architect clusters in a way that works with the features of the provider, not only to avoid potential pitfalls, but also to take full advantage of what the services can do. A cluster installed in a suboptimal fashion will run slower and cost more than expected, which can defeat the goals of moving to the service in the first place.