Publisher's Synopsis
Companies are finding new sources of valuable data-from social media to clickstreams to server logs to machine and geolocation data-and they know they need to employ big data tools, like Hadoop, to make the best use of it. Many organizations are turning to Hortonworks-started by twenty-four of the original team of Yahoo! engineers that developed Hadoop-a company that has emerged as one of the key vendors helping enterprise customers make use of Hadoop to gain new, powerful insights into customer needs and wants.
The company's flagship product, Hortonworks Data Platform (HDP), is more than a product. It is a platform and a suite of tools that together create a framework for loading, managing, accessing and analyzing massive volumes of data no matter the format or schema. Pro Hortonworks Data Platform: Harness the Power and Promise of Big Data with HDP, written by Hadoop and HDP expert Stephen Giles, is designed to help readers gain full advantage from HDP. The book-which assumes no prior knowledge of Hadoop-provides an understanding of all facets of HDP and how the various parts work both together and within a larger data platform. It is the insightful "missing manual" that all HDP users need to understand the platform in depth and how to use it to best advantage.
Pro Hortonworks Data Platform provides a deep understanding of the specific components that make HDP so powerful. The book will:
- Show how to install, configure, and secure HDP and all its components
- Illustrate the full lifecycle of a big data project using HDP Provide a deep understanding of Yarn, the core engine of Hadoop
- Show how to leverage HDP/Apache tools like Pig, Hive, Hbase, and Solr to harness data