Spark Operations Cookbook

Spark Operations Cookbook Solving the Practical Challenges of Spark Implementation

1st edition

Paperback (31 Jul 2017)

Not available for sale

Includes delivery to the United States

Out of stock

This service is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Publisher's Synopsis

The Apache Spark cluster computing system aims to make data analytics fast-both fast to run and fast to write. But as powerful and useful as Spark is for distributed systems, there are many issues that may occur during implementation. This practical cookbook contains recipes solving the most common problems that Spark users face. Author Neelesh Srinivas Salian, a customer operations engineer at Cloudera, has seen all things that can go wrong in the code for Spark applications.

Data engineers, system administrators, architects will learn recipes for debugging common and unexpected problems that occur during key phases of Spark implementation on large distributed system environments. From setting up your cluster to running your first application, submitting to a cluster, understanding storage needs, and handling security and monitoring metrics, this book is your guide to facing any Spark operations issue.

  • Learn an approach to debugging Spark from the perspective of improving business logic implementation
  • Understand the nuances of Spark's components, including Spark Core, Spark Streaming, SparkSQL, and MLLib
  • Get an entire chapter devoted to Spark security-an emerging and vital topic

Book information

ISBN: 9781491971581
Publisher: O'Reilly Media
Imprint: O'Reilly
Pub date:
Edition: 1st edition
Language: English
Number of pages: 200
Weight: 364g
Height: 233mm
Width: 178mm
Spine width: 16mm