Data Science on the Google Cloud Platform

Data Science on the Google Cloud Platform Implementing End-to-End Real-Time Data Pipelines : From Ingest to Machine Learning

First edition

Paperback (05 Jan 2018)

Not available for sale

Includes delivery to the United States

Out of stock

This service is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Publisher's Synopsis

Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build on top of the Google Cloud Platform (GCP). This hands-on guide shows developers entering the data science field how to implement an end-to-end data pipeline, using statistical and machine learning methods and tools on GCP. Through the course of the book, youâ??ll work through a sample business decision by employing a variety of data science approaches.

Follow along by implementing these statistical and machine learning solutions in your own project on GCP, and discover how this platform provides a transformative and more collaborative way of doing data science.

Youâ??ll learn how to:

  • Automate and schedule data ingest, using an App Engine application
  • Create and populate a dashboard in Google Data Studio
  • Build a real-time analysis pipeline to carry out streaming analytics
  • Conduct interactive data exploration with Google BigQuery
  • Create a Bayesian model on a Cloud Dataproc cluster
  • Build a logistic regression machine-learning model with Spark
  • Compute time-aggregate features with a Cloud Dataflow pipeline
  • Create a high-performing prediction model with TensorFlow
  • Use your deployed model as a microservice you can access from both batch and real-time pipelines

Book information

ISBN: 9781491974568
Publisher: O'Reilly Media
Imprint: O'Reilly
Pub date:
Edition: First edition
DEWEY: 004.33
DEWEY edition: 23
Language: English
Number of pages: xiv, 393
Weight: 704g
Height: 180mm
Width: 234mm
Spine width: 20mm