Practical Statistics for Data Scientists 50+ Essential Concepts Using R and Python

Peter C. Bruce, Andrew Bruce, Peter Gedeck, Peter C. Bruce

Second edition

Paperback (24 Jun 2020)

Save $22.33

~~RRP $80.90~~
$58.57

In Stock

Add to basket

Includes delivery to the United States

10+ copies available online - Usually dispatched within 7 days

Publisher's Synopsis

Statistical methods are a key part of data science, yet few data scientists have formal statistical training. Courses and books on basic statistics rarely cover the topic from a data science perspective. The second edition of this popular guide adds comprehensive examples in Python, provides practical guidance on applying statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what's important and what's not.

Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you're familiar with the R or Python programming languages and have some exposure to statistics, this quick reference bridges the gap in an accessible, readable format.

With this book, you'll learn:

Why exploratory data analysis is a key preliminary step in data science
How random sampling can reduce bias and yield a higher-quality dataset, even with big data
How the principles of experimental design yield definitive answers to questions
How to use regression to estimate outcomes and detect anomalies
Key classification techniques for predicting which categories a record belongs to
Statistical machine learning methods that "learn" from data
Unsupervised learning methods for extracting meaning from unlabeled data

Book information

ISBN:	9781492072942
Publisher:	O'Reilly Media
Imprint:	O'Reilly
Pub date:	24 Jun 2020
Edition:	Second edition
DEWEY:	001.422
DEWEY edition:	23
Language:	English
Number of pages:	xvi, 342
Weight:	634g
Height:	178mm
Width:	233mm
Spine width:	21mm

The Art of Explanation

Ros Atkins

Hardback
Published 14 Sep 2023

Save $5.33

~~RRP $25.28~~
$19.95

In Stock

Add to basket

Designing Data-Intensive Applic...

Martin Kleppmann

Paperback
Published 14 Mar 2017

Save $13.46

~~RRP $60.67~~
$47.21

In Stock

Add to basket

Delta Lake: Up and Running

Bennie Haelen, Dan D...

Paperback
Published 27 Oct 2023

Save $18.63

~~RRP $66.99~~
$48.36

In Stock

Add to basket

Amazon Redshift: The Definitive...

Rajesh Francis, Raji...

Paperback
Published 20 Oct 2023

Save $21.43

~~RRP $80.90~~
$59.47

In Stock

Add to basket

Cost-Effective Data Pipelines

Sev Leonard (author)...

Paperback
Published 28 Jul 2023

Save $18.44

~~RRP $66.99~~
$48.55

In Stock

Add to basket

R for Data Analysis in Easy Steps

Mike McGrath

Paperback
Published 02 Jun 2023

Save $0.35

~~RRP $16.42~~
$16.07

In Stock

Add to basket

Fundamentals of Data Observabil...

Andy Petrella

Paperback
Published 25 Aug 2023

Save $18.65

~~RRP $66.99~~
$48.34

In Stock

Add to basket

Building Real-Time Analytics Sy...

Mark Needham

Paperback
Published 29 Sep 2023

Save $19.20

~~RRP $66.99~~
$47.79

In Stock

Add to basket

R Packages

Hadley Wickham, Jenn...

Paperback
Published 27 Jun 2023

Save $17.29

~~RRP $66.99~~
$49.70

In Stock

Add to basket

R for Data Science

Hadley Wickham, Mine...

Paperback
Published 23 Jun 2023

Save $20.12

~~RRP $80.90~~
$60.78

In Stock

Add to basket