Think Stats, 3rd Edition

Exploratory Data Analysis


CC BY

Book Details

Author Allen Downey
Publisher O'Reilly Media, Green Tea Press
Published 2025
Edition 3
Paperback 324 pages
Language English
ISBN-13 9781098190255
ISBN-10 1098190254
License Creative Commons Attribution

Book Description

If you know how to program, you have the skills to turn data into knowledge. This thoroughly revised edition presents statistical concepts computationally, rather than mathematically, using programs written in Python. Through practical examples and exercises based on real-world datasets, you'll learn the entire process of exploratory data analysis, from wrangling data and generating statistics to identifying patterns and testing hypotheses.

Whether you're a data scientist, software engineer, or data enthusiast, you'll get up to speed on commonly used tools including NumPy, SciPy, and Pandas. You'll explore distributions, relationships between variables, visualization, and many other concepts. And all chapters are available as Jupyter notebooks, so you can read the text, run the code, and work on exercises all in one place.

- Analyze data distributions and visualize patterns using Python libraries
- Improve predictions and insights with regression models
- Dive into specialized topics like time series analysis and survival analysis
- Integrate statistical techniques and tools for validation, inference, and more
- Communicate findings with effective data visualization
- Troubleshoot common data analysis challenges
- Boost reproducibility and collaboration in data analysis projects with interactive notebooks


This book is available under a Creative Commons Attribution license (CC BY), which means that you are free to copy, distribute, and modify it, as long as you give appropriate credit to the original author.

If you enjoyed the book and would like to support the author, you can purchase a printed copy (hardcover or paperback) from official retailers.

Similar Books


Think Stats, 2nd Edition

Analysis

If you know how to program, you have the skills to turn data into knowledge, using tools of probability and statistics. This concise introduction shows you how to perform statistical analysis computationally, rather than mathematically, with programs written in Python. By working with a single case study throughout this thoroughly revised book, you

R for Data Science

R Analysis

Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data

Critical Data Literacy

Analysis

A short course for students to increase their proficiency in analyzing and interpreting data visualizations. By completing this short course, students will be able to explain the importance of data literacy and identify data visualization issues in order to improve their own skills in data storytelling. The intended outcome of this course is to help s

Think OCaml

OCaml

How to Think Like a Computer Scientist is an introductory programming book based on the OCaml language. It is a modified version of Think Python by Allen Downey. It is intended for newcomers to programming and also those who know some programming but want to learn programming in the function-oriented paradigm, or those who simply want to learn OCam

Python Data Science Handbook

Python Pandas

For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all - IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other relate

Managing Cloud Native Data on Kubernetes

Kubernetes Cloud

Is Kubernetes ready for stateful workloads? This open source system has become the primary platform for deploying and managing cloud native applications. But because it was originally designed for stateless workloads, working with data on Kubernetes has been challenging. If you want to avoid the inefficiencies and duplicative costs of having separa