R for Data Science
Import, Tidy, Transform, Visualize, and Model Data
Book Details
| Authors | Garrett Grolemund, Hadley Wickham |
| Publisher | O'Reilly Media |
| Published | 2016 |
| Edition | 1st |
| Paperback | 520 pages |
| Language | English |
| ISBN-13 | 9781491910399 |
| ISBN-10 | 1491910399 |
| License | Creative Commons Attribution-NonCommercial-NoDerivatives |
Book Description
Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data science as quickly as possible.
Authors Hadley Wickham and Garrett Grolemund guide you through the steps of importing, wrangling, exploring, and modeling your data and communicating the results. You'll get a complete, big-picture understanding of the data science cycle, along with basic tools you need to manage the details. Each section of the book is paired with exercises to help you practice what you've learned along the way.
You'll learn how to:
- Wrangle: transform your datasets into a form convenient for analysis;
- Program: learn powerful R tools for solving data problems with greater clarity and ease;
- Explore: examine your data, generate hypotheses, and quickly test them;
- Model: provide a low-dimensional summary that captures true "signals" in your dataset;
- Communicate: learn R Markdown for integrating prose, code, and results.
This book is available under a Creative Commons Attribution-NonCommercial-NoDerivatives license (CC BY-NC-ND), which means that you are free to copy and distribute it, as long as you attribute the source, don't use it commercially, and don't create modified versions.
If you enjoyed the book and would like to support the author, you can purchase a printed copy (hardcover or paperback) from official retailers.
Download and Read Links
Share this Book
[localhost]# find . -name "*Similar_Books*"
Data Science at the Command Line, 2nd Edition
This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed wit
Introduction to Data Science
Introduction to Data Science: Data Analysis and Prediction Algorithms with R introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression, and machine learning. It also helps you develop skills such as R programming, data wrangling, data vi
Data Parallel C++
Learn how to accelerate C++ programs using data parallelism. This open book enables C++ programmers to be at the forefront of this exciting and important new development that is helping to push computing to new levels. It is full of practical advice, detailed explanations, and code examples to illustrate key topics. Data parallelism in C++ enables
Intel Galileo and Intel Galileo Gen 2
Intel Galileo and Intel Galileo Gen 2: API Features and Arduino Projects for Linux Programmers provides detailed information about Intel Galileo and Intel Galileo Gen 2 boards for all software developers interested in Arduino and the Linux platform. The book covers the new Arduino APIs and is an introduction for developers on natively using Linux.
Elements of Data Science
Elements of Data Science is an introduction to the practical skills of working with data, written for people with no programming experience. Concepts are explained clearly and concisely, and exercises in each chapter demonstrate the real-world use of each feature. - Step-by-Step Approach: Learn how to execute a data science project from start to fi
Principles of Data Science
Principles of Data Science is intended to support one- or two-semester courses in data science. It is appropriate for data science majors and minors as well as students concentrating in business, finance, health care, engineering, the sciences, and a number of other fields where data science has become critically important. The authors have include