Data Science at the Command Line, 2nd Edition
Obtain, Scrub, Explore, and Model Data with Unix Power Tools
Book Details
| Author | Jeroen Janssens |
| Publisher | O'Reilly Media |
| Published | 2021 |
| Edition | 2nd |
| Paperback | 282 pages |
| Language | English |
| ISBN-13 | 9781492087915, 9781492087908 |
| ISBN-10 | 1492087912, 1492087904 |
| License | Creative Commons Attribution-NonCommercial-NoDerivatives |
Book Description
This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools - useful whether you work with Windows, macOS, or Linux. You'll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you're comfortable processing data with Python or R, you'll learn how to greatly improve your data science workflow by leveraging the command line's power. This book is ideal for data scientists, analysts, engineers, system administrators, and researchers.
- Obtain data from websites, APIs, databases, and spreadsheets;
- Perform scrub operations on text, CSV, HTML, XML, and JSON files;
- Explore data, compute descriptive statistics, and create visualizations;
- Manage your data science workflow;
- Create your own tools from one-liners and existing Python or R code;
- Parallelize and distribute data-intensive pipelines;
- Model data with dimensionality reduction, regression, and classification algorithms;
- Leverage the command line from Python, Jupyter, R, RStudio, and Apache Spark.
This book is available under a Creative Commons Attribution-NonCommercial-NoDerivatives license (CC BY-NC-ND), which means that you are free to copy and distribute it, as long as you attribute the source, don't use it commercially, and don't create modified versions.
If you enjoyed the book and would like to support the author, you can purchase a printed copy (hardcover or paperback) from official retailers.
Similar Books
Introduction to Autonomous Robots
Textbooks that provide a broad algorithmic perspective on the mechanics and dynamics of robots almost unfailingly serve students at the graduate level. Introduction to Autonomous Robots offers a much-needed resource for teaching third- and fourth-year undergraduates the computational fundamentals behind the design and control of autonomous robots.
Certified Programming with Dependent Types
A handbook to the Coq software for writing and checking mathematical proofs, with a practical engineering focus. The technology of mechanized program verification can play a supporting role in many kinds of research projects in computer science, and related tools for formal proof-checking are seeing increasing adoption in mathematics and engineerin
GNU SED
This open access book employs a practical, example-driven approach to introduce the features of sed and regular expressions incrementally. The author strongly advises readers to manually type and experiment with each example, emphasizing that a thorough understanding of both the sample input and the resulting output is critical for comprehension. T
The Tiny Book of Rules
The Tiny Book of Rules is tiny and it's about the Rules module for Drupal. Rules is a module usually used to automate actions on your Drupal site. Rules can react to events occurring on your site - such as a user logging in or a node being created - and perform customized follow-up actions such as redirecting to a certain page or setting field valu
The Computers That Made Britain
The home computer boom of the 1980s brought with it now iconic machines such as the ZX Spectrum, BBC Micro, and Commodore 64. Those machines would inspire a generation. Written by Tim Danton. The Computers That Made Britain (300 pages, hardback) tells the story of 19 of those computers - and what happened behind the scenes. With dozens of new inter
The Linux Command Line, 5th Edition
The Linux Command Line takes you from your very first terminal keystrokes to writing full programs in Bash, the most popular Linux shell (or command line). Along the way you'll learn the timeless skills handed down by generations of experienced, mouse-shunning gurus: file navigation, environment configuration, command chaining, pattern matching wit