Data Science & AI Books
Introduction to Data Science
Introduction to Data Science: Data Analysis and Prediction Algorithms with R introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression, and machine learning. It also helps you develop skills such as R programming, data wrangling, data vi
Apache Iceberg: The Definitive Guide
Traditional data architecture patterns are severely limited. To use these patterns, you have to ETL data into each tool - a cost-prohibitive process for making warehouse features available to all of your data. The lack of flexibility with these patterns requires you to lock into a set of priority tools and formats, which creates data silos and data
Embedded Analytics
Over the past 10 years, data analytics and data visualization have become essential components of an enterprise information strategy. And yet, the adoption of data analytics has remained remarkably static, reaching no more than 30% of potential users. This book explores the most important techniques for taking that adoption further: embedding analy
Representation Learning for Natural Language Processing, 2nd Edition
This open book provides an overview of the recent advances in representation learning theory, algorithms, and applications for natural language processing (NLP), ranging from word embeddings to pre-trained language models. It is divided into four parts. Part I presents the representation learning techniques for multiple language entries, including
AI at the Edge
Edge AI is transforming the way computers interact with the real world, allowing IoT devices to make decisions using the 99% of sensor data that was previously discarded due to cost, bandwidth, or power limitations. With techniques like embedded machine learning, developers can capture human intuition and deploy it to any target-from ultra-low powe
Unlock Data Agility with Composable Data Architecture
Are your data systems slowing down your AI initiatives? The potential of AI to revolutionize business is undeniable, but many organizations struggle to bridge the gap between ambitious ideas and real-world results. The cause? Traditional data architectures remain too rigid and siloed to support today's dynamic, data-intensive demands. If you're a d
Understanding ETL
Extract, transform, load (ETL) is at the center of every application of data, from business intelligence to AI. Constant shifts in the data landscape - including the implementations of lakehouse architectures and the importance of high-scale real-time data - mean that today's data practitioners must approach ETL a bit differently. This updated tech
Intermediate Statistics with R
Introductory statistics courses prepare students to think statistically but cover relatively few statistical methods. Building on the basic statistical thinking emphasized in an introductory course, a second course in statistics at the undergraduate level can explore a large number of statistical methods. This text covers more advanced graphical su
Learning Analytics Methods and Tutorials
This open access comprehensive methodological book offers a much-needed answer to the lack of resources and methodological guidance in learning analytics, which has been a problem ever since the field started. The book covers all important quantitative topics in education at large as well as the latest in learning analytics and education data minin
Essential GraphRAG
A Retrieval Augmented Generation (RAG) system automatically selects and supplies domain-specific context to an LLM, radically improving its ability to generate accurate, hallucination-free responses. The GraphRAG pattern employs a knowledge graph to structure the RAG's input, taking advantage of existing relationships in the data to generate rich,