Data Science & AI Books
Creating a Data-Driven Enterprise with DataOps
Many companies are busy collecting massive amounts of data, but few are taking advantage of this treasure horde to build a truly data insights-driven organization. To do so, the data team must democratize both data and the insights in a way that provides real-time access to all employees in the organization. This report explores DataOps, the proces
Making Sense of Stream Processing
How can event streams help make your application more scalable, reliable, and maintainable? In this report, O'Reilly author Martin Kleppmann shows you how stream processing can make your data storage and processing systems more flexible and less complex. Structuring data as a stream of events isn't new, but with the advent of open source projects s
Building Knowledge Graphs
Incredibly useful, knowledge graphs help organizations keep track of medical research, cybersecurity threat intelligence, GDPR compliance, web user engagement, and much more. They do so by storing interlinked descriptions of entities - objects, events, situations, or abstract concepts - and encoding the underlying information. How do you create a k
Statistics Done Wrong
Scientific progress depends on good research, and good research needs good statistics. But statistical analysis is tricky to get right, even for the best and brightest of us. You'd be surprised how many scientists are doing it wrong. Statistics Done Wrong is a pithy, essential guide to statistical blunders in modern science that will show you how t
Financial Machine Learning
Financial Machine Learning surveys the nascent literature on machine learning in the study of financial markets. The authors highlight the best examples of what this line of research has to offer and recommend promising directions for future research. This survey is designed for both financial economists interested in grasping machine learning tool
Natural Language Processing with Transformers
Since their introduction in 2017, transformers have quickly become the dominant architecture for achieving state-of-the-art results on a variety of natural language processing tasks. If you're a data scientist or coder, this practical book -now revised in full color- shows you how to train and scale these large models using Hugging Face Transformer
Statistical Foundations of Actuarial Learning and its Applications
This open access book discusses the statistical modeling of insurance problems, a process which comprises data collection, data analysis and statistical model building to forecast insured events that may happen in the future. It presents the mathematical foundations behind these fundamental statistical concepts and how they can be applied in daily
Data Visualization with Category Theory and Geometry
This open access book provides a robust exposition of the mathematical foundations of data representation, focusing on two essential pillars of dimensionality reduction methods, namely geometry in general and Riemannian geometry in particular, and category theory. Presenting a list of examples consisting of both geometric objects and empirical data
Accelerating AI with Synthetic Data
Recently, data scientists have found effective methods to generate high-quality synthetic data. That's good news for companies seeking large amounts of data to train and build artificial intelligence and machine learning models. This report provides an overview of synthetic data generation that not only focuses on business value and use cases but a
Practical Machine Learning
The book provides an accessible, comprehensive introduction for beginners to machine learning, equipping them with the fundamental skills and techniques essential for this field. It enables beginners to construct practical, real-world solutions powered by machine learning across diverse application domains. It demonstrates the fundamental technique