Apache Books


Apache Iceberg: The Definitive Guide

Apache Iceberg

Traditional data architecture patterns are severely limited. To use these patterns, you have to ETL data into each tool - a cost-prohibitive process for making warehouse features available to all of your data. The lack of flexibility with these patterns requires you to lock into a set of priority tools and formats, which creates data silos and data

Making Sense of Stream Processing

Kafka

How can event streams help make your application more scalable, reliable, and maintainable? In this report, O'Reilly author Martin Kleppmann shows you how stream processing can make your data storage and processing systems more flexible and less complex. Structuring data as a stream of events isn't new, but with the advent of open source projects s

Hacking Exposed Web Applications

Hacking Apache IIS ASP.NET

In today's world of pervasive Internet connectivity and rapidly evolving Web technology, online security is as critical as it is challenging. With the enhanced availability of information and services online and Web-based attacks and break-ins on the rise, security risks are at an all time high. Hacking Exposed Web Applications shows you, step-by-s

Graph Algorithms

Spark Neo4j

Learn how graph algorithms can help you leverage relationships within your data to develop intelligent solutions and enhance your machine learning models. With this practical guide, developers and data scientists will discover how graph analytics deliver value, whether they're used for building dynamic network models or forecasting real-world behav

The Common Java Cookbook

Java

In this book, you'll find information about a set of libraries developed within the Apache Commons (also referred to as "Commons"). Commons is a set of small, popular components which forms a Top-level Project at the Apache Software Foundation. Ranging from the elementary to the complex, many would consider some of these libraries indispensable to

Making Servers Work

Linux

This book highlights practical sysadmin skills, common architectures that you'll encounter, and best practices that apply to automating and running systems at any scale, from one laptop or server to 1,000 or more. It is intended to help orient you within the discipline, and hopefully encourages you to learn more about system administration.

Time Series Databases

NoSQL

Time series databases enable a fundamental step in the central storage and analysis of many types of machine data. As such, they lie at the heart of the Internet of Things (IoT). There's a revolution in sensor - to - insight data flow that is rapidly changing the way we perceive and understand the world around us. Much of the data generated by sens

Modern PHP

PHP

Back in 1995, software engineer Rasmus Lerdorf needed an easier way to maintain his own web page, and wrote a set of programs he called Personal Home Page, or PHP. He soon open sourced the software, but never anticipated it would become the dominant web-page programming language it is today, Now, PHP powers more than 80% of websites that report the

Producing Open Source Software, 2nd Edition

Software

The corporate market is now embracing free, "open source" software like never before, as evidenced by the recent success of the technologies underlying LAMP (Linux, Apache, MySQL, and PHP). Each is the result of a publicly collaborative process among numerous developers who volunteer their time and energy to create better software. The truth is, ho

Version Control with Subversion, 3rd Edition

Apache CVS

Written by members of the development team that maintains Subversion, this is the official guide and reference manual for the popular open source revision control technology. The new edition covers Subversion 1.7 with a complete introduction and guided tour of its capabilities, along with best practice recommendations. Version Control with Subversi