about this book
This book is designed to help architects, engineers, and data leaders move from surface-level awareness of Apache Iceberg to a confident, production-ready implementation. While Iceberg is widely discussed, practical guidance on designing an end-to-end lakehouse around it remains limited. This book addresses that gap by breaking down each architectural layer, outlining design choices, and providing both conceptual frameworks and hands-on exercises.
Rather than prescribe a one-size-fits-all deployment, the book emphasizes adaptability. You’ll learn how to assess your needs, weigh tradeoffs, and select tools that align with your business and technical goals.
Who should read this book
This book is for data architects, platform engineers, and senior data professionals responsible for modernizing data infrastructure or designing new analytical platforms. You should be familiar with the general concepts of data lakes and data warehouses, and with processing tools such as Apache Spark or Apache Flink. No prior experience with Apache Iceberg is required, but familiarity with cloud storage, distributed systems, and SQL will help you get the most out of the material.
How this book is organized: A road map
The book is divided into three parts: