appendix C The Apache Iceberg specification
Apache Iceberg is more than an open table format; it is a specification with clearly defined rules for how table metadata, snapshots, partitioning, and schema evolution behave. These specifications ensure consistent behavior across tools and engines, enabling interoperability, reliability, and long-term stability. As Iceberg evolves, each new version of the specification introduces new capabilities while balancing backward compatibility.
This appendix provides a concise but thorough reference to the Iceberg specification. It begins by explaining the purpose and structure of the specification, and it then walks through format versions 1, 2, and 3, highlighting the major features introduced in each. You’ll also find guidance on how metadata, snapshots, and concurrency work under the hood, along with coverage of the REST Catalog API and Puffin file format specification.
Whether you’re implementing your own engine integration or evaluating compatibility between tools, this appendix will help you understand the guarantees Iceberg makes and how they continue to evolve.