Apache Paimon Demystified

Apache Paimon Demysti f ied Zhiyan Xiao (LY Corp), 2025/3/26
A Realtime Lakehouse Table Format

Self Introduction 肖志彦 / Zhiyan Xiao 🏠 中国重慶 /
Chongqing, China 8D Magic City, Hot Pot 👨🎓 香港中文大学 / The Chinese University of Hong Kong Math, Information Engineering 👨💻 Software Engineer @ LY Corporation Streaming Data Pipeline, Spark, Iceberg ✨ Improving big data systems with Rust Hadoop Client, Hive Client, SASL / Kerberos, InnoFile, InnoTable

Purpose of Today To understand basic ideas of Paimon

Afterwards, we can • Deep dive internals of Paimon •
Consider how to utilize Paimon in real business

Agenda What is Paimon? How does Paimon work? When to
use Paimon?

What is Paimon?

A Lakehouse table format enabling realtime read and write operations

First, how to read and write big data?

Usage of File System & File Format

Issues in directly handling data f iles • No explicit
schema • Schema evolution is not well de f ined • Full data scanning is needed when • reading records with particular f ilters • deleting or updating certain records • Missing advanced features like ACID, Time Travel, etc.

Bene f its of Table Format • De f ine
schema explicitly • Support schema evolution • Accelerate querying by skipping unnecessary data f iles • Support other advanced features like ACID, Time Travel, etc.

Popular Table Formats • Hive • Lakehouse Table Format •
Iceberg • Delta Lake • Hudi • Realtime Lakehouse (Streamhouse) Table Format • Paimon

Skipping unnecessary data f iles would signi f icantly accelerate
data queries

Hive Table Format • Skip unnecessary data f iles with
partition and bucket • Partition • Mapping “partition columns” to “data directories” • /table/year=2025/month=3 • Bucket • Mapping “bucket columns” to “data f iles with corresponding hash” • /table/year=2025/month=3/part-00000

Issues of Hive Table Format • Dif f icult for
Hive Metastore to support huge amount of partitions • Query optimization is limited to partition and bucket level • Missing advanced features like ACID, full schema evolution, etc.

Lakehouse Table Format Take Iceberg as example • Skip unnecessary
data f iles with metadata • metadata.json • manifest-list.avro (contains stats to skip manifest f iles) • manifest.avro (contains stats to skip data f iles) • data- f iles.parquet • Support advanced features like ACID, full schema evolution, etc.

Issues of Lakehouse Table Format For high-throughput streaming write, •
Iceberg • Commits are atomic and inef f icient for frequent writes • Delta Lake, Hudi • Update/delete relies on Merge-on-Read with expensive compaction

Paimon Table Format • Skip unnecessary data f iles with
metadata (manifest, index f ile) • Support high-throughput streaming write and low latency read • with LSM Tree (log-structured merge-tree) storage model

Above is the role Paimon plays as a Realtime Lakehouse
Table Format

How does Paimon work?

Paimon Table Types • Table with Primary Key • Table
without Primary Key • View • Format Table • Object Table • Materialized Table

Paimon File Layouts

Insert One Record INSERT INTO T VALUES (1, 10001, 'varchar00001',
'20230501');

Insert Multiple Records In di ff erent partitions

Delete Records DELETE FROM T WHERE dt >= '20230503';

Compact Table CALL sys.compact('T');

Expire Snapshot

Flink Stream Write Before Flink checkpoint

Flink Stream Write During Flink checkpoint

Change Log Producer Take “lookup change log” as example

When to use Paimon?

Usage of Paimon • High-throughput streaming write • Streaming read
from change log • Uni f ied streaming-batch pipeline • (Realtime Lakehouse) Streamhouse Architecture

Potential Future Actions • Comprehensively compare behaviors of Iceberg and
Paimon • Create PoC with Paimon to demonstrate its bene f its • Smoothly upgrade from Lakehouse to Streamhouse

References • https://paimon.apache.org/ • https://github.com/facebook/rocksdb/wiki/Universal-Compaction

Apache Paimon Demystified

Apache Paimon Demystified

More Decks by Open Data Driven

Other Decks in Technology

Featured

Transcript