In Apache Flink (especially Table/SQL API), streams are often represented as changelogs rather than simple append-only rows.
The four event types tell downstream operators how a row has changed.
Here’s a clear breakdown 👇
The 4 Flink Changelog Event Types
| Event | Name | Meaning |
|---|---|---|
| +I | Insert | A new row is added |
| -U | Update Before | Old version of a row before an update |
| +U | Update After | New version of a row after an update |
| -D | Delete | A row is removed |
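You can watch these events yourself from the Flink SQL client by switching the result mode to changelog. A quick sketch, assuming an `orders` table already exists:

```sql
-- Show every result row together with its changelog op (+I/-U/+U/-D)
SET 'sql-client.execution.result-mode' = 'changelog';

-- Any updating query works; this aggregation is just illustrative
SELECT `user`, COUNT(*) AS order_cnt
FROM orders
GROUP BY `user`;
```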
1. +I (Insert)
What it means:
A brand-new row appears.
Example:
A user places their first order.
+I (order_id=101, user=Alice, amount=50)
This row did not exist before.
2. -U (Update Before)
What it means:
The old value of a row that is about to be updated.
It is emitted only in retract-style changelog streams; upsert streams carry only the new (update-after) row.
-U (order_id=101, user=Alice, amount=50)
3. +U (Update After)
What it means:
The new value of the same row after the update.
+U (order_id=101, user=Alice, amount=75)
👉 Together, -U and +U represent one logical update.
4. -D (Delete)
What it means:
A row is removed entirely.
-D (order_id=101, user=Alice, amount=75)
How Updates Work (Very Important)
Flink represents updates as two events:
-U (old row)
+U (new row)
Why?
Because this makes the stream fully consistent for:
- stateful operators
- joins
- sinks that need exact change semantics
Example Timeline
Imagine this SQL:
SELECT `user`, SUM(amount) FROM orders GROUP BY `user`;
Incoming orders:
- Alice orders 50
- Alice orders another 25
- Alice cancels the first order
Emitted changelog:
+I (Alice, 50)
-U (Alice, 50)
+U (Alice, 75)
-U (Alice, 75)
+U (Alice, 25)
No -D here: the aggregate row for Alice still exists (now with total 25), so the cancellation surfaces as an update, not a delete.
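To reproduce a timeline like this locally, the print connector is handy because it accepts all four event types. A minimal sketch, assuming made-up table and column names and a datagen source (a real orders stream would come from Kafka, CDC, etc.):

```sql
CREATE TABLE orders (
  order_id INT,
  `user`   STRING,
  amount   INT
) WITH (
  'connector' = 'datagen',
  'rows-per-second' = '1',
  'fields.user.length' = '1'  -- short keys so the same user repeats
);

CREATE TABLE user_totals (
  `user` STRING,
  total  INT
) WITH (
  'connector' = 'print'
);

-- The first row per user prints as +I; every later change to a
-- running total prints as a -U/+U pair
INSERT INTO user_totals
SELECT `user`, SUM(amount)
FROM orders
GROUP BY `user`;
```

(A true cancellation as in step 3 needs a retracting source such as CDC; datagen is append-only, so this sketch only shows +I/-U/+U.)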
Which Streams Produce Which Events?
| Stream Type | Possible Events |
|---|---|
| Append-only source | +I |
| Aggregation | +I, -U, +U |
| Join | +I, -U, +U, -D |
| Upsert stream | +I, +U, -D |
| Retract stream | all four |
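You can verify which events a particular query will produce with EXPLAIN CHANGELOG_MODE (a sketch; assumes the `orders` table from earlier):

```sql
EXPLAIN CHANGELOG_MODE
SELECT `user`, SUM(amount)
FROM orders
GROUP BY `user`;
-- The optimized plan annotates each operator with its changelog mode,
-- e.g. changelogMode=[I,UB,UA] (UB = update-before, UA = update-after)
```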
Mental Model
Think of Flink changelog events like database binlogs:
- +I → INSERT
- -U → UPDATE (old row)
- +U → UPDATE (new row)
- -D → DELETE
Next: (1) how Flink changelog events map to Kafka upsert topics, and (2) which sinks support which event types.
1️⃣ Mapping Flink Changelog Events to Kafka Upsert Topics
What is an upsert Kafka topic?
An upsert topic represents the latest state per key.
- Keyed by a primary key
- A record with the same key overwrites the previous value
- A tombstone (null value) means delete
Flink’s upsert-kafka connector is designed exactly for this model.
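A minimal sink definition for this model might look like the following (topic, server, and column names are placeholders):

```sql
-- The PRIMARY KEY determines the Kafka record key; a delete becomes
-- a tombstone (key with null value) on the topic
CREATE TABLE user_totals_upsert (
  user_id      STRING,
  total_amount DECIMAL(10, 2),
  PRIMARY KEY (user_id) NOT ENFORCED
) WITH (
  'connector' = 'upsert-kafka',
  'topic' = 'user-totals',
  'properties.bootstrap.servers' = 'localhost:9092',
  'key.format' = 'json',
  'value.format' = 'json'
);
```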
Event → Kafka Record Mapping
Assume:
Key = user_id, Value = (user_id, total_amount)

| Flink Event | Kafka Key | Kafka Value | Meaning |
|---|---|---|---|
| +I | key | value | Insert new row |
| +U | key | value | Update existing row |
| -D | key | null | Delete row (tombstone) |
| -U | ❌ ignored | ❌ ignored | Not written |
⚠️ Important:
Kafka upsert topics do NOT store -U events.
They only care about final state, not intermediate transitions.
Example
Flink emits:
+I (Alice, 50)
-U (Alice, 50)
+U (Alice, 75)
-D (Alice, 75)
Kafka upsert topic receives:
key=Alice, value=50
key=Alice, value=75
key=Alice, value=null ← tombstone (delete)
Why -U Is Dropped
Kafka has no concept of “update-before”.
Only the latest value per key matters.
When to Use upsert-kafka
✔ Aggregations
✔ Materialized views
✔ CDC-style streams
✔ State replication (idempotent, last-write-wins per key)
❌ Append-only event logs (use normal Kafka instead)
2️⃣ Which Sinks Support Which Event Types?
This is critical when designing Flink SQL pipelines.
Sink Capability Matrix
| Sink Type | +I | -U | +U | -D | Notes |
|---|---|---|---|---|---|
| Print / Debug | ✅ | ✅ | ✅ | ✅ | For debugging |
| Upsert Kafka | ✅ | ❌ | ✅ | ✅ | Requires primary key |
| Kafka (append) | ✅ | ❌ | ❌ | ❌ | Insert-only |
| JDBC (upsert mode) | ✅ | ❌ | ✅ | ✅ | PK required |
| JDBC (append) | ✅ | ❌ | ❌ | ❌ | No updates |
| Filesystem (CSV/Parquet) | ✅ | ❌ | ❌ | ❌ | Append-only |
| Hudi / Iceberg / Paimon | ✅ | ❌ | ✅ | ✅ | Table formats |
| Elasticsearch | ✅ | ❌ | ✅ | ✅ | Document ID = key |
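For example, a JDBC sink switches to upsert mode as soon as a primary key is declared (a sketch; URL and names are placeholders):

```sql
-- With a PRIMARY KEY the connector issues upsert statements
-- (e.g. INSERT ... ON CONFLICT for PostgreSQL) and DELETEs for -D
CREATE TABLE user_totals_jdbc (
  user_id      STRING,
  total_amount DECIMAL(10, 2),
  PRIMARY KEY (user_id) NOT ENFORCED
) WITH (
  'connector' = 'jdbc',
  'url' = 'jdbc:postgresql://localhost:5432/analytics',
  'table-name' = 'user_totals'
);
```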
Why Many Sinks Reject -U
-U is an internal consistency signal, not a final state.
Most sinks only want:
- Insert
- Update (after)
- Delete
So Flink:
- consumes -U internally
- does NOT forward it to sinks like Kafka, JDBC, or Iceberg
Common Error You’ll See
Sink does not support consuming update-before records
Fix options:
- Use an upsert sink
- Define a PRIMARY KEY
- Convert to append-only where logically valid (see the sketch below)
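For the last option, one common pattern is a window TVF aggregation: each window's result is emitted once and never retracted, so the stream becomes append-only. A sketch, assuming `orders` has an event-time attribute `order_time` and `kafka_append_sink` is a hypothetical insert-only table:

```sql
-- One final row per (window, user): safe for insert-only sinks
INSERT INTO kafka_append_sink
SELECT window_start, `user`, SUM(amount) AS total
FROM TABLE(
  TUMBLE(TABLE orders, DESCRIPTOR(order_time), INTERVAL '1' HOUR))
GROUP BY window_start, window_end, `user`;
```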
Quick Rule of Thumb
- If your query has GROUP BY, JOIN, or DISTINCT → it produces updates
- If your sink doesn't support updates → the job will fail
- Kafka upsert sinks solve 80% of these cases
