feat: support Iceberg v3 unknown type #662

Open
manuzhang wants to merge 3 commits into apache:main from manuzhang:codex/support-unknown-v3-type

Conversation

manuzhang (Member) commented May 20, 2026

Closes #665


Summary

  • Add Iceberg v3 unknown primitive type and JSON serialization/deserialization support.
  • Support unknown as null-only data across Arrow, Avro, Parquet, schema projection, and nested fields.
  • Enforce required-field invariants for unknown/null-only projections and Arrow null imports.
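
The type-level rules in the summary can be illustrated with a small standalone sketch. It is not the code in this PR: `TypeId`, `Field`, `ToJsonName`, `FromJsonName`, and `ValidateField` are hypothetical names, and only three types are modeled. It assumes the schema JSON name of the new type is `"unknown"`, and that an unknown field is only valid when it is optional and the table format version is at least 3, as the commit messages in this PR describe.

```cpp
#include <cassert>
#include <initializer_list>
#include <optional>
#include <string>
#include <string_view>

// Hypothetical, simplified model for illustration; not the iceberg-cpp API.
enum class TypeId { kInt, kString, kUnknown };

struct Field {
  std::string name;
  TypeId type;
  bool optional;
};

// Type name as it appears in schema JSON (assumed: "unknown" for the v3 type).
constexpr std::string_view ToJsonName(TypeId id) {
  switch (id) {
    case TypeId::kInt: return "int";
    case TypeId::kString: return "string";
    case TypeId::kUnknown: return "unknown";
  }
  return "";
}

// Inverse of ToJsonName; std::nullopt for names this sketch does not model.
inline std::optional<TypeId> FromJsonName(std::string_view name) {
  for (TypeId id : {TypeId::kInt, TypeId::kString, TypeId::kUnknown}) {
    if (ToJsonName(id) == name) return id;
  }
  return std::nullopt;
}

// Returns an error message, or std::nullopt when the field is acceptable.
inline std::optional<std::string> ValidateField(const Field& field,
                                                int format_version) {
  if (field.type != TypeId::kUnknown) return std::nullopt;
  if (format_version < 3) return "unknown type requires format version 3";
  if (!field.optional) return "unknown type must be optional";
  return std::nullopt;
}

// Usage example: an optional unknown field passes on v3 and fails on v2.
inline void CheckFieldExamples() {
  Field note{"note", TypeId::kUnknown, /*optional=*/true};
  assert(!ValidateField(note, 3).has_value());
  assert(ValidateField(note, 2).has_value());
}
```

Returning an optional error string keeps the sketch free of any particular error-handling library; the real code base may well report errors differently.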

Validation

  • ctest --test-dir build --output-on-failure

Co-authored-by: @codex

manuzhang marked this pull request as ready for review May 20, 2026 11:10
Comment thread src/iceberg/schema_util.cc
Comment thread src/iceberg/test/update_schema_test.cc Outdated
Comment thread src/iceberg/parquet/parquet_writer.cc Outdated
wgtmac (Member) commented May 22, 2026

Please rebase on the latest main as well.

manuzhang and others added 2 commits May 22, 2026 17:06
Add an Iceberg unknown primitive type and JSON, Arrow, Avro, Parquet, projection, and data path support for null-only unknown fields. Enforce optionality invariants so required projections cannot be materialized from unknown/null-only fields.

Co-authored-by: Codex <codex@openai.com>
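
The optionality invariant in the commit message above can be sketched as a projection decision. This is an illustration only, assuming a source column of unknown type can produce nothing but nulls; `SourceKind`, `Plan`, and `PlanProjection` are hypothetical names that do not come from the PR.

```cpp
#include <cassert>

// Hypothetical, simplified model for illustration; not the iceberg-cpp API.
enum class SourceKind { kTyped, kUnknown };
enum class Plan { kReadColumn, kFillNull, kReject };

// Decide how to materialize one expected field from its source column.
constexpr Plan PlanProjection(SourceKind source, bool expected_optional) {
  // A regular typed column is read as usual.
  if (source == SourceKind::kTyped) return Plan::kReadColumn;
  // An unknown (null-only) source can only yield nulls, so it may feed an
  // optional field but never a required one.
  return expected_optional ? Plan::kFillNull : Plan::kReject;
}

// Usage example: a required field cannot be projected from unknown.
inline void CheckProjectionExamples() {
  assert(PlanProjection(SourceKind::kUnknown, true) == Plan::kFillNull);
  assert(PlanProjection(SourceKind::kUnknown, false) == Plan::kReject);
}
```

Rejecting at planning time, rather than while reading rows, is a design choice of this sketch; the PR text does not say at which stage the check runs.
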
Enforce unknown as a v3-only optional type in schema validation and update paths, allow schema-update promotion from unknown to primitive types, reject unknown-to-nested projection, and reject unsupported Parquet writes for unknown list or map leaves.

Co-authored-by: Codex <codex@openai.com>
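
The promotion rule described in the commit message above can be sketched as a predicate. `Kind`, `IsNested`, and `CanPromote` are hypothetical names, and the set of regular promotions is cut down to int to long and float to double for brevity, so this is a reading of the rule under those assumptions rather than the helper added by the PR.

```cpp
#include <cassert>

// Hypothetical, simplified model for illustration; not the iceberg-cpp API.
enum class Kind {
  kInt, kLong, kFloat, kDouble, kString, kUnknown, kStruct, kList, kMap
};

constexpr bool IsNested(Kind kind) {
  return kind == Kind::kStruct || kind == Kind::kList || kind == Kind::kMap;
}

// True when a schema update may change a column's type from `from` to `to`.
constexpr bool CanPromote(Kind from, Kind to) {
  // Nested types are never a promotion source or target in this sketch.
  if (IsNested(from) || IsNested(to)) return false;
  if (from == to) return true;
  // unknown may be promoted to any primitive type.
  if (from == Kind::kUnknown) return true;
  // A reduced set of regular widening promotions.
  if (from == Kind::kInt && to == Kind::kLong) return true;
  if (from == Kind::kFloat && to == Kind::kDouble) return true;
  return false;
}

// Usage example: unknown widens to a primitive but not to a struct.
inline void CheckPromotionExamples() {
  assert(CanPromote(Kind::kUnknown, Kind::kString));
  assert(!CanPromote(Kind::kUnknown, Kind::kStruct));
}
```

Checking for nested types first means the unknown rule never has to enumerate the primitive targets it accepts.
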
manuzhang force-pushed the codex/support-unknown-v3-type branch from 26ccf9b to 8087ed1 May 22, 2026 09:19
Assert that promotion helpers reject nested type targets for unknown and regular primitive source types.

Co-authored-by: Codex <codex@openai.com>
manuzhang force-pushed the codex/support-unknown-v3-type branch from eb195ac to d2a34b5 May 22, 2026 09:50

Development

Successfully merging this pull request may close these issues.

Support v3 unknown data type

2 participants