Skip to content

Conversation

@lidel
Copy link
Member

@lidel lidel commented Jan 17, 2026

Users can now pin down import settings to get reproducible CIDs across software updates.

Apply a profile: ipfs config profile apply unixfs-v1-2025

Two profiles available:

  • unixfs-v1-2025: modern CIDv1 with improved defaults
  • unixfs-v0-2015 (alias legacy-cid-v0): best-effort legacy CIDv0 behavior

New config options for fine-tuning:

  • Import.UnixFSHAMTDirectorySizeEstimation: controls HAMT threshold estimation (links, block, disabled)
  • Import.UnixFSDAGLayout: choose balanced or trickle layout

New ipfs add flags:

  • --dereference-symlinks: resolve all symlinks to target content (replaces deprecated --dereference-args)
  • --empty-dirs / -E: include empty directories
  • --hidden / -H: include hidden files

Removes test-cid-v1 and test-cid-v1-wide profiles.

Tests cover profile determinism, HAMT threshold boundaries, and symlink handling.

Related: IPIP-499, depends on boxo#1088 and go-ipfs-cmds#315

implements IPIP-499: add config options for controlling UnixFS DAG
determinism and introduces `unixfs-v1-2025` and `unixfs-v0-2015`
profiles for cross-implementation CID reproducibility.

changes:
- add Import.* fields: HAMTDirectorySizeEstimation, SymlinkMode,
  DAGLayout, IncludeEmptyDirectories, IncludeHidden
- add validation for all Import.* config values
- add unixfs-v1-2025 profile (recommended for new data)
- add unixfs-v0-2015 profile (alias: legacy-cid-v0)
- remove deprecated test-cid-v1 and test-cid-v1-wide profiles
- wire Import.HAMTSizeEstimationMode() to boxo globals
- update go.mod to use boxo with SizeEstimationMode support

ref: https://specs.ipfs.tech/ipips/ipip-0499/
@lidel lidel force-pushed the feat/ipip-499-unixfs-2025 branch 2 times, most recently from bf5578b to d79f7de Compare January 17, 2026 04:55
add CLI flags for controlling file collection behavior during ipfs add:

- `--dereference-symlinks`: recursively resolve symlinks to their target
  content (replaces deprecated --dereference-args which only worked on
  CLI arguments). wired through go-ipfs-cmds to boxo's SerialFileOptions.
- `--empty-dirs` / `-E`: include empty directories (default: true)
- `--hidden` / `-H`: include hidden files (default: false)

these flags are CLI-only and not wired to Import.* config options because
go-ipfs-cmds library handles input file filtering before the directory
tree is passed to kubo. removed unused Import.UnixFSSymlinkMode config
option that was defined but never actually read by the CLI.

also:
- wire --trickle to Import.UnixFSDAGLayout config default
- update go-ipfs-cmds to v0.15.1-0.20260117043932-17687e216294
- add SYMLINK HANDLING section to ipfs add help text
- add CLI tests for all three flags

ref: ipfs/specs#499
lidel added 2 commits January 19, 2026 06:13
add comprehensive test suite for UnixFS CID determinism per IPIP-499:
- verify exact HAMT threshold boundary for both estimation modes:
  - v0-2015 (links): sum(name_len + cid_len) == 262144
  - v1-2025 (block): serialized block size == 262144
- verify HAMT triggers at threshold + 1 byte for both profiles
- add all deterministic CIDs for cross-implementation testing

also wires SizeEstimationMode through CLI/API, allowing
Import.UnixFSHAMTSizeEstimation config to take effect.

bumps boxo to ipfs/boxo@6707376 which aligns HAMT threshold with
JS implementation (uses > instead of >=), fixing CID determinism
at the exact 256 KiB boundary.
Previously, resolving symlinks required two flags:
- --dereference-args: resolved symlinks passed as CLI arguments
- --dereference-symlinks: resolved symlinks inside directories

Now --dereference-symlinks handles both cases. Users only need one flag
to fully dereference symlinks when adding files to IPFS.

The deprecated --dereference-args still works for backwards compatibility
but is no longer necessary.
- update boxo to ebdaf07c (nil filter fix, thread-safety docs)
- simplify changelog for IPIP-499 section
- shorten test names, move context to comments
@lidel lidel marked this pull request as ready for review January 20, 2026 02:26
@lidel lidel requested a review from a team as a code owner January 20, 2026 02:26
@lidel
Copy link
Member Author

lidel commented Jan 20, 2026

I may add more tests or improve code, but its ready for initial review, to course correct-early.

Copy link
Contributor

@gammazero gammazero left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All code looks good, and it looks like all tests cases are covered.

lidel and others added 3 commits January 22, 2026 01:25
Co-authored-by: Andrew Gillis <11790789+gammazero@users.noreply.github.com>
add test that confirms kubo uses balanced layout (all leaves at same
depth) rather than balanced-packed (varying depths). creates 45MiB file
to trigger multi-level DAG and walks it to verify leaf depth uniformity.

includes trickle subtest to validate test logic can detect varying depths.

supports CAR export via DAG_LAYOUT_CAR_OUTPUT env var for test vectors.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants