JSON-aware compression for NDJSON sinks - 2x smaller than zstd on structured logs #25063
rushikeshmore asked this question in General
Hey,
I built a JSON/NDJSON compressor called DataCortex in Rust. Instead of treating JSON as raw bytes the way zstd and gzip do, it infers the schema automatically, reorganizes rows into columns, and applies type-specific encoding before compressing. The result on structured logs, as the title says, is output about 2x smaller than zstd.
Lossless, byte-exact roundtrip. 381 tests. MIT licensed, published on crates.io.
I'm wondering if this could be useful as a compression codec option in Vector for JSON/NDJSON sinks. It's pure Rust, so no FFI is needed. The main limitation is that it only helps on JSON/NDJSON data.
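To make the idea concrete, here is a minimal sketch of the general technique described above: transpose rows into per-key columns, then apply a type-specific encoding to each column (delta for integers, dictionary for strings). This is not DataCortex's actual code or API. The `Value` enum, `Row` alias, and all function names are hypothetical, the rows are assumed to be already parsed, and the final general-purpose compression pass is left out.

```rust
use std::collections::{BTreeMap, HashMap};

/// Hypothetical stand-in for a parsed JSON value; a real codec would
/// cover floats, booleans, nulls, and nested structures as well.
#[derive(Debug, Clone, PartialEq)]
enum Value {
    Int(i64),
    Str(String),
}

/// One NDJSON line, already parsed into key/value pairs (assumption).
type Row = Vec<(String, Value)>;

/// Transpose rows into columns so that values of the same key, and
/// usually the same type, end up next to each other.
fn to_columns(rows: &[Row]) -> BTreeMap<String, Vec<Value>> {
    let mut cols = BTreeMap::new();
    for row in rows {
        for (key, value) in row {
            cols.entry(key.clone())
                .or_insert_with(Vec::new)
                .push(value.clone());
        }
    }
    cols
}

/// Delta-encode an integer column: store each value as the difference
/// from the previous one (the first is a difference from zero).
/// Slowly increasing fields such as timestamps become small numbers.
fn delta_encode(values: &[i64]) -> Vec<i64> {
    let mut prev = 0i64;
    values
        .iter()
        .map(|&v| {
            let delta = v.wrapping_sub(prev);
            prev = v;
            delta
        })
        .collect()
}

/// Inverse of `delta_encode`: a running sum restores the original values.
fn delta_decode(deltas: &[i64]) -> Vec<i64> {
    let mut acc = 0i64;
    deltas
        .iter()
        .map(|&d| {
            acc = acc.wrapping_add(d);
            acc
        })
        .collect()
}

/// Dictionary-encode a string column: each distinct string is stored
/// once, in first-seen order, and every row keeps only a small index.
fn dict_encode(values: &[String]) -> (Vec<String>, Vec<u32>) {
    let mut dict: Vec<String> = Vec::new();
    let mut seen: HashMap<String, u32> = HashMap::new();
    let mut ids = Vec::with_capacity(values.len());
    for v in values {
        let next = dict.len() as u32;
        let id = *seen.entry(v.clone()).or_insert_with(|| {
            dict.push(v.clone());
            next
        });
        ids.push(id);
    }
    (dict, ids)
}
```

The point of the transposition is that a column like a timestamp or a log level is far more regular than interleaved JSON text, so the byte-level compressor that runs afterwards has much less entropy to deal with.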
Repo: https://github.com/rushikeshmore/DataCortex
Site: https://datacortex-dcx.vercel.app