Skip to content

Conversation

@lshaw8317
Copy link
Collaborator

SHUFFLE now makes use of provided filters_meta, interpreting it as number of bytestreams into which to divde input. This is useful for strings of unicode characters, since one may specify the character size (4 bytes) which will be used rather than the typesize of each array entry (4 * length of string). This typically leads to better compression (see results from tests/b2nd/b2nd_stringshuffle.c)

BLOSC_NOFILTER = 0,
//!< Byte-wise shuffle. `filters_meta` does not have any effect here.
BLOSC_BITSHUFFLE = 2,
BLOSC_SHUFFLE = 1,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think comments should be below for a correct rendering: https://blosc.org/c-blosc2/reference/utility_variables.html#c.BLOSC_SHUFFLE
Also, for byte-wise, filters_meta does have any effect here.

Copy link
Member

@FrancescAlted FrancescAlted left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It would be nice if you paste actual compression ratio figures here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants