Chore/manoeuvre #2

GaussianWonder · 2026-01-09T14:48:32Z

Notes

After this whole sharade, this is the anticlimactic conclusion:

Name	Time
Original impl +clones	~900 µs
Original impl -clones	~400 µs
Scalar	~92 µs
SimdFixed	~122 µs
SimdFit	~166 µs

If scalar implementation is properly inlined, given appropiate compiler options the result seems to be already vetorized,
so the benchmark results are:

I barely did as good as the compiler.

SIMD ops are significantly better, faster, and more efficient when data is aligned to cache line or vector register boundaries,
which is probably the time difference observed in sync impls.

The parallel versions are even closer:

Name	Time
ScalarPar	~51.5 µs
SimdFixedPar	~52.5 µs
SimdFitPar	~53.25 µs

Changelog

rust nightly

test feature

Used for internal benchmarks, not tracked by criterion.
portable_simd feature

Used for simd implementations.

xtask pattern

Will be used to generate test/dev assets (and optionally ci/cd).

fix bayer matrices

BAYER2 and BAYER3 pattern were wrong.

vscode settings

format on save
default formatter rust-lang.rust-analyzer

module structure refator

x_utils.rs -> utils/x.rs

traits

/// Core trait for applying a transform to data
///
/// Generic over the right-hand side type `Rhs`, similar to `Add`, `Sub`, etc.
pub trait Transform<Rhs = ()> {
    /// Apply the transform to the given data
    fn apply(&mut self, rhs: &mut Rhs);
}

see transform.rs and impls in bayer_transform.rs, example of usage can be observed here:

src\tests\bayer_strategy.rs:60@apply_strategy

benches\bayer_transform_utils.rs:46@benchmark_strategy

The property of reusing the same transform can only be seen in benchmarks.

The property to swap input arguments between transform calls is available
and checked against the borrow checker, but no code-example is documented
(see bayer_transform.rs:30@BayerArgs::replace_input)

structs

pub struct Texture<T> {
    width: u32,
    height: u32,
    buffer: Vec<T>,
}

pub struct TextureRef<'a, T> {
    width: u32,
    height: u32,
    buffer: &'a [T],
}

Textures can be owned structures or borrowed from other containers.
All Texture types implement AsRef<[T]> which is a trait agnostic of owned container type.

This allows for fewer allocations / conversions in some scenarios.
(i.e. video processing, where references to bytes can be tossed around).

and +1 variant: see texture.rs.

This pairs well with Transforms.

iterator things

see utils/iterator.rs

most likely this will be deleted later, it does not really help and it is not used.

experimental test to check against performance loss when using custom iterators for
index par iter vs parallel processing by par chunking.

error

src\error.rs contains a flexible DitherpunkerError and an associated Result type.

Implementing From<possible_error_type> for DitherpunkerError allows seamless ? usage with
mixed error types in the same function body.

crates

itertools

Not specifically required, but handy
multiversion

Suggest simd width for a given data type to autodetect strategy
num-traits

Used to describe num utils constrained on num ops
image@GaussianWonder/image
```
[patch.crates-io]
image = { git = "https://github.com/GaussianWonder/image", branch = "pub-enlargable-v0.25.9" }
```
patch issue: make Enlargable trait pub.

Can be used to generically describe ops available on image::ImageBuffer<..., T> for any Texture<T>

Benches Plots

Sync

Par

TODO WIP

try as_simd()

safe wrapper around slice::align_to. this is to check if performance is lost because the simd ops are not performed on aligned items within the buffers.
remove json, use serde
remove unnecessary bayer strategies
replace all Result types with error::Result
generate blue noise
save all assets as textures instead of static arrays.

GaussianWonder added 30 commits December 1, 2025 01:52

chore: format

20197d0

chore: bump edition

9d85558

fix: bayer matrices

bbc7289

chore: enable nightly

c1a506d

chore: unignored vscode settings

ab9e248

feat: add xtask pattern for dev tasks and ci/cd

111735c

chore: format, lint and refactor

42394c2

chore: add settings.json

cdaad82

chore: change f64 to f32

4a39914

feat: utils extensions

ddb635c

feat: add texture, texture ref and texture ref mut

e6ab4f6

feat: add bayer transforms and bayer strategies

53a3765

feat: add bayer strategy tests

2586063

chore: add missing iterator tests

2f69cf0

feat: add benches

21e2155

chore: remove unused deps

4f33058

chore: remove all mod.rs files

8fa8c54

feat: consolidate TextureTransform trait

66b1462

chore: refactor-rename

318c674

feat: add transform to main

2dd054f

feat: add pipes, add grayscale, add benches

97a4852

chore: add bench entries

1cf8fad

chore: add num tests

79c4150

chore: move buffer tests

1f98a66

feat: add writable texture from borrwed refs

d35e908

chore: add anonymous error (avoid if possible)

1696429

chore: add transform utils tests

0e4c53a

chore: test pipeline

9b60a83

chore: test grayscale impl against supported multi channels

e03aea1

chore: swap to chunks_exact

e2e181f

GaussianWonder added 3 commits January 15, 2026 01:27

feat: multiversion preset at crate-level

dde282d

feat: add simd_target multiversion utils

7996342

chore: tests

e45d71a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chore/manoeuvre #2

Chore/manoeuvre #2

Uh oh!

GaussianWonder commented Jan 9, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Chore/manoeuvre #2

Are you sure you want to change the base?

Chore/manoeuvre #2

Uh oh!

Conversation

GaussianWonder commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Notes

Changelog

rust nightly

xtask pattern

fix bayer matrices

vscode settings

module structure refator

traits

structs

iterator things

error

crates

Benches Plots

Sync

Par

TODO WIP

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

GaussianWonder commented Jan 9, 2026 •

edited

Loading