Skip to content

arm neon ld1{,q_x[234]}: speedups on SSE[32] & WASM#1329

Merged
mr-c merged 1 commit intosimd-everywhere:masterfrom
mr-c:libjpeg-turbo-speedtests
Sep 17, 2025
Merged

arm neon ld1{,q_x[234]}: speedups on SSE[32] & WASM#1329
mr-c merged 1 commit intosimd-everywhere:masterfrom
mr-c:libjpeg-turbo-speedtests

Conversation

@mr-c
Copy link
Collaborator

@mr-c mr-c commented Sep 17, 2025

  • Consolidate and propogate the use of these speedups to all vld1q_*_x[234] functions
  • Speedups for SSE3+ confirmed with the libjpeg-turbo benchmarks
  • Entropy encoding went from 309 to 883 Mcoefficients/sec on GCC 12.2

Consolidate and propogate the use of these speedups on all _xN functions

Speedups for SSE3+ confirmed with the libjpeg-turbo benchmarks

Entropy encoding went from 309 to 883 Mcoefficients/sec on GCC 12.2
@mr-c mr-c changed the title arm neon ld1{,q_x[234]}: slight speadups on SSE[32] & WASM arm neon ld1{,q_x[234]}: speedups on SSE[32] & WASM Sep 17, 2025
@mr-c mr-c enabled auto-merge (rebase) September 17, 2025 14:13
@mr-c mr-c merged commit 47c3c0b into simd-everywhere:master Sep 17, 2025
235 of 236 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant