refactor(sdk): Allow to send waveform for any audio message #5732

zecakeh · 2025-10-01T09:26:07Z

We plan to use this in Fractal to always be able to present a waveform for an audio message.

The second commit might be more controversial: it changes the format of the waveform from a list of u16 values between 0 and 1024 to a list of f32 values between 0 and 1. The 0 to 1024 range used in the event is quite arbitrary (and it has since changed in MSC3246), and most clients probably end up with values between 0 and 1 that they need to convert just for sending. So this centralizes the conversion in the SDK.

By moving the waveform declaration into `BaseAudioInfo`. Signed-off-by: Kévin Commaille <[email protected]>

Most clients will probably work with values between 0 and 1 and need to convert it just to send it, so we can move that conversion into the SDK. This is also more forwards-compatible, because MSC3246 now has a different max value for the amplitude, so when this becomes stable, the only change needed will be in the SDK. Signed-off-by: Kévin Commaille <[email protected]>
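The conversion described above can be sketched roughly as follows. This is a minimal illustration, not the SDK's actual code: the function names `waveform_to_wire` and `wire_to_waveform`, the constant `MAX_WIRE_AMPLITUDE`, and the policy of clamping out-of-range or NaN inputs are all assumptions made for this example.

```rust
/// Hypothetical maximum amplitude on the wire (the 0 to 1024 range discussed here).
const MAX_WIRE_AMPLITUDE: f32 = 1024.0;

/// Convert normalized amplitudes in [0, 1] to integer amplitudes in [0, 1024].
/// Assumption: out-of-range values are clamped and NaN is treated as 0.
fn waveform_to_wire(waveform: &[f32]) -> Vec<u16> {
    waveform
        .iter()
        .map(|&amp| {
            let clamped = if amp.is_nan() { 0.0 } else { amp.clamp(0.0, 1.0) };
            // After clamping, the product is in [0, 1024], so the cast cannot overflow.
            (clamped * MAX_WIRE_AMPLITUDE).round() as u16
        })
        .collect()
}

/// Convert integer amplitudes back to normalized values, capping at 1.0.
fn wire_to_waveform(wire: &[u16]) -> Vec<f32> {
    wire.iter()
        .map(|&value| (f32::from(value) / MAX_WIRE_AMPLITUDE).min(1.0))
        .collect()
}

fn main() {
    let wire = waveform_to_wire(&[0.0, 0.5, 1.0, 1.5, -0.2]);
    println!("{wire:?}");
    println!("{:?}", wire_to_waveform(&wire));
}
```

With a single constant holding the maximum amplitude, a later change of the range in the MSC would only touch that constant, which is the forwards-compatibility point made in the commit message.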

Signed-off-by: Kévin Commaille <[email protected]>

codspeed-hq · 2025-10-01T10:19:24Z

CodSpeed Performance Report

Merging #5732 will not alter performance

Comparing zecakeh:audio-waveform (ab7b887) with main (681b221)

Summary

✅ 50 untouched

codecov · 2025-10-01T20:57:23Z

Codecov Report

❌ Patch coverage is 0% with 5 lines in your changes missing coverage. Please review.
✅ Project coverage is 88.40%. Comparing base (59b7da2) to head (ab7b887).
⚠️ Report is 4 commits behind head on main.
✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
crates/matrix-sdk/src/attachment.rs	0.00%	4 Missing ⚠️
crates/matrix-sdk/src/room/mod.rs	0.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #5732      +/-   ##
==========================================
- Coverage   88.40%   88.40%   -0.01%     
==========================================
  Files         359      359              
  Lines       99048    99046       -2     
  Branches    99048    99046       -2     
==========================================
- Hits        87566    87562       -4     
- Misses       7344     7346       +2     
  Partials     4138     4138


Hywan

The changes look good to me. I'm a bit annoyed by the regression with the waveform format, but since it's a different type and it's documented in the CHANGELOG, I think it's fine.

Hywan · 2025-10-02T06:22:22Z

Re-running the benchmarks. I suspect the result is misleading and could be due to noise.

@zecakeh can you fix the conflicts please?

bnjbvr

Thanks, we're on board with this change, and as you said it'll make the switch to the new MSC simpler. We might delay the landing of this PR, since we want to limit landing breaking changes at the FFI layer until the Matrix conference.

One comment I would have is: why use f32 and not f64? We'd get much better precision in the [0; 1] range, and I don't think we get any notable speedup from using 32 bits instead of 64 bits here.

Would it make sense to change to a f64 in this patch?

bnjbvr · 2025-10-01T16:23:25Z

crates/matrix-sdk/src/attachment.rs

    /// The waveform of the audio clip.
-    pub waveform: Option<Vec<u16>>,
+    ///
+    /// Must only includes values between 0 and 1.


nit: include

zecakeh · 2025-10-02T08:38:47Z

I don't mind changing to f64, but I chose f32 because it is precise enough to be converted to an integer value between 0 and 1024, and it should be precise enough for drawing too, so I don't see what we would gain by increasing the precision.

bnjbvr · 2025-10-02T08:50:45Z

I'm wary of using float32s, because they can't finely represent small values, viz. values in the [0; 1] range. That is, many small float32 values would be normalized to the same int in the [0; 1024] range, so if you looked at a flat projection of the f32 range onto the int10 range, you might see holes in the int range. Since float64s have so much more precision, it's unlikely that this problem would happen with those.

zecakeh · 2025-10-02T09:12:12Z

I am not convinced that this is true, because f32 has a precision of 6 significant digits, and for integer values between 0 and 1024 we only need 4.

bnjbvr

I've double-checked that with a program, and agree with your conclusion; good point.

Approving, but we've been requested to not merge this immediately.
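The precision question raised in this thread can be checked with a short program. The sketch below is not the program the reviewer ran; it is a hypothetical check (the helper name `round_trips` is made up for this example) that every integer amplitude in the 0 to 1024 range survives a round trip through a normalized f32.

```rust
/// Returns true if every integer amplitude in 0..=max is recovered exactly
/// after being normalized to an f32 in [0, 1] and scaled back.
fn round_trips(max: u16) -> bool {
    (0..=max).all(|value| {
        let normalized = f32::from(value) / f32::from(max);
        let back = (normalized * f32::from(max)).round() as u16;
        back == value
    })
}

fn main() {
    // No "holes": each of the 1025 integer values maps back to itself.
    assert!(round_trips(1024));

    // The gap between adjacent wire values is far larger than the
    // spacing of f32 values near 1.0 (f32::EPSILON).
    let wire_step = 1.0_f32 / 1024.0;
    assert!(f32::EPSILON < wire_step);

    println!("all values in 0..=1024 round-trip through f32");
}
```

For a maximum of 1024 the division and multiplication are exact in f32, because 1024 is a power of two, so the check passes without relying on rounding at all.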

zecakeh · 2025-10-02T15:03:28Z

No problem, I'll rebase this after the Matrix conference then to solve the conflicts.

zecakeh added 2 commits October 1, 2025 10:56

refactor(sdk): Allow to send waveform for any audio message

99d1092

By moving the waveform declaration into `BaseAudioInfo`. Signed-off-by: Kévin Commaille <[email protected]>

zecakeh requested a review from a team as a code owner October 1, 2025 09:26

zecakeh requested review from bnjbvr and removed request for a team October 1, 2025 09:26

Add changelog for waveform changes

ab7b887

Signed-off-by: Kévin Commaille <[email protected]>

Hywan approved these changes Oct 2, 2025


bnjbvr reviewed Oct 2, 2025


bnjbvr approved these changes Oct 2, 2025
