Skip to content

Conversation

roypat
Copy link
Contributor

@roypat roypat commented Aug 19, 2025

In commit e7504ae ("refactor: cleanup vmm::snapshot module"),
firecracker started reading the snapshot vmstate file in a single pass
instead of first loading it into a Vec and then deserializing. This
seems to have caused some performance regression due to the deserializer
doing many successive reads, resulting in many read(2) syscalls.
Fix this by going back to first reading the snapshot file into a buffer,
and then deserializing from slice instead.

Signed-off-by: Patrick Roy [email protected]## Changes

...

Reason

...

License Acceptance

By submitting this pull request, I confirm that my contribution is made under
the terms of the Apache 2.0 license. For more information on following Developer
Certificate of Origin and signing off your commits, please check
CONTRIBUTING.md.

PR Checklist

  • I have read and understand CONTRIBUTING.md.
  • I have run tools/devtool checkbuild --all to verify that the PR passes
    build checks on all supported architectures.
  • I have run tools/devtool checkstyle to verify that the PR passes the
    automated style checks.
  • I have described what is done in these changes, why they are needed, and
    how they are solving the problem in a clear and encompassing way.
  • I have updated any relevant documentation (both in code and in the docs)
    in the PR.
  • I have mentioned all user-facing changes in CHANGELOG.md.
  • If a specific issue led to this PR, this PR closes the issue.
  • When making API changes, I have followed the
    Runbook for Firecracker API changes.
  • I have tested all new and changed functionalities in unit tests and/or
    integration tests.
  • I have linked an issue to every new TODO.

  • This functionality cannot be added in rust-vmm.

@roypat roypat requested review from Manciukic and bchalios August 19, 2025 12:02
@roypat roypat force-pushed the snapshot-buf-read branch 2 times, most recently from d6ff290 to c9cda3c Compare August 19, 2025 12:24
bchalios
bchalios previously approved these changes Aug 19, 2025
Copy link

codecov bot commented Aug 19, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 82.41%. Comparing base (57ea0a3) to head (5d9c4ba).
⚠️ Report is 2 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #5394      +/-   ##
==========================================
+ Coverage   82.36%   82.41%   +0.05%     
==========================================
  Files         266      266              
  Lines       30571    30580       +9     
==========================================
+ Hits        25180    25203      +23     
+ Misses       5391     5377      -14     
Flag Coverage Δ
5.10-c5n.metal 82.39% <100.00%> (+0.01%) ⬆️
5.10-m5n.metal 82.39% <100.00%> (+0.01%) ⬆️
5.10-m6a.metal 81.67% <100.00%> (+0.01%) ⬆️
5.10-m6g.metal 78.99% <100.00%> (+<0.01%) ⬆️
5.10-m6i.metal 82.38% <100.00%> (+0.02%) ⬆️
5.10-m7a.metal-48xl 81.65% <100.00%> (?)
5.10-m7g.metal 78.99% <100.00%> (+<0.01%) ⬆️
5.10-m7i.metal-24xl 82.36% <100.00%> (?)
5.10-m7i.metal-48xl 82.35% <100.00%> (?)
5.10-m8g.metal-24xl 78.99% <100.00%> (?)
5.10-m8g.metal-48xl 78.99% <100.00%> (?)
6.1-c5n.metal 82.43% <100.00%> (+0.01%) ⬆️
6.1-m5n.metal 82.43% <100.00%> (+0.02%) ⬆️
6.1-m6a.metal 81.71% <100.00%> (+0.02%) ⬆️
6.1-m6g.metal 78.99% <100.00%> (+<0.01%) ⬆️
6.1-m6i.metal 82.42% <100.00%> (+0.02%) ⬆️
6.1-m7a.metal-48xl 81.70% <100.00%> (?)
6.1-m7g.metal 78.99% <100.00%> (+<0.01%) ⬆️
6.1-m7i.metal-24xl 82.44% <100.00%> (?)
6.1-m7i.metal-48xl 82.44% <100.00%> (?)
6.1-m8g.metal-24xl 78.99% <100.00%> (?)
6.1-m8g.metal-48xl 78.99% <100.00%> (?)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

roypat added 2 commits August 19, 2025 13:44
In commit e7504ae ("refactor: cleanup vmm::snapshot module"),
firecracker started reading the snapshot vmstate file in a single pass
instead of first loading it into a Vec and then deserializing. This
seems to have caused some performance regression due to the deserializer
doing many successive reads, resulting in many read(2) syscalls.
Fix this by going back to first reading the snapshot file into a buffer,
and then deserializing from slice instead.

Signed-off-by: Patrick Roy <[email protected]>
This is covered by rust intergration tests, but having a unit test that
does the snapshot/restore roundtrip is nice, because I can run it on my
laptop without needing sudo.

Signed-off-by: Patrick Roy <[email protected]>
@roypat roypat added the Status: Awaiting review Indicates that a pull request is ready to be reviewed label Aug 19, 2025
@roypat roypat requested a review from Manciukic August 20, 2025 10:18
@roypat roypat enabled auto-merge (rebase) August 20, 2025 12:42
@roypat roypat merged commit 32a77ca into firecracker-microvm:main Aug 20, 2025
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Status: Awaiting review Indicates that a pull request is ready to be reviewed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants