Skip to content

Conversation

bchalios
Copy link
Contributor

Changes

Teach MicroVM.kill() that there might be cases where the microVM process has already died and let it finish its cleanup gracefully.

Reason

We have some tests that stress the microVM in a way that might cause it to die. We do check whether that's the case, however this is inherently racy, i.e. the microVM might die on us after the check but before the fixture logic tries to recycle it. Such a test is test_balloon.py::test_deflate_oom.

License Acceptance

By submitting this pull request, I confirm that my contribution is made under
the terms of the Apache 2.0 license. For more information on following Developer
Certificate of Origin and signing off your commits, please check
CONTRIBUTING.md.

PR Checklist

  • I have read and understand CONTRIBUTING.md.
  • I have run tools/devtool checkbuild --all to verify that the PR passes
    build checks on all supported architectures.
  • I have run tools/devtool checkstyle to verify that the PR passes the
    automated style checks.
  • I have described what is done in these changes, why they are needed, and
    how they are solving the problem in a clear and encompassing way.
  • I have updated any relevant documentation (both in code and in the docs)
    in the PR.
  • I have mentioned all user-facing changes in CHANGELOG.md.
  • If a specific issue led to this PR, this PR closes the issue.
  • When making API changes, I have followed the
    Runbook for Firecracker API changes.
  • I have tested all new and changed functionalities in unit tests and/or
    integration tests.
  • I have linked an issue to every new TODO.

  • This functionality cannot be added in rust-vmm.

@bchalios bchalios force-pushed the graceful_killing_microvms branch from 9302beb to 1521d35 Compare September 17, 2025 08:28
Copy link

codecov bot commented Sep 17, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 82.79%. Comparing base (c28dbdb) to head (c7b0bec).
⚠️ Report is 1 commits behind head on main.

Additional details and impacted files
@@           Coverage Diff           @@
##             main    #5443   +/-   ##
=======================================
  Coverage   82.79%   82.79%           
=======================================
  Files         263      263           
  Lines       27301    27301           
=======================================
  Hits        22603    22603           
  Misses       4698     4698           
Flag Coverage Δ
5.10-m5n.metal 82.98% <ø> (+<0.01%) ⬆️
5.10-m6a.metal 82.23% <ø> (+<0.01%) ⬆️
5.10-m6g.metal 79.57% <ø> (+<0.01%) ⬆️
5.10-m6i.metal 82.97% <ø> (-0.01%) ⬇️
5.10-m7a.metal-48xl 82.22% <ø> (ø)
5.10-m7g.metal 79.57% <ø> (ø)
5.10-m7i.metal-24xl 82.94% <ø> (+<0.01%) ⬆️
5.10-m7i.metal-48xl 82.94% <ø> (-0.01%) ⬇️
5.10-m8g.metal-24xl 79.57% <ø> (ø)
5.10-m8g.metal-48xl 79.57% <ø> (ø)
6.1-m5n.metal 83.01% <ø> (ø)
6.1-m6a.metal 82.27% <ø> (-0.01%) ⬇️
6.1-m6g.metal 79.56% <ø> (-0.01%) ⬇️
6.1-m6i.metal 83.00% <ø> (-0.01%) ⬇️
6.1-m7a.metal-48xl 82.25% <ø> (ø)
6.1-m7g.metal 79.56% <ø> (-0.01%) ⬇️
6.1-m7i.metal-24xl 83.01% <ø> (ø)
6.1-m7i.metal-48xl 83.01% <ø> (-0.01%) ⬇️
6.1-m8g.metal-24xl 79.56% <ø> (-0.01%) ⬇️
6.1-m8g.metal-48xl 79.57% <ø> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

We have some tests that stress the microVM in a way that _might_ cause
it to die. We do check whether that's the case, however this is
inherently racy, i.e. the microVM might die on us after the check but
before the fixture logic tries to recycle it. Such a test is
test_balloon.py::test_deflate_oom.

In order to avoid such situations, teach MicroVM.kill() that there might
be cases where the microVM process has already died and let it finish
its cleanup gracefully.

Signed-off-by: Babis Chalios <[email protected]>
@bchalios bchalios force-pushed the graceful_killing_microvms branch from 1521d35 to c7b0bec Compare September 17, 2025 08:59
@bchalios bchalios marked this pull request as ready for review September 17, 2025 08:59
@bchalios bchalios added the Status: Awaiting review Indicates that a pull request is ready to be reviewed label Sep 17, 2025
@bchalios bchalios enabled auto-merge (rebase) September 17, 2025 09:35
@bchalios bchalios merged commit dcfc4c1 into firecracker-microvm:main Sep 17, 2025
8 checks passed
@bchalios bchalios deleted the graceful_killing_microvms branch September 17, 2025 10:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Status: Awaiting review Indicates that a pull request is ready to be reviewed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants