Fix Intel CPUID leaf 4 cache topology for SMT #1002

glitzflitz · 2026-01-05T03:04:47Z

When SMT is enabled, the L1/L2 caches should report being shared by 2 logical processors (the SMT siblings). Previously, EAX[25:14] was always set to 0, indicating no sharing, which contradicts the SMT topology reported in leaf 0xB. As per [1], EAX[25:14] indicates the maximum number of addressable IDs for logical processors sharing this cache.

This mismatch causes a Linux guest to print "BUG: arch topology borken / the SMT domain not a subset of the CLS domain" during boot. Linux derives L2 cache sharing groups from leaf 4 and expects SMT siblings to share L2, but it was being told that each vCPU has private L1/L2 caches.

This brings the SMT handling logic in CPUID in line with what's being done for AMD in fix_amd_cache_topo(), which sets the sharing count to 2 when has_smt is true. This fixes #1001.

[1]: Table 1-15. Reference for CPUID Leaf 04H
https://cdrdv2-public.intel.com/775917/intel-64-architecture-processor-topology-enumeration.pdf
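
For illustration, the EAX[25:14] encoding described above can be sketched roughly as follows. This is a minimal sketch and not the code in this PR: the helper names `with_cache_sharing` and `cache_sharing` are hypothetical, and it assumes the field stores the sharing count minus one, as Intel's leaf 04H description specifies.

```rust
// EAX[25:14] of CPUID leaf 4: a 12-bit field holding (sharing count - 1).
const SHARING_SHIFT: u32 = 14;
const SHARING_MASK: u32 = 0xfff << SHARING_SHIFT;

/// Return `eax` with EAX[25:14] rewritten to advertise `sharing` logical
/// processors per cache. Hypothetical helper, for illustration only.
fn with_cache_sharing(eax: u32, sharing: u32) -> u32 {
    assert!((1..=4096).contains(&sharing), "field is 12 bits wide");
    (eax & !SHARING_MASK) | ((sharing - 1) << SHARING_SHIFT)
}

/// Decode the number of logical processors sharing the cache from `eax`.
fn cache_sharing(eax: u32) -> u32 {
    ((eax & SHARING_MASK) >> SHARING_SHIFT) + 1
}
```

With this encoding, a field value of 0 decodes to a sharing count of 1 (a private cache), and a sharing count of 2 is stored as a field value of 1.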

iximeow · 2026-01-05T18:31:48Z

oh nice, thanks! totally an oversight when I was in there. how visible is this under /sys/devices/system/cpu/cpu*/cache/index*/shared_cpu_list before the fix? I'm not sure how much Linux fixes this up, or if it faithfully communicates the wrong topology to userland. if this is legible to a booted guest, we probably should have been looking for this in guest_cpu_topo_test.

so, if this is legible in the guest, could you adjust that test to check index*/shared_cpu_list is reasonable too? that is, index0 and index1 are shared across two cores (if SMT is enabled! which, oh dear, the test doesn't assert..), that index2 is shared across all cores, and that there aren't more cache levels?

(.. I also see that in retrospect that test assumes SMT siblings are adjacent in APIC ID, which is definitely wrong in general.)

glitzflitz · 2026-01-05T22:47:06Z

This is what I get before the patch

root@archiso ~ # cat /sys/devices/system/cpu/cpu*/cache/index*/shared_cpu_list
0
0
0
0-3
1
1
1
0-3
2
2
2
0-3
3
3
3
0-3

which indeed shows each vCPU as having its own private L1 and L2 caches.

After the patch I get

root@archiso ~ # cat /sys/devices/system/cpu/cpu*/cache/index*/shared_cpu_list
0-1
0-1
0-1
0-3
0-1
0-1
0-1
0-3
2-3
2-3
2-3
0-3
2-3
2-3
2-3
0-3

Let me add a test for this
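
A test comparing these values needs to turn strings like `0-1` and `0-3` into sets of CPUs. A minimal sketch of that parsing step is below; `parse_cpu_list` is a hypothetical helper, not something taken from the phd-tests code, and it assumes the usual sysfs list format of comma-separated numbers and inclusive ranges.

```rust
/// Parse a sysfs CPU list such as "0-3" or "0,2-3" into sorted CPU numbers.
/// Returns None if any part of the list is malformed.
fn parse_cpu_list(s: &str) -> Option<Vec<u32>> {
    let mut cpus: Vec<u32> = Vec::new();
    for part in s.trim().split(',') {
        match part.split_once('-') {
            // A range "lo-hi" is inclusive on both ends.
            Some((lo, hi)) => {
                let lo = lo.parse::<u32>().ok()?;
                let hi = hi.parse::<u32>().ok()?;
                if lo > hi {
                    return None;
                }
                cpus.extend(lo..=hi);
            }
            // A bare number names a single CPU.
            None => cpus.push(part.parse::<u32>().ok()?),
        }
    }
    cpus.sort_unstable();
    cpus.dedup();
    Some(cpus)
}
```

For the output above, the post-patch L1/L2 entries for cpu0 would parse to CPUs 0 and 1, and the L3 entry to CPUs 0 through 3.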

glitzflitz · 2026-01-05T23:49:25Z

Also I just noticed the sibling_idx here

propolis/phd-tests/tests/src/cpuid.rs

Line 326 in dacb53d

let sibling_idx = idx / 4;

Shouldn't sibling_idx be idx/2 instead of idx/4?
The thread_siblings documentation in Linux is vague, but I think it is a hex bitmask where bit N is set if CPU N is a sibling. With SMT, CPUs 0-1 set bits 0-1, which gives '3', CPUs 2-3 set bits 2-3, which gives 'c', and so on. Since each hex digit covers 4 CPUs and we are iterating over pairs of siblings, we go through 2 pairs before moving to the next hex digit, so idx/2 makes sense instead of idx/4. The current code would only advance sibling_idx every 8 CPUs.
I just ran the test and it fails for me locally.
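
The arithmetic above can be written out as a small sketch. The names `sibling_mask` and `hex_digit_for_pair` are hypothetical, the digit index is counted from the least significant hex digit, and the sketch assumes SMT siblings are adjacent CPU numbers (which, as noted earlier in this thread, is not true in general). It also ignores any zero padding or comma grouping the kernel may apply when printing the mask.

```rust
/// Bitmask of the two SMT siblings containing `cpu`, assuming siblings are
/// the adjacent pair (2k, 2k+1). Limited to 64 CPUs for simplicity.
fn sibling_mask(cpu: u32) -> u64 {
    assert!(cpu < 64);
    let first = cpu & !1; // round down to the even CPU of the pair
    0b11u64 << first
}

/// Which hex digit (counted from the least significant one) holds the bits
/// for sibling pair `pair_idx`. Each hex digit covers 4 CPUs, i.e. 2 pairs.
fn hex_digit_for_pair(pair_idx: u32) -> u32 {
    pair_idx / 2
}
```

Formatting `sibling_mask(0)` with `{:x}` gives `3` and `sibling_mask(2)` gives `c`, matching the values described above; pairs 0 and 1 land in digit 0, pairs 2 and 3 in digit 1.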

When SMT is enabled, L1/L2 caches should report being shared by 2 logical processors (the SMT siblings). Previously EAX[25:14] was always being set to 0, indicating no sharing, which contradicts the SMT topology reported in leaf 0xB. As per [1], EAX[25:14] indicates the maximum number of addressable IDs for logical processors sharing this cache.

This mismatch causes a Linux guest to print "BUG: arch topology borken / the SMT domain not a subset of the CLS domain" during boot. Linux derives L2 cache sharing groups from leaf 4 and expects SMT siblings to share L2, but it was being told that each vCPU has private L1/L2 caches.

This brings the SMT handling logic in CPUID in line with what's being done for AMD in fix_amd_cache_topo(), which sets the sharing count to 2 when has_smt is true. This fixes oxidecomputer#1001.

[1]: Table 1-15. Reference for CPUID Leaf 04H
https://cdrdv2-public.intel.com/775917/intel-64-architecture-processor-topology-enumeration.pdf

Signed-off-by: Amey Narkhede <[email protected]>

The existing test assertion would fail on hosts with SMT enabled due to incorrect index calculations. Also add has_smt() helper to skip thread_siblings checks on non-SMT hosts and remove the unused itertools import. Signed-off-by: Amey Narkhede <[email protected]>

Verify that Linux guest observes correct cache sharing topology from /sys/devices/system/cpu/cpu0/cache/. With SMT enabled, L1 and L2 caches should report sharing by SMT siblings while L3 should be shared across all vCPUs. Signed-off-by: Amey Narkhede <[email protected]>

glitzflitz · 2026-01-06T00:34:08Z

I fixed the sibling_idx calculation in the commit "Fix thread_siblings assertion in guest_cpu_topo_test" and added the new test in the commit "Add cache topology verification to guest_cpu_topo_test". It passes for me locally now.

glitzflitz · 2026-01-06T00:43:44Z

Also noticed this late

> that index2 is shared across all cores, and that there aren't more cache levels?

Do you mean index3 (L3) 😅, which should be shared across all cores? index0 and index1 are both level 1, split between L1i and L1d.

root@archiso ~ # cat /sys/devices/system/cpu/cpu0/cache/index0/level
1
root@archiso ~ # cat /sys/devices/system/cpu/cpu0/cache/index1/level
1
root@archiso ~ # cat /sys/devices/system/cpu/cpu0/cache/index2/level
2
root@archiso ~ # cat /sys/devices/system/cpu/cpu0/cache/index3/level
3

I added a test that checks L1 and L2 are shared among SMT siblings and L3 is shared among all CPUs, in the commit "Add cache topology verification to guest_cpu_topo_test".
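
As a rough sketch of the expectation being described (not the actual test code), the expected sharing set for a cache can be computed from its level. `expected_shared_cpus` is a hypothetical helper, and it assumes SMT siblings are the adjacent pair (2k, 2k+1) and that the guest exposes exactly three cache levels.

```rust
/// Expected set of CPUs sharing a cache of the given level with `cpu`,
/// in a guest with `ncpus` vCPUs. Returns None for unexpected levels.
fn expected_shared_cpus(level: u8, cpu: u32, ncpus: u32, smt: bool) -> Option<Vec<u32>> {
    match level {
        // L1 (both L1d and L1i) and L2 are shared by the SMT sibling pair.
        1 | 2 if smt => {
            let first = cpu & !1;
            Some(vec![first, first + 1])
        }
        // Without SMT, L1 and L2 are private to each vCPU.
        1 | 2 => Some(vec![cpu]),
        // L3 is shared across all vCPUs.
        3 => Some((0..ncpus).collect()),
        // Any further cache level is unexpected.
        _ => None,
    }
}
```

For a 4-vCPU guest with SMT, this yields CPUs 0-1 for cpu0's L1/L2 and CPUs 0-3 for L3, which matches the post-patch shared_cpu_list output shown earlier.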

glitzflitz added 3 commits January 6, 2026 00:01

glitzflitz force-pushed the cpuid branch from d48e531 to 291c8d4 Compare January 6, 2026 00:02

glitzflitz marked this pull request as ready for review January 6, 2026 01:04
