CloudHv: bump PcdCpuMaxLogicalProcessorNumber to 256 #6
Closed
tpressure wants to merge 1 commit into cloud-hypervisor:ch from
Conversation
Signed-off-by: Thomas Prescher <thomas.prescher@cyberus-technology.de> On-behalf-of: SAP thomas.prescher@sap.com
tpressure (Author):
@rbradford is this the right place for this change, or should I rather propose this in upstream EDK2?
Member:
@tpressure Yes, you should send it upstream - this repository only exists to produce binary assets for the main CH CI.
mergify bot pushed a commit to tpressure/edk2 that referenced this pull request on Jul 24, 2025:
…eEntry

This patch introduces a synchronization point between the BSP and APs to ensure all APs have entered their SMM wait loop (while (TRUE) in APHandler()) before the BSP calls into the SMI handler logic via gSmmCpuPrivate->SmmCoreEntry().

Previously, the BSP would invoke ReleaseAllAPs() and immediately proceed to SmmCoreEntry() without confirming whether the APs had reached the stable waiting state. If SmmStartupThisAp() was called inside the SMI handler shortly after ReleaseAllAPs(), this could lead to a race condition:

- APs are issued two consecutive wait signals (SmmCpuSyncWaitForBsp()).
- The BSP sends two consecutive releases (ReleaseAllAPs() + SmmStartupThisAp()).

If an AP has not yet responded to the first release, the second release may overwrite the semaphore state, and the AP might miss the notification, causing it to hang or behave unpredictably.

To address this:

- A SmmCpuSyncWaitForAPs() call is added in the BSP after SmmCpuPlatformHookBeforeMmiHandler() and before entering SmmCoreEntry().
- A matching SmmCpuSyncReleaseBsp() call is added in each AP immediately after its own SmmCpuPlatformHookBeforeMmiHandler().

This ensures that the BSP does not enter SMI handler logic or dispatch any AP-related requests before all APs are confirmed to be idle and ready. Debug sync point markers (e.g., /// cloud-hypervisor#6, tianocore#7) are updated accordingly.

This change eliminates a subtle but critical race condition in multi-processor/multi-socket systems during SMM entry and improves overall synchronization safety.

Signed-off-by: Wei6 Xu <wei6.xu@intel.com>
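A minimal C-style sketch of the ordering described in that commit message, assuming simplified zero-argument stand-ins for the real SmmCpuSyncLib and platform-hook functions (the actual PiSmmCpuDxeSmm prototypes take sync-context and CPU-index parameters not shown here):

```c
//
// Sketch of the BSP/AP rendezvous described above. The function names
// come from the commit message; the zero-argument prototypes below are
// simplified stand-ins, not the real EDK2 signatures.
//
typedef void VOID;   // stand-in for the EDK2 base type

extern VOID ReleaseAllAPs (void);
extern VOID SmmCpuPlatformHookBeforeMmiHandler (void);
extern VOID SmmCpuSyncWaitForAPs (void);
extern VOID SmmCpuSyncReleaseBsp (void);
extern VOID SmmCoreEntry (void);
extern VOID WaitForBspRequests (void);   // models the while (TRUE) loop in APHandler ()

VOID
BspFlow (void)
{
  ReleaseAllAPs ();                       // wake the APs
  SmmCpuPlatformHookBeforeMmiHandler ();  // BSP-side platform hook

  //
  // New sync point: do not proceed until every AP has signalled that it
  // is parked in its wait loop, so a later SmmStartupThisAp () cannot
  // race with the initial release.
  //
  SmmCpuSyncWaitForAPs ();

  SmmCoreEntry ();                        // now safe to run SMI handlers
}

VOID
ApFlow (void)
{
  SmmCpuPlatformHookBeforeMmiHandler ();  // AP-side platform hook

  //
  // Matching release: tell the BSP this AP has reached its stable wait
  // state before it starts spinning for SmmStartupThisAp () requests.
  //
  SmmCpuSyncReleaseBsp ();

  WaitForBspRequests ();                  // wait loop in APHandler ()
}
```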
Problem
When using CLOUDHV.fd on cloud-hypervisor, only 64 vCPUs in total can be used. This is because cloud-hypervisor does not (yet) implement a fw_cfg device that reports the maximum number of logical processors, and the default cloud-hypervisor configuration sets this value to 64. The firmware therefore only allocates stacks for 64 CPUs, and when more than 64 vCPUs are in use, random vCPUs crash due to stack collisions.
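As a hedged illustration only (the real stack setup lives in EDK2's MP init code and is more involved), the sketch below models why a build-time cap of 64 CPUs leads to collisions once a 65th vCPU comes up: each CPU's stack is an offset into one reserved region whose size is derived from the cap.

```c
#include <stdint.h>
#include <stdio.h>

/*
 * Simplified model of the failure mode, not the actual EDK2 MP init code:
 * the firmware reserves one stack slot per CPU it was built to support
 * (PcdCpuMaxLogicalProcessorNumber), so a CPU index beyond that limit
 * ends up on memory that was never reserved for it.
 */
#define AP_STACK_SIZE     0x8000u  /* illustrative per-CPU stack size        */
#define MAX_LOGICAL_CPUS  64u      /* models PcdCpuMaxLogicalProcessorNumber */

/* Size of the contiguous region set aside for per-CPU stacks. */
static size_t
reserved_stack_region_size (void)
{
  return (size_t)MAX_LOGICAL_CPUS * AP_STACK_SIZE;
}

/* Offset of a given CPU's stack top inside that region. */
static size_t
stack_top_offset_for_cpu (uint32_t cpu_index)
{
  return ((size_t)cpu_index + 1) * AP_STACK_SIZE;
}

int
main (void)
{
  /* CPU 63 still fits; CPU 64 falls outside the reserved region,
     i.e. it would run on memory owned by something else.          */
  for (uint32_t cpu = 63; cpu <= 64; cpu++) {
    printf ("CPU %u stack top offset 0x%zx, region size 0x%zx -> %s\n",
            cpu,
            stack_top_offset_for_cpu (cpu),
            reserved_stack_region_size (),
            stack_top_offset_for_cpu (cpu) <= reserved_stack_region_size ()
              ? "in bounds" : "collision");
  }
  return 0;
}
```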
Proposed solution
We should bump the default here to at least what cloud-hypervisor currently supports, that is, 254 vCPUs.
I'm also open to bumping this to 1024 or 8192 (the maximum number of CPUs Linux supports), since I'm working on >254 vCPU support in CHV anyway.
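For reference, the change itself boils down to a one-line PCD bump in the CloudHv platform description. The exact file (likely OvmfPkg/CloudHv/CloudHvX64.dsc) and the DSC section it sits in are assumptions here; the PCD name and new value of 256 come from the PR title, and 64 is the current default mentioned above:

```diff
-  gUefiCpuPkgTokenSpaceGuid.PcdCpuMaxLogicalProcessorNumber|64
+  gUefiCpuPkgTokenSpaceGuid.PcdCpuMaxLogicalProcessorNumber|256
```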