Fix DMA buffer allocation to respect contiguous flag

Yourself · X547 · commit b83fadfea5cb · 2026-01-20T23:18:31.000+09:00
Remove the debug override (true || contiguous) that forced every DMA
allocation to be physically contiguous. Large allocations failed with
NV_ERR_NO_MEMORY (0x51) because a large physically contiguous block is
often unavailable, especially on systems with fragmented memory.

nv_alloc_pages() now requests B_CONTIGUOUS only when the caller sets
the contiguous flag and uses B_FULL_LOCK otherwise. This allows
non-contiguous DMA buffers, enabling large model loading in llama.cpp
and other applications that need significant GPU memory.
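
For illustration only, the change in lock-mode selection can be
sketched outside the kernel as below. LockMode, lock_mode_before() and
lock_mode_after() are hypothetical names that do not exist in the
driver; the enum stands in for Haiku's B_FULL_LOCK and B_CONTIGUOUS so
that the sketch compiles on any host.

```cpp
// Hypothetical stand-ins for Haiku's area lock modes. The driver itself
// passes the real B_FULL_LOCK / B_CONTIGUOUS constants to create_area().
enum class LockMode {
	FullLock,    // stand-in for B_FULL_LOCK: pages locked, may be scattered
	Contiguous   // stand-in for B_CONTIGUOUS: pages locked and contiguous
};

// Selection before this commit: the debug override short-circuits the
// condition, so the caller's flag is never consulted.
constexpr LockMode lock_mode_before(bool contiguous)
{
	return (true || contiguous) ? LockMode::Contiguous : LockMode::FullLock;
}

// Selection after this commit: the caller's flag decides.
constexpr LockMode lock_mode_after(bool contiguous)
{
	return contiguous ? LockMode::Contiguous : LockMode::FullLock;
}

// Compile-time checks of the difference between the two versions.
static_assert(lock_mode_before(false) == LockMode::Contiguous,
	"old code forced contiguous memory even when it was not requested");
static_assert(lock_mode_after(false) == LockMode::FullLock,
	"new code allows scattered pages when contiguity is not requested");
static_assert(lock_mode_after(true) == LockMode::Contiguous,
	"new code still honours an explicit contiguous request");
```

Callers that pass contiguous = true see no change in behaviour; only
requests that never needed contiguous memory are affected.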

Tested with the Llama 3.1 8B model (4.5 GB), which previously failed
to load.
diff --git a/nvidia_gsp/nvidia/os-haiku.cpp b/nvidia_gsp/nvidia/os-haiku.cpp
@@ -418,7 +418,7 @@ NV_STATUS NV_API_CALL nv_alloc_pages(
 	alloc->area.SetTo(create_area(
 		"DMA Buffer",
 		&address, B_ANY_KERNEL_ADDRESS, (uint64)page_count * B_PAGE_SIZE,
-		(true || contiguous) ? B_CONTIGUOUS : B_FULL_LOCK,
+		contiguous ? B_CONTIGUOUS : B_FULL_LOCK,
 		B_KERNEL_READ_AREA | B_KERNEL_WRITE_AREA | B_CLONEABLE_AREA
 	));