Skip to content

Commit 30f13ed

Browse files
committed
update
1 parent d4b88d7 commit 30f13ed

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

tests/quantization/gguf/test_gguf.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -82,6 +82,7 @@ def test_gguf_memory(self):
8282
self.ckpt_path, quantization_config=quantization_config, torch_dtype=self.torch_dtype
8383
)
8484
model.to("cuda")
85+
assert (model.get_memory_footprint() / 1024**3) < 5
8586
inputs = self.get_dummy_inputs()
8687

8788
torch.cuda.reset_peak_memory_stats()

0 commit comments

Comments
 (0)