Enable loading in half precision #73

hum-ma · 2026-01-12T13:54:30Z

I noticed there already were functions for setting autocast dtype depending on GPU capability. However, they were restricted to compute major version >= 7 whereas my GPU with compute 5.2 can also run models in float16, and in fact it is the only way to run SAM3 because it crashes with OOM if loading in float32 with 4 GB VRAM. I don't know if compute 6.x devices are good with float16 so the detection rule could be tuned further if they have problems.

I moved dtype detection into load_model, save it with the sam3_model and subsequently use it to autocast accordingly. Loading the video model with .half() is the key change to reduce memory use, but the autocasts are necessary to prevent tensor type mismatch errors during processing. I hope I caught them for all use cases...

The interactive detection dialog doesn't use the same loaded model so I added separate detection for it.

hum-ma added 11 commits January 12, 2026 15:28

Enable half precision

70ec248

allow autocast

4f7cab4

enable loading in half precision

2c5740d

dtype detection and autocast

351ddcf

.half() modifies model in-place

49d1e9a

.half() in place; add bfloat16()

f1000c1

add cache clearing function

db748ff

attention tensor offloading and restoration

29443ce

recursive function for offloading/cache clearing

549fca2

Add files via upload

724ad50

clear components from vram when offloading

ba84038

hum-ma mentioned this pull request Jan 17, 2026

Vram residual issue. #51

Open

This was referenced Jan 26, 2026

part of VRAM memory freazing after finishing SAM3 model. yolain/ComfyUI-Easy-Sam3#24

Open

part of VRAM memory freazing after finishing SAM3 model. Ltamann/ComfyUI-TBG-SAM3#18

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable loading in half precision #73

Enable loading in half precision #73

Uh oh!

hum-ma commented Jan 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Enable loading in half precision #73

Are you sure you want to change the base?

Enable loading in half precision #73

Uh oh!

Conversation

hum-ma commented Jan 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant