CUDA out of memory.

when I use 5090*8 and
torchrun --nproc_per_node=8 sample_video.py \
     --video-size 128 128  \
	 --video-length 129     \
	 --infer-steps 10       \
	 --prompt "A cat walks on the grass, realistic style."  \
	 --seed 42      \
	 --embedded-cfg-scale 6.0     \
	 --flow-shift 7.0      \
	 --flow-reverse \
	 --ulysses-degree=4    \
	 --ring-degree=2       \
	 --save-path ./results

why CUDA out of memory?
[rank1]: torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 112.00 MiB. GPU 1 has a total capacity of 31.37 GiB of which 14.38 MiB is free. Including non-PyTorch memory, this process has 31.35 GiB memory in use. Of the allocated memory 30.69 GiB is allocated by PyTorch, and 68.60 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True to avoid fragmentation.  See documentation for Memory Management  (https://pytorch.org/docs/stable/notes/cuda.html#environment-variables)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CUDA out of memory. #299

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

CUDA out of memory. #299

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions