-
Notifications
You must be signed in to change notification settings - Fork 431
Closed
Description
Hi,
Me again. I have a segmentation fault issue when I try to generate a video.
./sd -M vid_gen --diffusion-model ./models/wan/Wan2.2-TI2V-5B-Q4_0.gguf \
--vae ./models/wan/wan2.2_vae.safetensors \
--t5xxl ./models/wan/umt5-xxl-encoder-Q3_K_S.gguf \
-p "a lovely girl" \
--cfg-scale 6.0 \
--sampling-method euler -v -W 480 -H 832 \
--diffusion-fa \
--offload-to-cpu \
--video-frames 33 \
--flow-shift 3.0
Option:
n_threads: 6
mode: vid_gen
model_path:
wtype: unspecified
clip_l_path:
clip_g_path:
clip_vision_path:
t5xxl_path: ./models/wan/umt5-xxl-encoder-Q3_K_S.gguf
diffusion_model_path: ./models/wan/Wan2.2-TI2V-5B-Q4_0.gguf
high_noise_diffusion_model_path:
vae_path: ./models/wan/wan2.2_vae.safetensors
taesd_path:
esrgan_path:
control_net_path:
embedding_dir:
stacked_id_embed_dir:
input_id_images_path:
style ratio: 20.00
normalize input image: false
output_path: output.png
init_image_path:
end_image_path:
mask_image_path:
control_image_path:
ref_images_paths:
increase_ref_index: false
offload_params_to_cpu: true
clip_on_cpu: false
control_net_cpu: false
vae_on_cpu: false
diffusion flash attention: true
diffusion Conv2d direct: false
vae_conv_direct: false
control_strength: 0.90
prompt: a lovely girl
negative_prompt:
clip_skip: -1
width: 480
height: 832
sample_params: (txt_cfg: 6.00, img_cfg: 6.00, distilled_guidance: 3.50, slg.layer_count: 3, slg.layer_start: 0.01, slg.layer_end: 0.20, slg.scale: 0.00, scheduler: default, sample_method: euler, sample_steps: 20, eta: 0.00)
high_noise_sample_params: (txt_cfg: 7.00, img_cfg: 7.00, distilled_guidance: 3.50, slg.layer_count: 3, slg.layer_start: 0.01, slg.layer_end: 0.20, slg.scale: 0.00, scheduler: default, sample_method: euler_a, sample_steps: -1, eta: 0.00)
moe_boundary: 0.875
flow_shift: 3.00
strength(img2img): 0.75
rng: cuda
seed: 42
batch_count: 1
vae_tiling: false
upscale_repeats: 1
chroma_use_dit_mask: true
chroma_use_t5_mask: false
chroma_t5_mask_pad: 1
video_frames: 33
fps: 16
System Info:
SSE3 = 1
AVX = 1
AVX2 = 1
AVX512 = 0
AVX512_VBMI = 0
AVX512_VNNI = 0
FMA = 1
NEON = 0
ARM_FMA = 0
F16C = 1
FP16_VA = 0
WASM_SIMD = 0
VSX = 0
[DEBUG] stable-diffusion.cpp:151 - Using Vulkan backend
[DEBUG] ggml_extend.hpp:62 - ggml_vulkan: Found 1 Vulkan devices:
[DEBUG] ggml_extend.hpp:62 - ggml_vulkan: 0 = Radeon RX 5500 XT (RADV NAVI14) (radv) | uma: 0 | fp16: 1 | bf16: 0 | warp size: 32 | shared memory: 65536 | int dot: 0 | matrix cores: none
[INFO ] stable-diffusion.cpp:208 - loading diffusion model from './models/wan/Wan2.2-TI2V-5B-Q4_0.gguf'
[INFO ] model.cpp:1035 - load ./models/wan/Wan2.2-TI2V-5B-Q4_0.gguf using gguf format
[DEBUG] model.cpp:1052 - init from './models/wan/Wan2.2-TI2V-5B-Q4_0.gguf'
[ERROR] ggml_extend.hpp:71 - gguf_init_from_file_impl: tensor 'patch_embedding.weight' has invalid number of dimensions: 5 > 4
[ERROR] ggml_extend.hpp:71 - gguf_init_from_file_impl: failed to read tensor info
[ERROR] model.cpp:1061 - failed to open './models/wan/Wan2.2-TI2V-5B-Q4_0.gguf' with gguf_init_from_file. Try to open it with GGUFReader.
[DEBUG] gguf_reader.hpp:197 - GGUF v3, tensor_count=825, metadata_kv_count=3
[INFO ] stable-diffusion.cpp:248 - loading t5xxl from './models/wan/umt5-xxl-encoder-Q3_K_S.gguf'
[INFO ] model.cpp:1035 - load ./models/wan/umt5-xxl-encoder-Q3_K_S.gguf using gguf format
[DEBUG] model.cpp:1052 - init from './models/wan/umt5-xxl-encoder-Q3_K_S.gguf'
[INFO ] stable-diffusion.cpp:255 - loading vae from './models/wan/wan2.2_vae.safetensors'
[INFO ] model.cpp:1038 - load ./models/wan/wan2.2_vae.safetensors using safetensors format
[DEBUG] model.cpp:1145 - init from './models/wan/wan2.2_vae.safetensors', prefix = 'vae.'
[DEBUG] model.cpp:1778 - patch_embedding_channels 147456
[INFO ] stable-diffusion.cpp:267 - Version: Wan 2.2 TI2V
[INFO ] stable-diffusion.cpp:301 - Weight type: q4_0
[INFO ] stable-diffusion.cpp:302 - Conditioner weight type: q3_K
[INFO ] stable-diffusion.cpp:303 - Diffusion model weight type: q4_0
[INFO ] stable-diffusion.cpp:304 - VAE weight type: NONE
[DEBUG] stable-diffusion.cpp:306 - ggml tensor size = 400 bytes
[WARN ] stable-diffusion.cpp:335 - !!!It appears that you are using the T5 model. Some backends may encounter issues with it.If you notice that the generated images are completely black,try running the T5 model on the CPU using the --clip-on-cpu parameter.
[INFO ] stable-diffusion.cpp:345 - Using flash attention in the diffusion model
[INFO ] wan.hpp:1999 - Wan2.2-TI2V-5B
[DEBUG] ggml_extend.hpp:1609 - t5 params backend buffer size = 2720.00 MB(RAM) (242 tensors)
[DEBUG] ggml_extend.hpp:1609 - Wan2.2-TI2V-5B params backend buffer size = 2892.18 MB(RAM) (825 tensors)
[DEBUG] ggml_extend.hpp:1609 - wan_vae params backend buffer size = 1344.24 MB(RAM) (196 tensors)
[DEBUG] stable-diffusion.cpp:566 - loading weights
[DEBUG] model.cpp:1998 - loading tensors from ./models/wan/Wan2.2-TI2V-5B-Q4_0.gguf
|==================================================| 1263/1263 - 58.95it/s
[DEBUG] model.cpp:1998 - loading tensors from ./models/wan/umt5-xxl-encoder-Q3_K_S.gguf
|==================================================| 1263/1263 - 63.36it/s
[DEBUG] model.cpp:1998 - loading tensors from ./models/wan/wan2.2_vae.safetensors
|==================================================| 1263/1263 - 176.05it/s
[INFO ] model.cpp:2216 - loading tensors completed, taking 48.64s (process: 0.10s, read: 48.50s, memcpy: 0.00s, convert: 0.03s, copy_to_backend: 0.00s)
[INFO ] stable-diffusion.cpp:647 - total params memory size = 6956.42MB (VRAM 6956.42MB, RAM 0.00MB): text_encoders 2720.00MB(VRAM), diffusion_model 2892.18MB(VRAM), vae 1344.24MB(VRAM), controlnet 0.00MB(VRAM), pmid 0.00MB(VRAM)
[INFO ] stable-diffusion.cpp:702 - running in FLOW mode
[DEBUG] stable-diffusion.cpp:726 - finished loaded file
[INFO ] stable-diffusion.cpp:2405 - generate_video 480x832x33
[INFO ] stable-diffusion.cpp:860 - attempting to apply 0 LoRAs
[INFO ] stable-diffusion.cpp:880 - apply_loras completed, taking 0.00s
[DEBUG] stable-diffusion.cpp:881 - prompt after extract and remove lora: "a lovely girl"
[DEBUG] conditioner.hpp:1287 - parse 'a lovely girl' to [['a lovely girl', 1], ]
[DEBUG] t5.hpp:402 - token length: 512
[INFO ] ggml_extend.hpp:1534 - t5 offload params (2720.00 MB, 242 tensors) to runtime backend (Vulkan0), taking 0.99s
[DEBUG] ggml_extend.hpp:1437 - t5 compute buffer size: 297.00 MB(VRAM)
[1] 10483 segmentation fault (core dumped) ./sd -M vid_gen --diffusion-model ./models/wan/Wan2.2-TI2V-5B-Q4_0.gguf --vae
Metadata
Metadata
Assignees
Labels
No labels