@@ -21,8 +21,10 @@ API and command-line option may change frequently.***
 - [SD3/SD3.5](./docs/sd3.md)
 - [Flux-dev/Flux-schnell](./docs/flux.md)
 - [Chroma](./docs/chroma.md)
+- [Qwen Image](./docs/qwen_image.md)
 - Image Edit Models
 - [FLUX.1-Kontext-dev](./docs/kontext.md)
+- [Qwen Image Edit/Qwen Image Edit 2509](./docs/qwen_image_edit.md)
 - Video Models
 - [Wan2.1/Wan2.2](./docs/wan.md)
 - [PhotoMaker](https://github.com/TencentARC/PhotoMaker) support.
@@ -125,13 +127,14 @@ cmake --build . --config Release
 
 ##### Using HipBLAS
 This provides BLAS acceleration using the ROCm cores of your AMD GPU. Make sure to have the ROCm toolkit installed.
+To build for another GPU architecture than installed in your system, set `$GFX_NAME` manually to the desired architecture (replace first command). This is also necessary if your GPU is not officially supported by ROCm, for example you have to set `$GFX_NAME` manually to `gfx1030` for consumer RDNA2 cards.
 
 Windows User Refer to [docs/hipBLAS_on_Windows.md](docs%2FhipBLAS_on_Windows.md) for a comprehensive guide.
 
 ```
-export GFX_NAME=$(rocminfo | grep -m 1 -E "gfx[^0]{1}" | sed -e 's/ *Name: *//' | awk '{$1=$1; print}' || echo "rocminfo missing")
-echo $GFX_NAME
-cmake .. -G "Ninja" -DCMAKE_C_COMPILER=clang -DCMAKE_CXX_COMPILER=clang++ -DSD_HIPBLAS=ON -DCMAKE_BUILD_TYPE=Release -DGPU_TARGETS=$GFX_NAME -DCMAKE_BUILD_WITH_INSTALL_RPATH=ON
+if command -v rocminfo; then export GFX_NAME=$(rocminfo | awk '/ *Name: +gfx[1-9]/ {print $2; exit}'); else echo "rocminfo missing!"; fi
+if [ -z "${GFX_NAME}" ]; then echo "Error: Couldn't detect GPU!"; else echo "Building for GPU: ${GFX_NAME}"; fi
+cmake .. -G "Ninja" -DCMAKE_C_COMPILER=clang -DCMAKE_CXX_COMPILER=clang++ -DSD_HIPBLAS=ON -DCMAKE_BUILD_TYPE=Release -DGPU_TARGETS=$GFX_NAME -DAMDGPU_TARGETS=$GFX_NAME -DCMAKE_BUILD_WITH_INSTALL_RPATH=ON -DCMAKE_POSITION_INDEPENDENT_CODE=ON
 cmake --build . --config Release
 ```
 
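The detection logic in the hunk above can be sketched as two small shell functions. This is only an illustration under stated assumptions: `parse_gfx_name` and `resolve_gfx_name` are hypothetical helper names that do not exist in the repository, and the sketch assumes `rocminfo` prints agent lines of the form `Name: gfx1030`. The `awk` pattern is the one from the updated README command.

```shell
#!/usr/bin/env bash
# Hypothetical helpers, not part of stable-diffusion.cpp.

# Read rocminfo-style text on stdin and print the first agent name that
# starts with "gfx" followed by a non-zero digit (skips e.g. "gfx000").
parse_gfx_name() {
    awk '/ *Name: +gfx[1-9]/ {print $2; exit}'
}

# An explicitly set GFX_NAME wins (manual override for unsupported GPUs
# or cross builds); otherwise fall back to rocminfo if it is installed.
resolve_gfx_name() {
    if [ -n "${GFX_NAME:-}" ]; then
        printf '%s\n' "${GFX_NAME}"
    elif command -v rocminfo >/dev/null 2>&1; then
        rocminfo | parse_gfx_name
    fi
}
```

For example, `GFX_NAME=gfx1030 resolve_gfx_name` prints the override unchanged, which corresponds to the manual setting the README recommends for consumer RDNA2 cards. An empty result means no target was detected and the `cmake` call should not be run.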
@@ -284,7 +287,7 @@ usage: ./bin/sd [arguments]
 
 arguments:
   -h, --help                       show this help message and exit
-  -M, --mode [MODE]                run mode, one of: [img_gen, vid_gen, convert], default: img_gen
+  -M, --mode [MODE]                run mode, one of: [img_gen, vid_gen, upscale, convert], default: img_gen
   -t, --threads N                  number of threads to use during computation (default: -1)
                                    If threads <= 0, then threads will be set to the number of CPU physical cores
   --offload-to-cpu                 place the weights in RAM to save VRAM, and automatically load them into VRAM when needed
@@ -295,11 +298,13 @@ arguments:
   --clip_g                         path to the clip-g text encoder
   --clip_vision                    path to the clip-vision encoder
   --t5xxl                          path to the t5xxl text encoder
+  --qwen2vl                        path to the qwen2vl text encoder
+  --qwen2vl_vision                 path to the qwen2vl vit
   --vae [VAE]                      path to vae
   --taesd [TAESD_PATH]             path to taesd. Using Tiny AutoEncoder for fast decoding (low quality)
   --control-net [CONTROL_PATH]     path to control net model
   --embd-dir [EMBEDDING_PATH]      path to embeddings
-  --upscale-model [ESRGAN_PATH]    path to esrgan model. Upscale images after generate, just RealESRGAN_x4plus_anime_6B supported by now
+  --upscale-model [ESRGAN_PATH]    path to esrgan model. For img_gen mode, upscale images after generate, just RealESRGAN_x4plus_anime_6B supported by now
   --upscale-repeats                Run the ESRGAN upscaler this many times (default 1)
   --type [TYPE]                    weight type (examples: f32, f16, q4_0, q4_1, q5_0, q5_1, q8_0, q2_K, q3_K, q4_K)
                                    If not specified, the default is the type of the weight file
@@ -448,6 +453,7 @@ These projects use `stable-diffusion.cpp` as a backend for their image generatio
 - [Local Diffusion](https://github.com/rmatif/Local-Diffusion)
 - [sd.cpp-webui](https://github.com/daniandtheweb/sd.cpp-webui)
 - [LocalAI](https://github.com/mudler/LocalAI)
+- [Neural-Pixel](https://github.com/Luiz-Alcantara/Neural-Pixel)
 
 ## Contributors
 
@@ -462,6 +468,7 @@ Thank you to all the people who have already contributed to stable-diffusion.cpp
 ## References
 
 - [ggml](https://github.com/ggerganov/ggml)
+- [diffusers](https://github.com/huggingface/diffusers)
 - [stable-diffusion](https://github.com/CompVis/stable-diffusion)
 - [sd3-ref](https://github.com/Stability-AI/sd3-ref)
 - [stable-diffusion-stability-ai](https://github.com/Stability-AI/stablediffusion)
@@ -472,4 +479,4 @@ Thank you to all the people who have already contributed to stable-diffusion.cpp
 - [generative-models](https://github.com/Stability-AI/generative-models/)
 - [PhotoMaker](https://github.com/TencentARC/PhotoMaker)
 - [Wan2.1](https://github.com/Wan-Video/Wan2.1)
-- [Wan2.2](https://github.com/Wan-Video/Wan2.2)
+- [Wan2.2](https://github.com/Wan-Video/Wan2.2)