Skip to content

Commit bda0afd

Browse files
authored
Merge branch 'main' into xpu
2 parents b7148d6 + b9e9965 commit bda0afd

File tree

235 files changed

+30515
-12906
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

235 files changed

+30515
-12906
lines changed

.github/workflows/nightly_tests.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -248,7 +248,7 @@ jobs:
248248
BIG_GPU_MEMORY: 40
249249
run: |
250250
python -m pytest -n 1 --max-worker-restart=0 --dist=loadfile \
251-
-m "big_gpu_with_torch_cuda" \
251+
-m "big_accelerator" \
252252
--make-reports=tests_big_gpu_torch_cuda \
253253
--report-log=tests_big_gpu_torch_cuda.log \
254254
tests/

.github/workflows/pr_tests_gpu.yml

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ on:
1313
- "src/diffusers/loaders/peft.py"
1414
- "tests/pipelines/test_pipelines_common.py"
1515
- "tests/models/test_modeling_common.py"
16+
- "examples/**/*.py"
1617
workflow_dispatch:
1718

1819
concurrency:
@@ -188,7 +189,7 @@ jobs:
188189
shell: bash
189190
strategy:
190191
fail-fast: false
191-
max-parallel: 2
192+
max-parallel: 4
192193
matrix:
193194
module: [models, schedulers, lora, others]
194195
steps:

docker/diffusers-doc-builder/Dockerfile

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,6 +47,10 @@ RUN python3.10 -m pip install --no-cache-dir --upgrade pip uv==0.1.11 && \
4747
tensorboard \
4848
transformers \
4949
matplotlib \
50-
setuptools==69.5.1
50+
setuptools==69.5.1 \
51+
bitsandbytes \
52+
torchao \
53+
gguf \
54+
optimum-quanto
5155

5256
CMD ["/bin/bash"]

docs/source/en/_toctree.yml

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -93,6 +93,26 @@
9393
- local: hybrid_inference/api_reference
9494
title: API Reference
9595
title: Hybrid Inference
96+
- sections:
97+
- local: modular_diffusers/overview
98+
title: Overview
99+
- local: modular_diffusers/modular_pipeline
100+
title: Modular Pipeline
101+
- local: modular_diffusers/components_manager
102+
title: Components Manager
103+
- local: modular_diffusers/modular_diffusers_states
104+
title: Modular Diffusers States
105+
- local: modular_diffusers/pipeline_block
106+
title: Pipeline Block
107+
- local: modular_diffusers/sequential_pipeline_blocks
108+
title: Sequential Pipeline Blocks
109+
- local: modular_diffusers/loop_sequential_pipeline_blocks
110+
title: Loop Sequential Pipeline Blocks
111+
- local: modular_diffusers/auto_pipeline_blocks
112+
title: Auto Pipeline Blocks
113+
- local: modular_diffusers/end_to_end_guide
114+
title: End-to-End Example
115+
title: Modular Diffusers
96116
- sections:
97117
- local: using-diffusers/consisid
98118
title: ConsisID
@@ -333,6 +353,8 @@
333353
title: SanaTransformer2DModel
334354
- local: api/models/sd3_transformer2d
335355
title: SD3Transformer2DModel
356+
- local: api/models/skyreels_v2_transformer_3d
357+
title: SkyReelsV2Transformer3DModel
336358
- local: api/models/stable_audio_transformer
337359
title: StableAudioDiTModel
338360
- local: api/models/transformer2d
@@ -527,6 +549,8 @@
527549
title: Semantic Guidance
528550
- local: api/pipelines/shap_e
529551
title: Shap-E
552+
- local: api/pipelines/skyreels_v2
553+
title: SkyReels-V2
530554
- local: api/pipelines/stable_audio
531555
title: Stable Audio
532556
- local: api/pipelines/stable_cascade

docs/source/en/api/cache.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -28,3 +28,9 @@ Cache methods speedup diffusion transformers by storing and reusing intermediate
2828
[[autodoc]] FasterCacheConfig
2929

3030
[[autodoc]] apply_faster_cache
31+
32+
### FirstBlockCacheConfig
33+
34+
[[autodoc]] FirstBlockCacheConfig
35+
36+
[[autodoc]] apply_first_block_cache

docs/source/en/api/loaders/lora.md

Lines changed: 7 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -26,6 +26,7 @@ LoRA is a fast and lightweight training method that inserts and trains a signifi
2626
- [`HunyuanVideoLoraLoaderMixin`] provides similar functions for [HunyuanVideo](https://huggingface.co/docs/diffusers/main/en/api/pipelines/hunyuan_video).
2727
- [`Lumina2LoraLoaderMixin`] provides similar functions for [Lumina2](https://huggingface.co/docs/diffusers/main/en/api/pipelines/lumina2).
2828
- [`WanLoraLoaderMixin`] provides similar functions for [Wan](https://huggingface.co/docs/diffusers/main/en/api/pipelines/wan).
29+
- [`SkyReelsV2LoraLoaderMixin`] provides similar functions for [SkyReels-V2](https://huggingface.co/docs/diffusers/main/en/api/pipelines/skyreels_v2).
2930
- [`CogView4LoraLoaderMixin`] provides similar functions for [CogView4](https://huggingface.co/docs/diffusers/main/en/api/pipelines/cogview4).
3031
- [`AmusedLoraLoaderMixin`] is for the [`AmusedPipeline`].
3132
- [`HiDreamImageLoraLoaderMixin`] provides similar functions for [HiDream Image](https://huggingface.co/docs/diffusers/main/en/api/pipelines/hidream)
@@ -92,6 +93,10 @@ To learn more about how to load LoRA weights, see the [LoRA](../../using-diffuse
9293

9394
[[autodoc]] loaders.lora_pipeline.WanLoraLoaderMixin
9495

96+
## SkyReelsV2LoraLoaderMixin
97+
98+
[[autodoc]] loaders.lora_pipeline.SkyReelsV2LoraLoaderMixin
99+
95100
## AmusedLoraLoaderMixin
96101

97102
[[autodoc]] loaders.lora_pipeline.AmusedLoraLoaderMixin
@@ -100,6 +105,6 @@ To learn more about how to load LoRA weights, see the [LoRA](../../using-diffuse
100105

101106
[[autodoc]] loaders.lora_pipeline.HiDreamImageLoraLoaderMixin
102107

103-
## WanLoraLoaderMixin
108+
## LoraBaseMixin
104109

105-
[[autodoc]] loaders.lora_pipeline.WanLoraLoaderMixin
110+
[[autodoc]] loaders.lora_base.LoraBaseMixin
Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,30 @@
1+
<!-- Copyright 2024 The HuggingFace Team. All rights reserved.
2+
3+
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
4+
the License. You may obtain a copy of the License at
5+
6+
http://www.apache.org/licenses/LICENSE-2.0
7+
8+
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
9+
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
10+
specific language governing permissions and limitations under the License. -->
11+
12+
# SkyReelsV2Transformer3DModel
13+
14+
A Diffusion Transformer model for 3D video-like data was introduced in [SkyReels-V2](https://github.com/SkyworkAI/SkyReels-V2) by the Skywork AI.
15+
16+
The model can be loaded with the following code snippet.
17+
18+
```python
19+
from diffusers import SkyReelsV2Transformer3DModel
20+
21+
transformer = SkyReelsV2Transformer3DModel.from_pretrained("Skywork/SkyReels-V2-DF-1.3B-540P-Diffusers", subfolder="transformer", torch_dtype=torch.bfloat16)
22+
```
23+
24+
## SkyReelsV2Transformer3DModel
25+
26+
[[autodoc]] SkyReelsV2Transformer3DModel
27+
28+
## Transformer2DModelOutput
29+
30+
[[autodoc]] models.modeling_outputs.Transformer2DModelOutput

docs/source/en/api/pipelines/chroma.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@ import torch
3636
from diffusers import ChromaPipeline
3737

3838
pipe = ChromaPipeline.from_pretrained("lodestones/Chroma", torch_dtype=torch.bfloat16)
39-
pipe.enabe_model_cpu_offload()
39+
pipe.enable_model_cpu_offload()
4040

4141
prompt = [
4242
"A high-fashion close-up portrait of a blonde woman in clear sunglasses. The image uses a bold teal and red color split for dramatic lighting. The background is a simple teal-green. The photo is sharp and well-composed, and is designed for viewing with anaglyph 3D glasses for optimal effect. It looks professionally done."

0 commit comments

Comments
 (0)