Skip to content

Commit 7b77518

Browse files
authored
feat(diffusers/pipelines): add pipelines of skyreels_v2 in diffusers master (#1203)
1 parent 52a0431 commit 7b77518

28 files changed

+6942
-2
lines changed

docs/diffusers/_toctree.yml

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -236,6 +236,8 @@
236236
title: PriorTransformer
237237
- local: api/models/sd3_transformer2d
238238
title: SD3Transformer2DModel
239+
- local: api/models/skyreels_v2_transformer_3d
240+
title: SkyReelsV2Transformer3DModel
239241
- local: api/models/sana_transformer2d
240242
title: SanaTransformer2DModel
241243
- local: api/models/stable_audio_transformer
@@ -430,6 +432,8 @@
430432
title: Semantic Guidance
431433
- local: api/pipelines/shap_e
432434
title: Shap-E
435+
- local: api/pipelines/skyreels_v2
436+
title: SkyReels-V2
433437
- local: api/pipelines/stable_audio
434438
title: Stable Audio
435439
- local: api/pipelines/stable_cascade

docs/diffusers/api/loaders/lora.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,9 @@ LoRA is a fast and lightweight training method that inserts and trains a signifi
2424
- `SanaLoraLoaderMixin` provides similar functions for [Sana](../../api/pipelines/sana.md).
2525
- `HunyuanVideoLoraLoaderMixin` provides similar functions for [HunyuanVideo](../../api/pipelines/hunyuan_video.md).
2626
- `Lumina2LoraLoaderMixin` provides similar functions for [Lumina2](../../api/pipelines/lumina2.md).
27-
- `AmusedLoraLoaderMixin` is for the [`AmusedPipeline`].
27+
- `WanLoraLoaderMixin` provides similar functions for [Wan](../../api/pipelines/wan.md).
28+
- `SkyReelsV2LoraLoaderMixin` provides similar functions for [SkyReels-V2](../../api/pipelines/skyreels_v2.md).
29+
- `AmusedLoraLoaderMixin` is for the [AmusedPipeline](../../api/pipelines/amused.md).
2830
- `LoraBaseMixin` provides a base class with several utility methods to fuse, unfuse, unload, LoRAs and more.
2931

3032
!!! tip
@@ -52,6 +54,10 @@ LoRA is a fast and lightweight training method that inserts and trains a signifi
5254

5355
::: mindone.diffusers.loaders.lora_pipeline.Lumina2LoraLoaderMixin
5456

57+
::: mindone.diffusers.loaders.lora_pipeline.WanLoraLoaderMixin
58+
59+
::: mindone.diffusers.loaders.lora_pipeline.SkyReelsV2LoraLoaderMixin
60+
5561
::: mindone.diffusers.loaders.lora_pipeline.AmusedLoraLoaderMixin
5662

5763
::: mindone.diffusers.loaders.lora_base.LoraBaseMixin
Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,26 @@
1+
<!-- Copyright 2024 The HuggingFace Team. All rights reserved.
2+
3+
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
4+
the License. You may obtain a copy of the License at
5+
6+
http://www.apache.org/licenses/LICENSE-2.0
7+
8+
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
9+
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
10+
specific language governing permissions and limitations under the License. -->
11+
12+
# SkyReelsV2Transformer3DModel
13+
14+
A Diffusion Transformer model for 3D video-like data was introduced in [SkyReels-V2](https://github.com/SkyworkAI/SkyReels-V2) by the Skywork AI.
15+
16+
The model can be loaded with the following code snippet.
17+
18+
```python
19+
from mindone.diffusers import SkyReelsV2Transformer3DModel
20+
21+
transformer = SkyReelsV2Transformer3DModel.from_pretrained("Skywork/SkyReels-V2-DF-1.3B-540P-Diffusers", subfolder="transformer", mindspore_dtype=ms.bfloat16)
22+
```
23+
24+
::: mindone.diffusers.SkyReelsV2Transformer3DModel
25+
26+
::: mindone.diffusers.models.modeling_outputs.Transformer2DModelOutput

docs/diffusers/api/pipelines/skyreels_v2.md

Lines changed: 303 additions & 0 deletions
Large diffs are not rendered by default.

mindone/diffusers/__init__.py

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -73,6 +73,7 @@
7373
"SD3ControlNetModel",
7474
"SD3MultiControlNetModel",
7575
"SD3Transformer2DModel",
76+
"SkyReelsV2Transformer3DModel",
7677
"SparseControlNetModel",
7778
"StableAudioDiTModel",
7879
"StableCascadeUNet",
@@ -229,6 +230,11 @@
229230
"SemanticStableDiffusionPipeline",
230231
"ShapEImg2ImgPipeline",
231232
"ShapEPipeline",
233+
"SkyReelsV2DiffusionForcingImageToVideoPipeline",
234+
"SkyReelsV2DiffusionForcingPipeline",
235+
"SkyReelsV2DiffusionForcingVideoToVideoPipeline",
236+
"SkyReelsV2ImageToVideoPipeline",
237+
"SkyReelsV2Pipeline",
232238
"StableAudioPipeline",
233239
"StableAudioProjectionModel",
234240
"StableCascadeCombinedPipeline",
@@ -412,6 +418,7 @@
412418
SD3ControlNetModel,
413419
SD3MultiControlNetModel,
414420
SD3Transformer2DModel,
421+
SkyReelsV2Transformer3DModel,
415422
SparseControlNetModel,
416423
StableAudioDiTModel,
417424
StableCascadeUNet,
@@ -567,6 +574,11 @@
567574
SemanticStableDiffusionPipeline,
568575
ShapEImg2ImgPipeline,
569576
ShapEPipeline,
577+
SkyReelsV2DiffusionForcingImageToVideoPipeline,
578+
SkyReelsV2DiffusionForcingPipeline,
579+
SkyReelsV2DiffusionForcingVideoToVideoPipeline,
580+
SkyReelsV2ImageToVideoPipeline,
581+
SkyReelsV2Pipeline,
570582
StableAudioPipeline,
571583
StableAudioProjectionModel,
572584
StableCascadeCombinedPipeline,

mindone/diffusers/loaders/__init__.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -76,6 +76,7 @@ def text_encoder_attn_modules(text_encoder):
7676
"Lumina2LoraLoaderMixin",
7777
"WanLoraLoaderMixin",
7878
"HiDreamImageLoraLoaderMixin",
79+
"SkyReelsV2LoraLoaderMixin",
7980
],
8081
"peft": ["PeftAdapterMixin"],
8182
"single_file": ["FromSingleFileMixin"],
@@ -100,6 +101,7 @@ def text_encoder_attn_modules(text_encoder):
100101
Mochi1LoraLoaderMixin,
101102
SanaLoraLoaderMixin,
102103
SD3LoraLoaderMixin,
104+
SkyReelsV2LoraLoaderMixin,
103105
StableDiffusionLoraLoaderMixin,
104106
StableDiffusionXLLoraLoaderMixin,
105107
WanLoraLoaderMixin,

0 commit comments

Comments
 (0)