README.md: 46 additions & 1 deletion
@@ -13,7 +13,7 @@ Welcome to the **SkyReels V2** repository! Here, you'll find the model weights a
 ## 🔥🔥🔥 News!!
-* May 16, 2025: 🔥 We support video extension and experimental features for start/end frame control.
+* May 16, 2025: 🔥 We support video extension and experimental features for start/end frame control in the diffusion forcing model.
 * Apr 24, 2025: 🔥 We release the 720P models, [SkyReels-V2-DF-14B-720P](https://huggingface.co/Skywork/SkyReels-V2-DF-14B-720P) and [SkyReels-V2-I2V-14B-720P](https://huggingface.co/Skywork/SkyReels-V2-I2V-14B-720P). The former facilitates infinite-length autoregressive video generation, and the latter focuses on Image2Video synthesis.
 * Apr 21, 2025: 👋 We release the inference code and model weights of [SkyReels-V2](https://huggingface.co/collections/Skywork/skyreels-v2-6801b1b93df627d441d0d0d9) Series Models and the video captioning model [SkyCaptioner-V1](https://huggingface.co/Skywork/SkyCaptioner-V1).
 * Apr 3, 2025: 🔥 We also release [SkyReels-A2](https://github.com/SkyworkAI/SkyReels-A2). This is an open-sourced controllable video generation framework capable of assembling arbitrary visual elements.
 > - `--addnoise_condition` helps smooth long video generation by adding some noise to the clean condition. Too much noise can itself cause inconsistency. 20 is the recommended value; you may try larger values, but we recommend not exceeding 50.
 > - Generating a 540P video using the 1.3B model requires approximately 14.7GB peak VRAM, while the same resolution video using the 14B model demands around 51.2GB peak VRAM.
+video extension
+```shell
+model_id=Skywork/SkyReels-V2-DF-14B-540P
+# asynchronous inference
+python3 generate_video_df.py \
+  --model_id ${model_id} \
+  --resolution 540P \
+  --ar_step 0 \
+  --base_num_frames 97 \
+  --num_frames 120 \
+  --overlap_history 17 \
+  --prompt ${prompt} \
+  --addnoise_condition 20 \
+  --offload \
+  --use_ret_steps \
+  --teacache \
+  --teacache_thresh 0.3 \
+  --video_path ${video_path}
+```
+> **Note**:
+> - When performing video extension, pass `--video_path ${video_path}` to specify the video to be extended.
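The command above leaves `${prompt}` and `${video_path}` for the reader to define. The following is a minimal sketch, not part of the repository, of two checks that may be worth doing before launching a long run: confirming that a multi-word prompt is passed as a single argument, and confirming that the clip to extend exists. The prompt text, the `input.mp4` file name, and the helper functions `count_args` and `check_video` are hypothetical and used only for illustration.

```shell
# Hypothetical placeholder values; substitute your own prompt and clip.
prompt="A seaplane taking off from a calm lake at sunrise"
video_path="input.mp4"

# Helper (illustrative only): report how many arguments an expansion produces.
count_args() { echo "$#"; }

# Unquoted, the shell splits the prompt on whitespace into one argument per word.
unquoted=$(count_args --prompt ${prompt})
# Quoted, the prompt stays a single argument following the --prompt flag.
quoted=$(count_args --prompt "${prompt}")

echo "unquoted: ${unquoted} arguments, quoted: ${quoted} arguments"

# Helper (illustrative only): fail early if the clip to extend is missing.
check_video() {
    if [ ! -f "$1" ]; then
        echo "video to extend not found: $1" >&2
        return 1
    fi
}
```

If the quoted and unquoted counts differ, writing `--prompt "${prompt}"` and `--video_path "${video_path}"` in the command keeps each value intact. Calling `check_video "${video_path}" || exit 1` before `python3 generate_video_df.py` would stop the script before any model weights are loaded.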
+
+
+start/end frame control
+```shell
+model_id=Skywork/SkyReels-V2-DF-14B-540P
+# asynchronous inference
+python3 generate_video_df.py \
+  --model_id ${model_id} \
+  --resolution 540P \
+  --ar_step 0 \
+  --base_num_frames 97 \
+  --num_frames 97 \
+  --overlap_history 17 \
+  --prompt ${prompt} \
+  --addnoise_condition 20 \
+  --offload \
+  --use_ret_steps \
+  --teacache \
+  --teacache_thresh 0.3 \
+  --image ${image} \
+  --end_image ${end_image}
+```
+> **Note**:
+> - To control the start and end frames, pass `--image ${image}` to set the start frame and `--end_image ${end_image}` to set the end frame.