Docs: CogVideoX #9578

glide-the · 2024-10-04T11:09:37Z

What does this PR do?

Added CogVideox's Advanced inference and model introduction

sayakpaul · 2024-10-04T11:13:56Z

HuggingFaceDocBuilderDev · 2024-10-04T20:37:15Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

a-r-r-o-w

Nice to have this! Redirecting to @stevhliu for a deeper review.

Instead of uploading the gif/png here, could you open a PR to https://huggingface.co/datasets/huggingface/documentation-images/tree/main/diffusers, which I will merge so we can link it here. We don't keep images/videos in this repository otherwise it can get quite bulky to clone

…n-images/discussions/371

glide-the · 2024-10-07T07:34:13Z

Nice to have this! Redirecting to @stevhliu for a deeper review.

Instead of uploading the gif/png here, could you open a PR to https://huggingface.co/datasets/huggingface/documentation-images/tree/main/diffusers, which I will merge so we can link it here. We don't keep images/videos in this repository otherwise it can get quite bulky to clone

image move in https://huggingface.co/datasets/huggingface/documentation-images/discussions/371

stevhliu

Super cool!! I did an initial pass over the docs and will follow up with a more in-depth look soon 🙂

docs/source/en/_toctree.yml

docs/source/en/using-diffusers/text-img2vid.md

stevhliu · 2024-10-09T22:48:21Z

docs/source/en/using-diffusers/cogvideox.md

+specific language governing permissions and limitations under the License.
+-->
+# CogVideoX
+CogVideoX is an open-source version of the video generation model originating from QingYing. The table below displays the list of video generation models we currently offer, along with their foundational information.


It would be nice to briefly describe the technical aspects of CogVideoX so users have a better idea of how it works and what makes it different from other models (check out the Stable Diffusion XL doc as an example).

Maybe something like (feel free to copy/reuse in the training doc as well):

CogVideoX is a text-to-video generation model focused on creating more coherent videos aligned with a prompt. It achieves this using several methods.

a 3D variational autoencoder that compresses videos spatially and temporally, improving compression rate and video accuracy.

an expert transformer block to help align text and video, and a 3D full attention module for capturing and creating spatially and temporally accurate videos.

docs/source/en/training/cogvideox.md

stevhliu · 2024-10-09T23:10:46Z

docs/source/en/training/cogvideox.md

+> [!TIP]
+> You can pass `--use_8bit_adam` to reduce the memory requirements of training.
+
+> [!IMPORTANT]


This should also just be plain text rather than a callout.

Co-authored-by: Steven Liu <[email protected]>

stevhliu

Cool, thanks so much for iterating! Just a few more comments and then we can merge 🙂

stevhliu · 2024-10-11T17:49:25Z

docs/source/en/using-diffusers/cogvideox.md

+specific language governing permissions and limitations under the License.
+-->
+# CogVideoX
+CogVideoX is an open-source version of the video generation model originating from QingYing. The table below displays the list of video generation models we currently offer, along with their foundational information.


Maybe something like (feel free to copy/reuse in the training doc as well):

CogVideoX is a text-to-video generation model focused on creating more coherent videos aligned with a prompt. It achieves this using several methods.

a 3D variational autoencoder that compresses videos spatially and temporally, improving compression rate and video accuracy.

an expert transformer block to help align text and video, and a 3D full attention module for capturing and creating spatially and temporally accurate videos.

stevhliu · 2024-10-11T19:05:35Z

docs/source/en/training/cogvideox.md

+-->
+# CogVideoX
+
+🤗 Diffusers framework is huggface's open source solution related to diffusion model. Through module tools, it can be conveniently and quickly integrated with custom frameworks. In the direction of model training, Diffusers has accelerate acceleration support and is compatible with common reasoning frameworks.


Replace this paragraph with the suggestion (or something like that) from using-diffusers/cogvideox.md since users coming to Diffusers are probably already familiar with it. They want to know more about CogVideoX :)

docs/source/en/training/cogvideox.md

Co-authored-by: Steven Liu <[email protected]>

… methods

yiyixuxu · 2024-10-15T03:26:31Z

@stevhliu is this good to merge now?

stevhliu · 2024-10-15T17:19:07Z

Yeah looks good now. Thanks for iterating and improving on the docs @glide-the! 🤗

* CogVideoX docs --------- Co-authored-by: Steven Liu <[email protected]> Co-authored-by: YiYi Xu <[email protected]>

CogVideoX docs

c70d203

a-r-r-o-w reviewed Oct 5, 2024

View reviewed changes

a-r-r-o-w requested a review from stevhliu October 5, 2024 20:14

mv images to https://huggingface.co/datasets/huggingface/documentatio…

3b8bea2

…n-images/discussions/371

stevhliu reviewed Oct 9, 2024

View reviewed changes

glide-the and others added 19 commits October 11, 2024 21:59

Update docs/source/en/_toctree.yml

58b6157

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/using-diffusers/text-img2vid.md

7c621b7

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/using-diffusers/text-img2vid.md

1040fe2

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/using-diffusers/text-img2vid.md

b681aa5

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/training/cogvideox.md

aeb52ed

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/training/cogvideox.md

6731754

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/training/cogvideox.md

1ed46ff

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/training/cogvideox.md

96d673f

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/training/cogvideox.md

0159b43

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/training/cogvideox.md

087fa97

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/training/cogvideox.md

e8b377e

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/training/cogvideox.md

72aebcf

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/using-diffusers/cogvideox.md

4fd19f6

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/using-diffusers/cogvideox.md

d8a9a8f

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/using-diffusers/cogvideox.md

6107be1

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/using-diffusers/cogvideox.md

a940038

Co-authored-by: Steven Liu <[email protected]>

Update CogVideoX training documentation

b0d4146

Reduce memory usage and update training documentation

23232f7

update cogvideoxmd

ab169be

stevhliu reviewed Oct 11, 2024

View reviewed changes

glide-the and others added 2 commits October 13, 2024 16:33

Update docs/source/en/training/cogvideox.md

1034de0

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/training/cogvideox.md

0c31092

Co-authored-by: Steven Liu <[email protected]>

glide-the and others added 4 commits October 13, 2024 16:34

Update docs/source/en/training/cogvideox.md

7149a16

Co-authored-by: Steven Liu <[email protected]>

Update docs/source/en/training/cogvideox.md

4b10b0c

Co-authored-by: Steven Liu <[email protected]>

Update CogVideoX documentation with improved text-to-video generation…

4badd47

… methods

Merge branch 'main' into doc_cogvideox

e454c95

yiyixuxu merged commit 0d935df into huggingface:main Oct 16, 2024
1 check passed

sayakpaul pushed a commit that referenced this pull request Dec 23, 2024

Docs: CogVideoX (#9578)

1b7000c

* CogVideoX docs --------- Co-authored-by: Steven Liu <[email protected]> Co-authored-by: YiYi Xu <[email protected]>

Docs: CogVideoX #9578

Docs: CogVideoX #9578

Conversation

glide-the commented Oct 4, 2024

What does this PR do?

Uh oh!

sayakpaul commented Oct 4, 2024

Uh oh!

HuggingFaceDocBuilderDev commented Oct 4, 2024

Uh oh!

a-r-r-o-w left a comment

Choose a reason for hiding this comment

Uh oh!

glide-the commented Oct 7, 2024

Uh oh!

stevhliu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

stevhliu Oct 9, 2024

Choose a reason for hiding this comment

Uh oh!

stevhliu Oct 11, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

stevhliu Oct 9, 2024

Choose a reason for hiding this comment

Uh oh!

stevhliu left a comment

Choose a reason for hiding this comment

Uh oh!

stevhliu Oct 11, 2024

Choose a reason for hiding this comment

Uh oh!

stevhliu Oct 11, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yiyixuxu commented Oct 15, 2024

Uh oh!

stevhliu commented Oct 15, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants