I am trying to fine-tune the Wan 2.1 model and would like to use Context Parallelism (CP) to manage memory and scale training. The main README lists CP support as a key feature.
I have looked through the examples/training directory and the documentation, but I couldn't find a specific example or launch script demonstrating how to fine-tune the Wan model with Context Parallelism enabled.
Could you please provide some guidance, or a minimal example, showing how to configure a Wan 2.1 training job with Context Parallelism enabled?