examples/research_projects/sana/README.md
5 additions & 5 deletions
@@ -1,11 +1,11 @@
# Training SANA Sprint Diffuser
-This README explains how to use the provided Bash script commands to download a pre-trained teacher diffuser model and train it on a specific dataset.
+This README explains how to use the provided bash script commands to download a pre-trained teacher diffuser model and train it on a specific dataset, following the [SANA Sprint methodology](https://arxiv.org/abs/2503.09641).
## Setup
-### 1. Define Your Local Paths
+### 1. Define the local paths
Set a variable for your desired output directory. This directory will store the downloaded model and the training checkpoints/results.
@@ -14,7 +14,7 @@ your_local_path='output' # Or any other path you prefer
mkdir -p $your_local_path # Create the directory if it doesn't exist
```
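
A slightly more defensive variant of this step is sketched below. It quotes the variable so that paths containing spaces also work, then checks that the directory exists. The `your_local_path` name comes from this README; the confirmation message is purely illustrative and not part of the original script.

```shell
# Output directory for the downloaded model and training checkpoints.
your_local_path='output'  # Or any other path you prefer

# Quoting keeps paths with spaces intact; -p makes this safe to re-run.
mkdir -p "$your_local_path"

# Illustrative sanity check (an addition, not from the original script).
if [ -d "$your_local_path" ]; then
  echo "Using output directory: $your_local_path"
fi
```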
-### 2. Download the Pre-trained Model
+### 2. Download the pre-trained model
Download the SANA Sprint teacher model from Hugging Face Hub. The script uses the 1.6B parameter model.
*(Optional: You can also download the 0.6B model by replacing the model name: `Efficient-Large-Model/Sana_Sprint_0.6B_1024px_teacher_diffusers`)*
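
One way to make switching between the two checkpoints easier is to keep the repository name in a single variable. The sketch below is only an illustration under stated assumptions: `model_id` and `model_dir` are hypothetical names, and storing the model in a subfolder named after the repository is an assumed layout. Only the 0.6B repository name is taken from the note above. The `huggingface-cli download` call is left commented out because it needs network access and an installed `huggingface_hub` CLI.

```shell
your_local_path='output'

# Repository name as given in the optional note above (0.6B variant).
model_id='Efficient-Large-Model/Sana_Sprint_0.6B_1024px_teacher_diffusers'

# Assumed layout: a subfolder named after the repository
# (the parameter expansion strips everything up to the last "/").
model_dir="$your_local_path/${model_id##*/}"
echo "Model will be stored in: $model_dir"

# Uncomment to download:
# huggingface-cli download "$model_id" --local-dir "$model_dir"
```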
-### 3. Acquire the Dataset Shards
+### 3. Acquire the dataset shards
The training script in this example uses specific `.parquet` shards from the `brivangl/midjourney-v6-llava` dataset instead of downloading the entire dataset automatically via `dataset_name`.
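
Fetching only selected shards, rather than the whole dataset, could be done with the Hub CLI's file filter. The sketch below rests on assumptions: `shard_pattern` and `shard_dir` are hypothetical placeholders, not the shard names or paths the training script expects, so substitute the actual values. The command is assembled as a string and printed so it can be inspected before anything is downloaded.

```shell
# Dataset named in this README.
dataset_id='brivangl/midjourney-v6-llava'

# Hypothetical values: narrow the glob to the shard files you need and
# point the target directory at your own layout.
shard_pattern='*.parquet'
shard_dir='output/midjourney-v6-llava'

# Assemble the command as a string so it can be reviewed first.
download_cmd="huggingface-cli download $dataset_id --repo-type dataset --include '$shard_pattern' --local-dir $shard_dir"
echo "$download_cmd"

# To run it for real (needs network access): eval "$download_cmd"
```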