Skip to content

Commit 5951f8f

Browse files
authored
Apply suggestions from code review
1 parent 04e1b02 commit 5951f8f

File tree

1 file changed

+5
-5
lines changed

1 file changed

+5
-5
lines changed

examples/research_projects/sana/README.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,11 @@
11
# Training SANA Sprint Diffuser
22

3-
This README explains how to use the provided Bash script commands to download a pre-trained teacher diffuser model and train it on a specific dataset.
3+
This README explains how to use the provided bash script commands to download a pre-trained teacher diffuser model and train it on a specific dataset, following the [SANA Sprint methodology](https://arxiv.org/abs/2503.09641).
44

55

66
## Setup
77

8-
### 1. Define Your Local Paths
8+
### 1. Define the local paths
99

1010
Set a variable for your desired output directory. This directory will store the downloaded model and the training checkpoints/results.
1111

@@ -14,7 +14,7 @@ your_local_path='output' # Or any other path you prefer
1414
mkdir -p $your_local_path # Create the directory if it doesn't exist
1515
```
1616

17-
### 2. Download the Pre-trained Model
17+
### 2. Download the pre-trained model
1818

1919
Download the SANA Sprint teacher model from Hugging Face Hub. The script uses the 1.6B parameter model.
2020

@@ -24,7 +24,7 @@ huggingface-cli download Efficient-Large-Model/SANA_Sprint_1.6B_1024px_teacher_d
2424

2525
*(Optional: You can also download the 0.6B model by replacing the model name: `Efficient-Large-Model/Sana_Sprint_0.6B_1024px_teacher_diffusers`)*
2626

27-
### 3. Acquire the Dataset Shards
27+
### 3. Acquire the dataset shards
2828

2929
The training script in this example uses specific `.parquet` shards from the `brivangl/midjourney-v6-llava` dataset instead of downloading the entire dataset automatically via `dataset_name`.
3030

@@ -71,7 +71,7 @@ python train_sana_sprint_diffusers.py \
7171
--resume_from_checkpoint="latest" \
7272
```
7373

74-
### Explanation of Parameters
74+
### Explanation of parameters
7575

7676
* `--pretrained_model_name_or_path`: Path to the downloaded pre-trained model directory.
7777
* `--output_dir`: Directory where training logs, checkpoints, and the final model will be saved.

0 commit comments

Comments
 (0)