addressing issue #5, SD3 training#11
addressing issue #5, SD3 training#11sarahahtee wants to merge 7 commits intosign-language-processing:mainfrom
Conversation
merge branches
issue seems to stem from the load_dataset() func that the SD3 training script uses. -possible that the issue lies in the process_train() func of the training script. tried passing both image path strings and PIL image objects |
|
debugging output - |
|
I suggest you debug directly in this script: https://github.com/huggingface/diffusers/blob/main/examples/controlnet/train_controlnet_sd3.py#L644-L649 Specifically, where marked, you print the directories, and check they exist, and print the dataset, and print the error, etc. If there is an error loading the dataset, but the directory is correct, make sure the dataset is generated correctly in the |
|
here's the edited function (train_controlnet_sd3.py): edited function (train_controlnet_sd3.py): dataset generation check (dataset.py): dataset generation check (dataset.py) output :( |
|
Great. You see that the columns are:
No |
|
i have also tried that without success -
|
|
The ['_data_files', '_fingerprint', '_format_columns', '_format_kwargs', '_format_type', '_output_all_columns', '_split']Maybe, the path to the dataset you are creating and loading is different? Either the path is wrong, or the specific method to load the dataset is wrong. Have you tried |
|
yes i've tried load_from_disk as well as verified the path and env variables. although not ideal, adding load_from_disk to the training script partially solves the issue of loading the data.
this is odd because the dataset creation already handles 3 channel RGB (here for res. 512). i have also checked so the channel mismatch issue lies with ControlNet initialization for SD3 specifically. I'm looking into the control for sd3 from diffusers more closely atm |
|
Alright. It is to be expected that examples on their repo should work directly |
|
Done! (Issue) |

I'm attempting to use the existing training setup with Stable Diffusion 3. Here's the history of errors encountered, please let me know how to proceed with error 4 -
Error 1:
TypeError: argument of type 'PosixPath' is not iterableSolution 1:
Added str conversion in dataset.py
output_path = str(output_path) # after mkdirError 2:
train_controlnet_sd3.py: error: unrecognized arguments: --enable_xformers_memory_efficient_attentionSolution 2:
Removed arg
Error 3:
Solution 3: Modified controlnet_SD3.py to add additional parameters
--dual_attention_layersand--qk_norm.Error 4:
P.S. i did try to include only the relevant files for this branch comparison but even with a local reset, file changes from other open PRs still show up in this request.