-
Notifications
You must be signed in to change notification settings - Fork 2
Dit unit tests #68
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dit unit tests #68
Conversation
sajadn
commented
Nov 20, 2025
- add unit tests
- very minor cleanings
- add missing iter to the DiffusionDataModuleConfig
Signed-off-by: sajadn <[email protected]>
Signed-off-by: Sajad Norouzi <[email protected]>
Signed-off-by: Sajad Norouzi <[email protected]>
Signed-off-by: sajadn <[email protected]>
4b75aa7 to
2f46b43
Compare
Signed-off-by: sajadn <[email protected]>
Signed-off-by: sajadn <[email protected]>
|
/ok to test fa1b884 |
Signed-off-by: sajadn <[email protected]>
|
/ok to test c376224 |
Signed-off-by: sajadn <[email protected]>
c376224 to
644970d
Compare
Signed-off-by: sajadn <[email protected]>
|
/ok to test 3e93c2a |
abhinavg4
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good. Let;s merge once tets pass
| "output_params": ["-f", "mp4"], | ||
| } | ||
|
|
||
| print("video_save_path", video_save_path) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good cleanup! Removing debug print statements keeps the output clean in production.
| def build_datasets(self, context: DatasetBuildContext): | ||
| return self.dataset.train_dataloader(), self.dataset.val_dataloader(), self.dataset.test_dataloader() | ||
| return ( | ||
| iter(self.dataset.train_dataloader()), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice improvement! Wrapping dataloaders with iter() makes the interface more explicit and reduces potential confusion in downstream usage.
|
/ok to test f859fbb |
huvunvidia
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Left a few comments.
| DiTModelProvider: Configuration for the DiT-S model. | ||
| """ | ||
| return DiTModelProvider( | ||
| return DiTXLModelProvider( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nit: In docstring it notes "DiT-S" but we provide DiTXLModelProvider.
| "mediapy>=1.2.4", | ||
| "megatron-bridge", | ||
| "wandb[media]>=0.23.0", | ||
| ] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just to make sure, are mediapy and wandb included in OSRB?
Any dependency needs to be approved for open-source.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
oops, this is a great question. I'm not sure about that.
|
/ok to test 99ada83 |
abhinavg4
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LOkks good
|
Approved from Automation perspective |
|
/ok to test f007a7f |
* edm and data preprocess tests. Signed-off-by: sajadn <[email protected]> * Minor cleanings for DiT. Signed-off-by: Sajad Norouzi <[email protected]> * add dit unit test. Signed-off-by: Sajad Norouzi <[email protected]> * add iter to the DiffusionDataModule. Signed-off-by: sajadn <[email protected]> * add missing copyright. Signed-off-by: sajadn <[email protected]> * use 'no caption' if caption is not present. Signed-off-by: sajadn <[email protected]> * fix dit inference bug. Add wanbd to inference code. Signed-off-by: sajadn <[email protected]> * update the DiT configs to be aligned with the original paper. Signed-off-by: sajadn <[email protected]> * add wandb[video] and mediapy to uv. Signed-off-by: sajadn <[email protected]> * adjust pos_ids in mock_dataset to have batch dimension, fuse adaLN layers, use DiTSelfAttention. Signed-off-by: sajadn <[email protected]> * fix the diffusion sample size bug. Signed-off-by: sajadn <[email protected]> * fix broken tests. Signed-off-by: sajadn <[email protected]> --------- Signed-off-by: sajadn <[email protected]> Signed-off-by: Sajad Norouzi <[email protected]> Co-authored-by: Abhinav Garg <[email protected]> Signed-off-by: Lawrence Lane <[email protected]>