Fine-Tuning MAISI for Head/Neck CT Always Falls Back to Chest Anatomy #2022
Mudassar-MLE asked this question in Q&A
Replies: 2 comments 3 replies
It looks like you were using maisi-ddpm? It is a known issue that this version does not perform well on head and neck anatomy. Have you tried maisi-rflow?
@Can-Zhao I have a related question. For the rFlow checkpoints, is it possible to constrain the model to generate only a specific body region, for example chest CT? I couldn't find where this can be configured. (In my case, I am using only the diffusion model, not ControlNet, so I am not going through the LDMSampler function.)
Hi,
First, thank you to the MAISI team for the incredible work. I am currently exploring MAISI for neck CT, but I'm running into several challenges. Few anatomies seem to be supported for head/neck CT. Even the existing models are unable to generate meaningful images of the brain, head, or neck, although the documentation mentions that index "1" should support head-neck CT.
I tried fine-tuning ControlNet following the provided instructions, adding around 40 new labels with 5–10 cases. However, the model did not converge. After I experimented with different parameters and numbers of epochs, it showed slight convergence, but the outputs still mostly resembled chest anatomy with only a rough neck contour.

In addition, I reviewed the configs/image_median_statistics.json file in the HuggingFace repo and couldn't find anything related to neck anatomy. Dataset preparation is not an issue on my side: I am generating embeddings through the MAISI scripts, and the masks have been fully verified.
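For anyone who wants to repeat that check, here is a minimal sketch of how such a search could be done. It assumes only that the file is valid JSON; the actual schema is not confirmed here, so the helper names (`find_matching_keys`, `search_stats_file`) and the keyword list are hypothetical and the search simply walks every nested key.

```python
import json
from pathlib import Path


def find_matching_keys(obj, keywords, prefix=""):
    """Recursively collect key paths whose key name contains any keyword."""
    matches = []
    if isinstance(obj, dict):
        for key, value in obj.items():
            path = f"{prefix}/{key}" if prefix else str(key)
            # Case-insensitive substring match on the key name only.
            if any(kw in str(key).lower() for kw in keywords):
                matches.append(path)
            matches.extend(find_matching_keys(value, keywords, path))
    elif isinstance(obj, list):
        for i, item in enumerate(obj):
            matches.extend(find_matching_keys(item, keywords, f"{prefix}[{i}]"))
    return matches


def search_stats_file(path, keywords=("neck", "head", "brain")):
    """Load a JSON file and return all key paths that mention a keyword."""
    with Path(path).open() as f:
        stats = json.load(f)
    return find_matching_keys(stats, [kw.lower() for kw in keywords])


if __name__ == "__main__":
    # Path taken from the post; adjust to wherever the repo was downloaded.
    stats_path = Path("configs/image_median_statistics.json")
    if stats_path.exists():
        print(search_stats_file(stats_path))
```

An empty list would only mean that no key name contains those keywords; it says nothing about whether the checkpoints were trained on head/neck data.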
This leaves me confused: do I need to train ControlNet from scratch (and potentially every other model from scratch), or is MAISI simply not designed to handle neck scenarios? Also, what would be a good dataset size to train on? In community discussions, I noticed that results were inconsistent: some people had issues even with 300 cases, while others reported good results with ~700 cases.
Any guidance or advice on this would be really appreciated!
@guopengf @Can-Zhao