Hello everyone!
I am a little confused about which backend to use to run GRPO on Llama 4 Scout. I have been using FSDP and vLLM with gpt-oss, but I have seen that the Megatron backend has good support for MoE models. Given that I need to configure my Docker container for a specific inference framework (vLLM or SGLang), can someone give me a few pointers on the recommended setup for getting the best performance with Llama 4 Scout?
Thanks a lot for the great framework and the help!