1 parent 79e1063 commit f326e97
main_training_mamba.py
@@ -86,7 +86,7 @@ def main(**kwargs):
         auto_wrap_policy=wrapping_policy,
         mixed_precision=mixed_precision_policy,
         sharding_strategy=sharding_strategy_policy,
-        use_orig_params=cfg.use_torch_compile,
+        use_orig_params=True,
         device_id=torch.cuda.current_device(),
         limit_all_gathers=True,
         param_init_fn=param_init_fn,