Commit 52310f6
Clean up fp8 / fp4 recipe handling (#1504)
Some misc. cleanup of #1484
<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit
# Release Notes
* **New Features**
  * Added FP8/FP4 low-precision quantization support for ESM2, Llama3, Mixtral, and Qwen models with per-layer precision control
  * Added quantized model initialization option for faster FP8 deployments
  * Introduced per-layer autocast context management for flexible precision configuration
* **Documentation**
  * Added comprehensive "Running with Low Precision (FP8/FP4)" guides for ESM2 and Llama3 models
* **Tests**
  * Enhanced quantization tests with per-layer precision validation and legacy FP8 pathway coverage
  * Removed redundant quantization test module
* **Chores**
  * Refactored model initialization flows for quantization recipe handling
  * Updated configuration paths across training scripts
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
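The "per-layer autocast context" idea in the notes above can be illustrated with a minimal, library-free sketch. Everything here is an assumption made for illustration: `fake_autocast`, `layer_contexts`, `run_layers`, and the `fp8_layers` / `fp4_layers` parameters are hypothetical names, not the code in this commit and not a real quantization library's API. The sketch only shows the general pattern of choosing one context manager per layer and falling back to a no-op context for layers left at default precision.

```python
from contextlib import contextmanager, nullcontext

# Hypothetical stand-in for a real low-precision autocast context.
# It only records which precision is currently active.
active_precision = []


@contextmanager
def fake_autocast(precision):
    active_precision.append(precision)
    try:
        yield
    finally:
        active_precision.pop()


def layer_contexts(num_layers, fp8_layers=(), fp4_layers=()):
    """Return one context-manager factory per layer (hypothetical helper)."""
    overlap = set(fp8_layers) & set(fp4_layers)
    if overlap:
        raise ValueError(f"layers assigned to both FP8 and FP4: {sorted(overlap)}")
    factories = []
    for i in range(num_layers):
        if i in fp8_layers:
            factories.append(lambda: fake_autocast("fp8"))
        elif i in fp4_layers:
            factories.append(lambda: fake_autocast("fp4"))
        else:
            # Layers not listed run without any low-precision context.
            factories.append(nullcontext)
    return factories


def run_layers(num_layers, **kwargs):
    """Enter each layer's context in turn and report the precision seen inside it."""
    seen = []
    for make_ctx in layer_contexts(num_layers, **kwargs):
        with make_ctx():
            seen.append(active_precision[-1] if active_precision else "default")
    return seen
```

In a real model, the body of the `with` block would be the layer's forward call, and the stand-in context would be replaced by whatever autocast context the chosen quantization library provides.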
---------
Signed-off-by: Peter St. John <pstjohn@nvidia.com>

1 parent afcb80e commit 52310f6
File tree
50 files changed, +2423 -2128 lines changed
- bionemo-recipes
  - models
    - esm2
      - tests
        - common
    - llama3
      - tests/common
    - mixtral
      - tests/common
    - qwen
      - tests/common
  - recipes
    - esm2_accelerate_te/example_8m_checkpoint
    - esm2_native_te
      - example_8m_checkpoint
      - hydra_config
      - model_configs/nvidia/esm2_t6_8M_UR50D
      - tests
      - tokenizers/esm2
    - esm2_peft_te/example_8m_checkpoint
    - llama3_native_te
      - hydra_config
      - tests
- ci/scripts