Commit 2bef100
Fix gptoss_from_pretrained to correctly load HuggingFace weights (#4)
* Fix gptoss_from_pretrained to correctly load HuggingFace weights
- Update gptoss_normalize_config to map HF config keys (num_local_experts,
num_experts_per_tok, nested rope_scaling) to internal names
- Rewrite gptoss_hf_weights_remap to:
- Use underscore suffix (_blocks/_scales) for MXFP4 weight detection
- Remap HF parameter names to model parameter names
- Concatenate separate q/k/v projections into combined qkv tensors
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
* Add dotty to Imports
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
* ++
---------
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>1 parent 655d881 commit 2bef100
File tree
8 files changed
+775
-13
lines changed- R
- man
- tests/testthat
8 files changed
+775
-13
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
11 | 11 | | |
12 | 12 | | |
13 | 13 | | |
14 | | - | |
| 14 | + | |
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
19 | 19 | | |
20 | | - | |
| 20 | + | |
21 | 21 | | |
| 22 | + | |
22 | 23 | | |
23 | 24 | | |
24 | 25 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
15 | 18 | | |
16 | 19 | | |
17 | 20 | | |
| |||
0 commit comments