Replies: 1 comment
The paper for SOLAR 10.7B indirectly provides a formula for upscaling from 7B (or 7.2B) to 10.7B (or 11B).
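As a rough sketch of that formula: the SOLAR paper's depth up-scaling duplicates an n-layer base model, drops the last m layers from one copy and the first m layers from the other, and stacks the two, giving s = 2(n - m) layers (n = 32, m = 8, s = 48 for SOLAR 10.7B). The function names below are my own, not part of any library, and the ranges are half-open `[start, end)` layer indices.

```python
def depth_upscale_slices(n_layers: int, m_drop: int) -> list[tuple[int, int]]:
    """Return half-open [start, end) layer ranges for the two stacked copies.

    Hypothetical helper: copy 1 keeps the first n - m layers,
    copy 2 keeps the last n - m layers.
    """
    if not 0 <= m_drop < n_layers:
        raise ValueError("m_drop must satisfy 0 <= m_drop < n_layers")
    return [(0, n_layers - m_drop), (m_drop, n_layers)]


def upscaled_depth(n_layers: int, m_drop: int) -> int:
    """Total layer count after stacking the two slices: s = 2 * (n - m)."""
    return sum(end - start for start, end in depth_upscale_slices(n_layers, m_drop))


# SOLAR 10.7B setting from the paper: n = 32, m = 8.
print(depth_upscale_slices(32, 8))  # [(0, 24), (8, 32)]
print(upscaled_depth(32, 8))        # 48
```

For another base model, take `n_layers` from `num_hidden_layers` in its `config.json` and pick `m_drop` yourself; for several passes, feed the resulting depth back in as the new `n_layers`. Note that the parameter count does not grow in proportion to depth, since the embeddings are not duplicated, and that SOLAR continued pretraining after the merge to recover performance.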
How to create a model for fine-tuning through multiple passthrough merges?
I want to create a 5B model by merging Qwen2.5 1.5B with itself. Can anyone guide me?