Replies: 2 comments
Update:
So far, my sense is that NeMo is much harder to adapt than other open-source frameworks.
Update: I'm finally getting close, and the result feels within reach... I will distill TildeOpen with NeMo even if it's the last thing I do.
Hi!
I'd like to distill the TildeOpen model (the LlamaForCausalLM class on Hugging Face) using the NeMo framework.
TildeOpen is basically Llama 2/3, except that it also uses YaRN to scale its rotary position embeddings.
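As a first sanity check before any conversion, it may help to confirm how the position-embedding scaling is declared in the model's config.json. The sketch below assumes the usual Hugging Face layout, where a Llama-style config carries a rope_scaling dictionary keyed by rope_type (or type in older configs). The helper name describe_rope_scaling and the example values are hypothetical and are not taken from TildeOpen's actual config.

```python
import json


def describe_rope_scaling(config: dict) -> str:
    """Summarize the RoPE scaling entry of a Hugging Face style config dict."""
    scaling = config.get("rope_scaling")
    if not scaling:
        return "no rope_scaling entry: plain RoPE"
    # Newer configs use the key "rope_type"; older ones use "type".
    kind = scaling.get("rope_type", scaling.get("type", "unknown"))
    factor = scaling.get("factor")
    return f"rope_scaling type={kind}, factor={factor}"


# Hypothetical values for illustration only; NOT TildeOpen's real settings.
example_config = json.loads("""
{
  "architectures": ["LlamaForCausalLM"],
  "rope_scaling": {
    "rope_type": "yarn",
    "factor": 4.0,
    "original_max_position_embeddings": 8192
  }
}
""")

print(describe_rope_scaling(example_config))
```

In practice one would load the real config.json from the model repository with json.load and pass the resulting dict to the same helper. Whatever scaling type and factor it reports are the settings the NeMo side would need to reproduce for the converted model to match the original.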
Where do you propose I should start?
Thanks!
Ingus