How to calibrate model weights after applying "Accelerating a Hugging Face Llama 3 models with Transformer Engine" #1748
Unanswered
xuanvinh1997 asked this question in Q&A
Replies: 0 comments
After reading the tutorial "Accelerating a Hugging Face Llama 2 and Llama 3 models with Transformer Engine", I now understand how to convert HF/Torch layers to TE layers and fuse them. However, after the conversion the model's output is no longer correct.
My question is: how can I calibrate the converted model without re-training the pretrained model?
Many thanks!