@tigerinus, no, you need only one of the two. I would say optimum-intel is the recommended way to do it for HF models, while the patch is more of an example demonstrating how the integration can be done directly.
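A minimal sketch of the optimum-intel route, assuming a recent optimum-intel release installed with its OpenVINO and NNCF extras. The function name `load_int8_model` and the `gpt2` model id are placeholders chosen for illustration, and 8-bit weight-only compression is just one of the options the library offers.

```python
def load_int8_model(model_id: str = "gpt2"):
    """Export a Hugging Face checkpoint to OpenVINO with 8-bit weight compression.

    `model_id` is a placeholder; any causal LM checkpoint supported by
    optimum-intel should work the same way (assumption).
    """
    # Imported lazily so that defining this function does not require
    # optimum-intel to be installed.
    from optimum.intel import OVModelForCausalLM, OVWeightQuantizationConfig

    # Weight-only 8-bit compression; optimum-intel relies on NNCF for this step,
    # so the transformers patch is not needed on this route.
    quantization_config = OVWeightQuantizationConfig(bits=8)

    # export=True converts the PyTorch checkpoint to OpenVINO IR on the fly.
    return OVModelForCausalLM.from_pretrained(
        model_id,
        export=True,
        quantization_config=quantization_config,
    )
```

Calling `load_int8_model()` downloads the checkpoint, so it needs network access; the returned model can then be saved with `save_pretrained` like any other optimum model.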
-
I read that using NNCF with transformers requires a patch, as described in https://github.com/openvinotoolkit/nncf/blob/develop/third_party_integration/huggingface_transformers/README.md
But it also seems that NNCF is already used in optimum-intel, as described in https://huggingface.co/docs/optimum/intel/optimization_ov
Does that mean that, to use NNCF, I need to both apply the patch and use the optimum-intel library?
Thanks.