Replies: 1 comment
-
Can you give more information about what you tried? What model were you using? This error is coming from onnxruntime, vs genai itself. It appears the model expects input_ids to be 4096 in size and 6 is passed in. Is this expecting input_ids to be padded out to 4096? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I was looking for libraries that can prune in addition to quantizing and it appears that SparseML can do that. However, I tried taking the onnx weights exported by the library and loading it to genai. I keep getting errors like the one below. I'm guessing that maybe pruning isn't supported by this library yet?
Beta Was this translation helpful? Give feedback.
All reactions