Projecting a QLoRa adapter to original model space #697
-
Given a base model and a QLoRa adapter, is it possible to create a new model of the same size as the base model representing the weights with the QLoRa perturbation applied to it? During training QLoRa is extremely helpful, but during inference I would like to work with a single checkpoint representing the whole model, rather than with a checkpoint and an adapter. That way, I'll be able to use the huggingface inference API. It will also be nice to keep things as simple as possible. So what I'm looking for is code to load a base model from a checkpoint, load a QLoRa adapter, output a new model checkpoint representing the weights of the base model with the QLoRa perturbation applied. Is this possible? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 5 replies
-
Hey, could you check if |
Beta Was this translation helpful? Give feedback.
Hey, could you check if
merge_and_unload
solves your problem?