alora issues

For hf models, setting the adapter causes generation requests to the base model to be modified and sets `requires_grad=True`. We need to:
- unset the adapter after calling the alora
- disable requires_grad (I believe)
- implement a lock for huggingface aloras so that async non-alora generation requests don't accidentally use the alora
- have aloras use model_options?

We should also make sure that the VLLM server doesn't have similar issues with the active adapter needing to be set/unset or managed between requests.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

alora issues #162

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

alora issues #162

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions