-
Notifications
You must be signed in to change notification settings - Fork 16
Open
Description
Immediate To-Dos:
Improve the Multilora PEFT class extension code ( @sumo already has an implementation and will push it shortly)
Gating needs to be standardized to enable flexible switching of expert adapters from a larger db of adapters (likely through centroid/similarity measures)
UI to run MoE inference and Base Model inference side by side (w streaming and display of selected experts during inference)
simplifying the process of Finetuning new experts and adding them to the MoE arch
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels
Type
Projects
Status
😿Todo