-
Notifications
You must be signed in to change notification settings - Fork 30
Open
Description
Hi,
I am currently working on an LLM project and have fine-tuned a model (e.g., Llama 3) using NVIDIA NeMo, resulting in a .nemo format model. While I can deploy it by exporting to trt-llm, the current version of this repository does not yet support that workflow. I believe there’s an opportunity to extend the project to support that backend version.
I find this project fascinating and would love to contribute by adding compatibility for models served via an API using the .nemo format. If possible, I would be happy to discuss how I can contribute to this effort.
Looking forward to your thoughts.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels