-
Notifications
You must be signed in to change notification settings - Fork 100
Open
Description
I was previously able to pick what I wanted for any model/quant using the list of returned downloaded models, but after this update, I now only get the default single quantization with no way to see the others.
Ex: I downloaded 3 quants for granite-4-h
lms ls and the python sdk basically return this, with just the first mondel
{
"type": "llm",
"modelKey": "ibm/granite-4-h-tiny",
"format": "gguf",
"displayName": "Granite 4 H Tiny",
"publisher": "ibm",
"path": "ibm/granite-4-h-tiny",
"sizeBytes": 4231029813,
"paramsString": "7B",
"architecture": "granitehybrid",
"quantization": {
"name": "Q4_K_M",
"bits": 4
},
"vision": false,
"trainedForToolUse": true,
"maxContextLength": 1048576
},
How do I programmatically get to know about the other quants and load them?
Metadata
Metadata
Assignees
Labels
No labels