Skip to content

Commit e3042d8

Browse files
committed
upload llada-mini
1 parent 5e20980 commit e3042d8

File tree

2 files changed

+15
-2
lines changed

2 files changed

+15
-2
lines changed

docs/models.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -122,7 +122,7 @@
122122
* [x] v2: [Ling-mini-2.0](https://huggingface.co/inclusionAI/Ling-mini-2.0/tree/56c261e07b78d95dad61336fcbdb21ef4fdbcabe), [Ring-mini-2.0](https://huggingface.co/inclusionAI/Ring-mini-2.0/tree/d4eac003b34b59b733f05039a876616d840a37d6)
123123

124124
* LLaDA (`LLaDA2MoeModelLM`)
125-
* [x] [mini-preview](https://huggingface.co/inclusionAI/LLaDA2.0-mini-preview/tree/d25d3b2ac0b966b64da11d6c791f8bf4bc31e90c)
125+
* [x] [mini-preview](https://huggingface.co/inclusionAI/LLaDA2.0-mini-preview/tree/d25d3b2ac0b966b64da11d6c791f8bf4bc31e90c), [mini](https://huggingface.co/inclusionAI/LLaDA2.0-mini/tree/d699e90dd1bd154d65128d9447f3570f4dac44f4)
126126

127127
Supported options (`--set OPTION VALUE`):
128128
- `block_length`: default 32

scripts/models.json

Lines changed: 14 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3469,7 +3469,7 @@
34693469
},
34703470
"llada2.0": {
34713471
"brief": "LLaDA2.0-mini-preview is a diffusion language model featuring a 16BA1B Mixture-of-Experts (MoE) architecture.",
3472-
"default": "mini-preview",
3472+
"default": "mini",
34733473
"license": "Apache License 2.0",
34743474
"variants": {
34753475
"mini-preview": {
@@ -3484,6 +3484,19 @@
34843484
"url": "chatllm_quantized_bailing/llada2.0-mini-preview-q4_1.bin"
34853485
}
34863486
}
3487+
},
3488+
"mini": {
3489+
"default": "q8",
3490+
"quantized": {
3491+
"q8": {
3492+
"size": 17277819216,
3493+
"url": "chatllm_quantized_bailing/llada2.0-mini.bin"
3494+
},
3495+
"q4_1": {
3496+
"size": 10166016336,
3497+
"url": "chatllm_quantized_bailing/llada2.0-mini-q4_1.bin"
3498+
}
3499+
}
34873500
}
34883501
}
34893502
},

0 commit comments

Comments
 (0)