Skip to content

Commit a611cbc

Browse files
authored
chore(model gallery): add qwen3-55b-a3b-total-recall-deep-40x (#5747)
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent 850b525 commit a611cbc

File tree

1 file changed

+42
-0
lines changed

1 file changed

+42
-0
lines changed

gallery/index.yaml

Lines changed: 42 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1307,6 +1307,48 @@
13071307
- filename: Qwen3-55B-A3B-TOTAL-RECALL-V1.3.i1-Q4_K_M.gguf
13081308
sha256: bcf5a1f8a40e9438a19b23dfb40e872561c310296c5ac804f937a0e3c1376def
13091309
uri: huggingface://mradermacher/Qwen3-55B-A3B-TOTAL-RECALL-V1.3-i1-GGUF/Qwen3-55B-A3B-TOTAL-RECALL-V1.3.i1-Q4_K_M.gguf
1310+
1311+
- !!merge <<: *qwen3
1312+
name: "qwen3-55b-a3b-total-recall-deep-40x"
1313+
icon: https://huggingface.co/DavidAU/Qwen3-55B-A3B-TOTAL-RECALL-V1.3/resolve/main/qwen3-total-recall.gif
1314+
urls:
1315+
- https://huggingface.co/DavidAU/Qwen3-55B-A3B-TOTAL-RECALL-Deep-40X-GGUF
1316+
description: |
1317+
WARNING: MADNESS - UN HINGED and... NSFW. Vivid prose. INTENSE. Visceral Details. Violence. HORROR. GORE. Swearing. UNCENSORED... humor, romance, fun.
1318+
Qwen3-55B-A3B-TOTAL-RECALL-Deep-40X-GGUF
1319+
1320+
A highly experimental model ("tamer" versions below) based on Qwen3-30B-A3B (MOE, 128 experts, 8 activated), with Brainstorm 40X (by DavidAU - details at bottom of this page).
1321+
1322+
These modifications blow the model (V1) out to 87 layers, 1046 tensors and 55B parameters.
1323+
1324+
Note that some versions are smaller than this, with fewer layers/tensors and smaller parameter counts.
1325+
1326+
The adapter extensively alters performance, reasoning and output generation.
1327+
1328+
Exceptional changes in creative, prose and general performance.
1329+
1330+
Regens of the same prompt - even with the same settings - will be very different.
1331+
1332+
THREE example generations below - creative (generated with Q3_K_M, V1 model).
1333+
1334+
ONE example generation (#4) - non creative (generated with Q3_K_M, V1 model).
1335+
1336+
You can run this model on CPU and/or GPU due to unique model construction, size of experts and total activated experts at 3B parameters (8 experts), which translates into roughly almost 6B parameters in this version.
1337+
1338+
Two quants uploaded for testing: Q3_K_M, Q4_K_M
1339+
1340+
V3, V4 and V5 are also available in these two quants.
1341+
1342+
V2 and V6 in Q3_k_m only; as are: V 1.3, 1.4, 1.5, 1.7 and V7 (newest)
1343+
1344+
NOTE: V2 and up are from source model 2, V1 and 1.3,1.4,1.5,1.7 are from source model 1.
1345+
overrides:
1346+
parameters:
1347+
model: Qwen3-55B-A3B-TOTAL-RECALL-V5-Deep-40X-q4_K_M.gguf
1348+
files:
1349+
- filename: Qwen3-55B-A3B-TOTAL-RECALL-V5-Deep-40X-q4_K_M.gguf
1350+
sha256: 20ef786a8c8e74eb257aa3069e237cbd40f42d25f5502fed6fa016bb8afbdae4
1351+
uri: huggingface://DavidAU/Qwen3-55B-A3B-TOTAL-RECALL-Deep-40X-GGUF/Qwen3-55B-A3B-TOTAL-RECALL-V5-Deep-40X-q4_K_M.gguf
13101352
- &gemma3
13111353
url: "github:mudler/LocalAI/gallery/gemma.yaml@master"
13121354
name: "gemma-3-27b-it"

0 commit comments

Comments
 (0)