1307 | 1307 | - filename: Qwen3-55B-A3B-TOTAL-RECALL-V1.3.i1-Q4_K_M.gguf |
1308 | 1308 | sha256: bcf5a1f8a40e9438a19b23dfb40e872561c310296c5ac804f937a0e3c1376def |
1309 | 1309 | uri: huggingface://mradermacher/Qwen3-55B-A3B-TOTAL-RECALL-V1.3-i1-GGUF/Qwen3-55B-A3B-TOTAL-RECALL-V1.3.i1-Q4_K_M.gguf |
| 1310 | + |
| 1311 | +- !!merge <<: *qwen3 |
| 1312 | + name: "qwen3-55b-a3b-total-recall-deep-40x" |
| 1313 | + icon: https://huggingface.co/DavidAU/Qwen3-55B-A3B-TOTAL-RECALL-V1.3/resolve/main/qwen3-total-recall.gif |
| 1314 | + urls: |
| 1315 | + - https://huggingface.co/DavidAU/Qwen3-55B-A3B-TOTAL-RECALL-Deep-40X-GGUF |
| 1316 | + description: | |
| 1317 | + WARNING: MADNESS - UN HINGED and... NSFW. Vivid prose. INTENSE. Visceral Details. Violence. HORROR. GORE. Swearing. UNCENSORED... humor, romance, fun. |
| 1318 | + Qwen3-55B-A3B-TOTAL-RECALL-Deep-40X-GGUF |
| 1319 | + |
| 1320 | + A highly experimental model ("tamer" versions below) based on Qwen3-30B-A3B (MOE, 128 experts, 8 activated), with Brainstorm 40X (by DavidAU - details at bottom of this page). |
| 1321 | + |
| 1322 | + These modifications blow the model (V1) out to 87 layers, 1046 tensors and 55B parameters. |
| 1323 | + |
| 1324 | + Note that some versions are smaller than this, with fewer layers/tensors and smaller parameter counts. |
| 1325 | + |
| 1326 | + The adapter extensively alters performance, reasoning and output generation. |
| 1327 | + |
| 1328 | + Exceptional changes in creative, prose and general performance. |
| 1329 | + |
| 1330 | + Regens of the same prompt - even with the same settings - will be very different. |
| 1331 | + |
| 1332 | + THREE example generations below - creative (generated with Q3_K_M, V1 model). |
| 1333 | + |
| 1334 | + ONE example generation (#4) - non-creative (generated with Q3_K_M, V1 model). |
| 1335 | + |
| 1336 | + You can run this model on CPU and/or GPU due to its unique construction, the size of its experts and the total activated experts at 3B parameters (8 experts), which translates into roughly 6B parameters in this version. |
| 1337 | + |
| 1338 | + Two quants uploaded for testing: Q3_K_M, Q4_K_M |
| 1339 | + |
| 1340 | + V3, V4 and V5 are also available in these two quants. |
| 1341 | + |
| 1342 | + V2 and V6 are available in Q3_K_M only, as are V1.3, V1.4, V1.5, V1.7 and V7 (newest). |
| 1343 | + |
| 1344 | + NOTE: V2 and up are from source model 2; V1, V1.3, V1.4, V1.5 and V1.7 are from source model 1. |
| 1345 | + overrides: |
| 1346 | + parameters: |
| 1347 | + model: Qwen3-55B-A3B-TOTAL-RECALL-V5-Deep-40X-q4_K_M.gguf |
| 1348 | + files: |
| 1349 | + - filename: Qwen3-55B-A3B-TOTAL-RECALL-V5-Deep-40X-q4_K_M.gguf |
| 1350 | + sha256: 20ef786a8c8e74eb257aa3069e237cbd40f42d25f5502fed6fa016bb8afbdae4 |
| 1351 | + uri: huggingface://DavidAU/Qwen3-55B-A3B-TOTAL-RECALL-Deep-40X-GGUF/Qwen3-55B-A3B-TOTAL-RECALL-V5-Deep-40X-q4_K_M.gguf |
1310 | 1352 | - &gemma3 |
1311 | 1353 | url: "github:mudler/LocalAI/gallery/gemma.yaml@master" |
1312 | 1354 | name: "gemma-3-27b-it" |
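The new entry pins the GGUF file to a `sha256` value, so a downloaded copy can be checked against it before use. The following is a minimal sketch of such a check, not the gallery's own verification code; the function names `sha256_of_file` and `matches_gallery_checksum` are hypothetical and only illustrate the idea.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks.

    Chunked reads keep memory use flat, which matters for multi-gigabyte
    GGUF files.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns b"" at end of file.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_gallery_checksum(path, expected_hex):
    """Compare a file's digest with the sha256 string from a gallery entry.

    The expected value is normalized (whitespace stripped, lowercased)
    because hex digests are case-insensitive.
    """
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For the entry added in this diff, the call would take the local path of `Qwen3-55B-A3B-TOTAL-RECALL-V5-Deep-40X-q4_K_M.gguf` and the string `20ef786a8c8e74eb257aa3069e237cbd40f42d25f5502fed6fa016bb8afbdae4`; a `False` result would suggest an incomplete download or a stale hash in the entry.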