Commit 5f96917

Merge branch 'ggml-org:master' into master
2 parents 1f3c7dd + 8d88628 commit 5f96917

13 files changed, +1106 -167 lines changed

README.md

Lines changed: 1 addition & 0 deletions
@@ -84,6 +84,7 @@ Instructions for adding support for new models: [HOWTO-add-model.md](docs/develo
 - [X] [Mistral 7B](https://huggingface.co/mistralai/Mistral-7B-v0.1)
 - [x] [Mixtral MoE](https://huggingface.co/models?search=mistral-ai/Mixtral)
 - [x] [DBRX](https://huggingface.co/databricks/dbrx-instruct)
+- [x] [Jamba](https://huggingface.co/ai21labs)
 - [X] [Falcon](https://huggingface.co/models?search=tiiuae/falcon)
 - [X] [Chinese LLaMA / Alpaca](https://github.com/ymcui/Chinese-LLaMA-Alpaca) and [Chinese LLaMA-2 / Alpaca-2](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2)
 - [X] [Vigogne (French)](https://github.com/bofenghuang/vigogne)

ggml/src/ggml-cuda/common.cuh

Lines changed: 13 additions & 0 deletions
@@ -1005,3 +1005,16 @@ struct ggml_backend_cuda_context {
         return pool(device);
     }
 };
+
+struct ggml_cuda_mm_fusion_args_host {
+    const ggml_tensor * x_bias = nullptr;
+    const ggml_tensor * gate = nullptr;
+    const ggml_tensor * gate_bias = nullptr;
+    ggml_glu_op glu_op;
+};
+struct ggml_cuda_mm_fusion_args_device {
+    const void * x_bias = nullptr;
+    const void * gate = nullptr;
+    const void * gate_bias = nullptr;
+    ggml_glu_op glu_op;
+};
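
The two structs added to common.cuh mirror each other: the host variant holds `ggml_tensor` pointers, the device variant holds raw `const void *` pointers plus the same `glu_op`. The code that actually uses them lives in ggml-cuda.cu, whose diff is not rendered here, so the following is only a minimal sketch of how such a pair could be bridged. The `ggml_tensor` and `ggml_glu_op` definitions are simplified stand-ins so the snippet compiles on its own, and `to_device_args` is a hypothetical helper that does not appear in this commit.

```cpp
#include <cassert>

// Simplified stand-ins for the real ggml types (assumption: the real
// ggml_tensor exposes its buffer through a `data` pointer; everything
// else is omitted here).
enum ggml_glu_op { GGML_GLU_OP_REGLU, GGML_GLU_OP_SWIGLU };
struct ggml_tensor { void * data; };

// Copied from the diff above.
struct ggml_cuda_mm_fusion_args_host {
    const ggml_tensor * x_bias = nullptr;
    const ggml_tensor * gate = nullptr;
    const ggml_tensor * gate_bias = nullptr;
    ggml_glu_op glu_op;
};
struct ggml_cuda_mm_fusion_args_device {
    const void * x_bias = nullptr;
    const void * gate = nullptr;
    const void * gate_bias = nullptr;
    ggml_glu_op glu_op;
};

// Hypothetical helper (not part of the commit): unwrap each optional
// tensor to its raw data pointer, keeping nullptr for absent operands.
static ggml_cuda_mm_fusion_args_device to_device_args(
        const ggml_cuda_mm_fusion_args_host & host) {
    ggml_cuda_mm_fusion_args_device dev;
    dev.x_bias    = host.x_bias    ? host.x_bias->data    : nullptr;
    dev.gate      = host.gate      ? host.gate->data      : nullptr;
    dev.gate_bias = host.gate_bias ? host.gate_bias->data : nullptr;
    dev.glu_op    = host.glu_op;
    return dev;
}
```

Under these assumptions, a null pointer in either struct means "this fused operand is absent", so a kernel receiving the device struct could test each pointer before applying a bias or a gate.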

ggml/src/ggml-cuda/convert.cuh

Lines changed: 1 addition & 0 deletions
@@ -1,3 +1,4 @@
+#pragma once
 #include "common.cuh"
 
 #define CUDA_DEQUANTIZE_BLOCK_SIZE 256

ggml/src/ggml-cuda/ggml-cuda.cu

Lines changed: 352 additions & 1 deletion
Large diffs are not rendered by default.
