Releases · mofosyne/llama.cpp
b2998
train : change default FA argument (#7528)
b2989
docker.yml: disable light-intel and server-intel test (#7515)
* docker.yml: disable light-intel test
* docker.yml: disable server-intel test
b2988
Add support for ArcticForCausalLM (#7020)
* common : increase max number of experts to 128
* common : add tensor LLM_TENSOR_FFN_NORM_EXPS for normalization before MoE that runs in parallel to attention + ffn
* gguf-py : add architecture-specific block mappings that override selected general block mappings
* convert-hf : add model conversion support for ArcticForCausalLM
* convert-hf : use added_tokens_decoder from tokenizer_config.json to redefine tokens from SentencePiece model (only for ArcticForCausalLM)
* llama : add inference support for LLM_ARCH_ARCTIC

Co-authored-by: Stanisław Szymczyk <[email protected]>
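The gguf-py and convert-hf items above describe two conversion-side mechanisms: architecture-specific block mappings that take precedence over the general tensor-name mappings, and token redefinition driven by added_tokens_decoder in tokenizer_config.json. Below is a minimal Python sketch of both ideas under stated assumptions; the mapping tables, HF-side tensor names, and helper functions are illustrative, not the actual gguf-py API.

```python
import json
from pathlib import Path

# General HF -> GGUF block mappings shared by most architectures
# (hypothetical table; the real one lives in gguf-py).
GENERAL_BLOCK_MAPPINGS = {
    "model.layers.{bid}.input_layernorm": "blk.{bid}.attn_norm",
    "model.layers.{bid}.post_attention_layernorm": "blk.{bid}.ffn_norm",
}

# Architecture-specific entries override selected general ones,
# e.g. routing a norm to the new ffn_norm_exps tensor for Arctic.
ARCTIC_BLOCK_MAPPINGS = {
    "model.layers.{bid}.post_attention_layernorm": "blk.{bid}.ffn_norm_exps",
}

def block_mappings(arch: str) -> dict[str, str]:
    """Merge general mappings with per-architecture overrides."""
    merged = dict(GENERAL_BLOCK_MAPPINGS)
    if arch == "ArcticForCausalLM":
        merged.update(ARCTIC_BLOCK_MAPPINGS)  # overrides win on key clash
    return merged

def added_tokens(model_dir: Path) -> dict[int, str]:
    """Read added_tokens_decoder from tokenizer_config.json so those
    tokens can be redefined over the base SentencePiece vocabulary."""
    cfg = json.loads((model_dir / "tokenizer_config.json").read_text())
    return {int(i): tok["content"]
            for i, tok in cfg.get("added_tokens_decoder", {}).items()}
```

A dict merge keeps the override rule trivially auditable: any key present in the architecture table shadows the general entry, and everything else falls through unchanged.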
b2987
add build shared lib in win release package (#7438)
b2979
Add missing inference support for GPTNeoXForCausalLM (Pythia and GPT-…
b2963
CUDA: remove incorrect precision check (#7454)
b2941
Add provisions for windows support for BF16 code including CMake prov…
b2930
cmake : update android comments (#7341)
b2918
ggml : fix quants nans when all the group weights are very close to z…
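This entry points at a classic quantization failure mode: when every weight in a group is nearly zero, the scale derived from the group's absolute maximum collapses toward zero and the subsequent division yields inf/NaN. The following numpy sketch shows the shape of such a guard; it is an assumption-laden illustration, not the actual ggml C code, and the threshold value is arbitrary.

```python
import numpy as np

def quantize_group(weights: np.ndarray, qmax: int = 127):
    """Quantize one group of weights to signed 8-bit integers.

    Guards against a (near-)zero group maximum, which would otherwise
    make the division below produce inf/NaN (hypothetical sketch of
    the failure mode this release fixes).
    """
    amax = float(np.max(np.abs(weights)))
    if amax < 1e-30:  # all weights ~0: emit zeros with a zero scale
        return np.zeros(weights.shape, dtype=np.int8), 0.0
    scale = amax / qmax
    q = np.clip(np.round(weights / scale), -qmax, qmax).astype(np.int8)
    return q, scale
```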
b2903
Revert "server bench: fix bench not waiting for model load (#7284)" (…