LLamaCausalLM add support for tokenizer.json #9931

robbiemu · 2024-10-18T03:14:18Z

SentencePiece by default uses BPE, which by default also uses a tokenizer.json. This does not have to be customized with tokens you cannot read from tokenizer.model, but as the BSC-LT/salamanda-7b and related models show, it can be. modified the LlamaModel class to augment the vocabulatry from the json file if it is present.

related issue: #9899

…l-org#9929) ggml-ci

add intel amx isa detection add vnni kernel for gemv cases add vnni and amx kernel support for block_q8_0 code cleanup fix packing B issue enable openmp fine tune amx kernel switch to aten parallel pattern add error message for nested parallelism code cleanup add f16 support in ggml-amx add amx kernels for QK_K quant formats: Q4_K, Q5_K, Q6_K and IQ4_XS update CMakeList update README fix some compilation warning fix compiler warning when amx is not enabled minor change ggml-ci move ggml_amx_init from ggml.c to ggml-amx/mmq.cpp ggml-ci update CMakeLists with -mamx-tile, -mamx-int8 and -mamx-bf16 ggml-ci add amx as an ggml-backend update header file, the old path for immintrin.h has changed to ggml-cpu-impl.h minor change update CMakeLists.txt minor change apply weight prepacking in set_tensor method in ggml-backend fix compile error ggml-ci minor change ggml-ci update CMakeLists.txt ggml-ci add march dependency minor change ggml-ci change ggml_backend_buffer_is_host to return false for amx backend ggml-ci fix supports_op use device reg for AMX backend ggml-ci minor change ggml-ci minor change fix rebase set .buffer_from_host_ptr to be false for AMX backend

…rg#9705) * implemented missing SYCL event APIs * sycl : Added device and backend reg interfaces * Restructured ggml-sycl.cpp

* rpc : refactor backend Use structs for RPC request/response messages * rpc : refactor server

…l-org#9745) * refactor llama_batch_get_one * adapt all examples * fix simple.cpp * fix llama_bench * fix * fix context shifting * free batch before return * use common_batch_add, reuse llama_batch in loop * null terminated seq_id list * fix save-load-state example * fix perplexity * correct token pos in llama_batch_allocr

…casuallm-sp-bpe

llama_cpp_canister allows you to run llama.cpp as a Smart Contract on the Internet Computer. The smart contract runs as WebAssembly in a so-called 'canister'.

Update the binding list by adding LM-Kit.NET (C# & VB.NET)

…casuallm-sp-bpe

robbiemu · 2024-10-20T18:32:15Z

I have no idea why I am seeing so many files changed on this page right now...

git status
On branch llamacasuallm-sp-bpe
Your branch is up to date with 'origin/llamacasuallm-sp-bpe'.

nothing to commit, working tree clean

git --no-pager diff --name-only upstream/master HEAD
.gitignore
convert_hf_to_gguf.py

git diff origin/llamacasuallm-sp-bpe --name-only

git remote get-url upstream
https://github.com/ggerganov/llama.cpp.git

I've been syncing and rebasing to keep my PR fresh to master.. I think I must have force pushed instead of merging first the last time, I will recreate this

basic concept

3c86af2

github-actions bot added the python python script changes label Oct 18, 2024

robbiemu mentioned this pull request Oct 18, 2024

Bug: imatrix crash - nan detected in blk.1.attn_output.weight #9899

Closed

ggerganov and others added 7 commits October 18, 2024 07:32

server : add n_indent parameter for line indentation requirement (ggm…

8901755

…l-org#9929) ggml-ci

[SYCL] Add SYCL Backend registry, device and Event Interfaces (ggml-o…

87421a2

…rg#9705) * implemented missing SYCL event APIs * sycl : Added device and backend reg interfaces * Restructured ggml-sycl.cpp

rpc : backend refactoring (ggml-org#9912)

afd9909

* rpc : refactor backend Use structs for RPC request/response messages * rpc : refactor server

basic concept

730756f

Merge remote-tracking branch 'origin/llamacasuallm-sp-bpe' into llama…

a8e48e3

…casuallm-sp-bpe

github-actions bot added build Compilation issues android Issues specific to Android examples server ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels Oct 19, 2024

icppWorld and others added 4 commits October 20, 2024 19:01

readme : update infra list (ggml-org#9942)

7cab208

llama_cpp_canister allows you to run llama.cpp as a Smart Contract on the Internet Computer. The smart contract runs as WebAssembly in a so-called 'canister'.

readme : update bindings list (ggml-org#9951)

45f0976

Update the binding list by adding LM-Kit.NET (C# & VB.NET)

basic concept

ff906dc

Merge remote-tracking branch 'origin/llamacasuallm-sp-bpe' into llama…

d89f49b

…casuallm-sp-bpe

robbiemu closed this Oct 20, 2024

robbiemu deleted the llamacasuallm-sp-bpe branch October 20, 2024 22:57

robbiemu restored the llamacasuallm-sp-bpe branch October 20, 2024 22:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

LLamaCausalLM add support for tokenizer.json #9931

LLamaCausalLM add support for tokenizer.json #9931

Uh oh!

robbiemu commented Oct 18, 2024 •

edited

Loading

Uh oh!

robbiemu commented Oct 20, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

LLamaCausalLM add support for tokenizer.json #9931

LLamaCausalLM add support for tokenizer.json #9931

Uh oh!

Conversation

robbiemu commented Oct 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

robbiemu commented Oct 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

robbiemu commented Oct 18, 2024 •

edited

Loading

robbiemu commented Oct 20, 2024 •

edited

Loading