Commit c80a775
authored
vocab : add special infill tokens for CodeLlama (ggml-org#11850)
* vocab : add special infill tokens for CodeLlama
The commit adds the following special tokens for CodeLlama infill:
- `▁<PRE>`
- `▁<SUF>`
- `▁<MID>`
The motivation for this is that currently the infill example uses
CodeLlama as a suggested model. But when using this model the following
error is generated:
```console
/llama.cpp-debug/examples/infill/infill.cpp:165: GGML_ASSERT(llama_vocab_fim_pre(vocab) >= 0) failed
Could not attach to process. If your uid matches the uid of the target
process, check the setting of /proc/sys/kernel/yama/ptrace_scope, or try
again as the root user. For more details, see /etc/sysctl.d/10-ptrace.conf
ptrace: Operation not permitted.
No stack.
The program is not being run.
305251 Aborted (core dumped)
./build/bin/llama-infill -t 10 -ngl 0 -m models/codellama-13b.Q5_K_S.gguf \
-c 4096 --temp 0.7 --repeat_penalty 1.1 -n 20 \
--in-prefix "def helloworld():\n print(\"hell" \
--in-suffix "\n print(\"goodbye world\")\n "
```
* squash! vocab : add special infill tokens for CodeLlama
Add _<EOT> as well.1 parent 250d795 commit c80a775
1 file changed
+5
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1807 | 1807 | | |
1808 | 1808 | | |
1809 | 1809 | | |
| 1810 | + | |
1810 | 1811 | | |
1811 | 1812 | | |
1812 | 1813 | | |
| |||
1839 | 1840 | | |
1840 | 1841 | | |
1841 | 1842 | | |
| 1843 | + | |
1842 | 1844 | | |
1843 | 1845 | | |
1844 | 1846 | | |
| |||
1856 | 1858 | | |
1857 | 1859 | | |
1858 | 1860 | | |
| 1861 | + | |
1859 | 1862 | | |
1860 | 1863 | | |
1861 | 1864 | | |
| |||
1873 | 1876 | | |
1874 | 1877 | | |
1875 | 1878 | | |
| 1879 | + | |
1876 | 1880 | | |
1877 | 1881 | | |
1878 | 1882 | | |
| |||
1957 | 1961 | | |
1958 | 1962 | | |
1959 | 1963 | | |
| 1964 | + | |
1960 | 1965 | | |
1961 | 1966 | | |
1962 | 1967 | | |
| |||
0 commit comments