Skip to content

Commit 7b1c65e

Browse files
author
kerem seyhan
committed
added some notes for qwen3.
1 parent 9543e29 commit 7b1c65e

File tree

1 file changed

+6
-0
lines changed

1 file changed

+6
-0
lines changed

models.json

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,9 @@
99
"references": [
1010
"https://qwen.readthedocs.io/en/latest/run_locally/llama.cpp.html#llama-cli",
1111
"https://huggingface.co/Qwen/Qwen3-235B-A22B#switching-between-thinking-and-non-thinking-mode"
12+
],
13+
"notes": [
14+
"qwen team suggests to set the --presence-penalty parameter between 0 and 2 to reduce endless repetitions and adds that a higher value may occasionally result in language mixing and a slight decrease in model performance."
1215
]
1316
},
1417
{
@@ -17,6 +20,9 @@
1720
"references": [
1821
"https://qwen.readthedocs.io/en/latest/run_locally/llama.cpp.html#llama-cli",
1922
"https://huggingface.co/Qwen/Qwen3-235B-A22B#switching-between-thinking-and-non-thinking-mode"
23+
],
24+
"notes": [
25+
"qwen team suggests to set the --presence-penalty parameter between 0 and 2 to reduce endless repetitions and adds that a higher value may occasionally result in language mixing and a slight decrease in model performance."
2026
]
2127
}
2228
]

0 commit comments

Comments
 (0)