Skip to content

Commit 12213b6

Browse files
committed
reduce context size for resource constrained devices
1 parent 7e0ad63 commit 12213b6

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

src/embedder.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,13 +4,13 @@
44
from tqdm import tqdm
55

66
class SentenceTransformer:
7-
def __init__(self, model_path: str, n_ctx: int = 40960, n_threads: int = None):
7+
def __init__(self, model_path: str, n_ctx: int = 32768, n_threads: int = None):
88
"""
99
Initialize with a local GGUF model file path.
1010
1111
Args:
1212
model_path: Path to your local .gguf file
13-
n_ctx: Context window size (increased to match Qwen3 training context)
13+
n_ctx: Context window size (default 32768 to match Qwen3 training context)
1414
n_threads: Number of threads to use (None = auto-detect)
1515
"""
1616
print(f"Loading model with n_ctx={n_ctx}, n_threads={n_threads}")

0 commit comments

Comments
 (0)