Skip to content

Commit 7c335c9

Browse files
committed
make deployed app faster
1 parent 1d26224 commit 7c335c9

File tree

1 file changed

+3
-2
lines changed

1 file changed

+3
-2
lines changed

llm-complete-guide/deployment_hf.py

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -124,8 +124,9 @@ def predict(message, history):
124124
try:
125125
return process_input_with_retrieval(
126126
input=message,
127-
n_items_retrieved=20,
128-
use_reranking=True,
127+
n_items_retrieved=7,
128+
use_reranking=False,
129+
model="gpt-4o-mini",
129130
prompt=prompt,
130131
tracing_tags=["gradio", "web-interface", APP_ENVIRONMENT],
131132
)

0 commit comments

Comments
 (0)