jan loads/unloads the LLM model into memory for each query #6446
Answered
by
louis-jan
MRDOCTOROO
asked this question in
Get Help
-
|
Because it is unloaded every few seconds when not used, it takes a long time to load the model each time, and the cost is higher. Also, the model list cannot be obtained using the API service. When the model is not started, a whitelist is added and it is started with 0.0.0.0. The model cannot be accessed from outside the local cross-domain. |
Beta Was this translation helpful? Give feedback.
Answered by
louis-jan
Nov 10, 2025
Replies: 1 comment
-
|
@MRDOCTOROO we added Trusted Host setting so please help try again. |
Beta Was this translation helpful? Give feedback.
0 replies
Answer selected by
louis-jan
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
@MRDOCTOROO we added Trusted Host setting so please help try again.