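Every call to the `predict` method goes through the model-loading step again, which is very time-consuming. Is there a way to keep the model resident in GPU memory, so that the next prediction can use the already-loaded model directly for inference?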
Answered by zynovo1 on Sep 2, 2025
Found it: deploy the model as a service. The model is loaded when the service starts and stays resident from then on.
Answer selected by zynovo1