Is balance_serve mode unusable under WSL on Windows? #1348
Unanswered
withyou971 asked this question in Q&A
Replies: 1 comment
-
It works fine on my side. I tested models from across the Qwen3 series. The CPU is an i5-14600KF and the GPU is a 5060 Ti.
-
Startup gets this far and then goes no further:
loading model.layers.46.post_attention_layernorm.weight to cuda
loading model.layers.47.self_attn.q_norm.weight to cuda
loading model.layers.47.self_attn.k_norm.weight to cuda
loading model.layers.47.input_layernorm.weight to cuda
loading model.layers.47.post_attention_layernorm.weight to cuda
loading model.norm.weight to cuda
Getting inference context from sched_client.
sched_rpc started with PID: 116
Got inference context, sending it to subscribers.
Rebuilding kvcache
48
kv_cache loaded successfully.