fix: always run scheduler.step() on MLX executor to prevent GPU race
When has_waiting=False, scheduler.step() was called inline on the event
loop thread while VLM vision encoding could be running on _mlx_executor.
Two threads accessing the Metal GPU simultaneously caused a segfault.
Now every scheduler step runs on the single-worker executor, which
serializes all MLX operations.
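
The serialization described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: `FakeScheduler`, `run_step`, and `main` are hypothetical stand-ins, and only the names `scheduler.step()` and `_mlx_executor` come from the commit message. The sketch assumes the executor is a `ThreadPoolExecutor` with one worker, so any callable submitted to it runs strictly one at a time.

```python
import asyncio
import threading
import time
from concurrent.futures import ThreadPoolExecutor

# Assumption: the executor named in the commit is a single-worker thread pool.
_mlx_executor = ThreadPoolExecutor(max_workers=1, thread_name_prefix="mlx")


class FakeScheduler:
    """Hypothetical stand-in that counts overlapping 'GPU' accesses."""

    def __init__(self):
        self._busy = threading.Lock()
        self.overlaps = 0      # times two callers touched the GPU at once
        self.threads = set()   # names of threads that did GPU work

    def _gpu_work(self):
        # A failed non-blocking acquire means another thread is mid-operation.
        if not self._busy.acquire(blocking=False):
            self.overlaps += 1
            return
        try:
            self.threads.add(threading.current_thread().name)
            time.sleep(0.01)   # simulate a GPU operation
        finally:
            self._busy.release()

    def step(self):
        self._gpu_work()

    def encode_image(self):
        self._gpu_work()


async def run_step(scheduler):
    # Always hop to the executor, never call scheduler.step() inline
    # on the event loop thread.
    loop = asyncio.get_running_loop()
    await loop.run_in_executor(_mlx_executor, scheduler.step)


async def main():
    scheduler = FakeScheduler()
    loop = asyncio.get_running_loop()
    # Vision encoding and scheduler steps share the same single worker.
    vision = loop.run_in_executor(_mlx_executor, scheduler.encode_image)
    await asyncio.gather(vision, *(run_step(scheduler) for _ in range(3)))
    return scheduler


if __name__ == "__main__":
    result = asyncio.run(main())
    print(result.overlaps, sorted(result.threads))
```

Because every GPU-touching call is queued on the same worker thread, the event loop thread never touches the device directly, and the overlap counter in this sketch stays at zero.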