You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fixes integer underflow when n_discard >= cache_tokens.size() that causes
std::length_error crashes. This commonly occurs during KV cache context
shifting, particularly with Chinese text translation workloads.
The fix adds proper bounds checking before resizing the cache_tokens vector.
Fixes#771
0 commit comments