Skip to content

Commit 914d9d2

Browse files
nicolexinliu-cong
andauthored
Update site-src/guides/troubleshooting.md
Co-authored-by: Cong Liu <[email protected]>
1 parent cac2e56 commit 914d9d2

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

site-src/guides/troubleshooting.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ This is a default gateway error, meaning the request never reached a backend ser
1616

1717
## 429 Too Many Requests
1818
### `system saturated, sheddable request dropped`
19-
This error indicates that the entire request pool has exceeded its saturation thresholds. This means the system is under heavy load and is shedding non-critical requests. To address this, check the following:
19+
This error indicates that the entire inference pool has exceeded its saturation thresholds. This means the system is under heavy load and is shedding non-critical requests. To address this, check the following:
2020

2121
* gateway-api-inference-extension version:
2222
* **v0.5.1 and earlier**: Verify you're using an `InferenceModel` and that its `criticality` is set to `Critical`. This ensures requests are queued on the model servers instead of being dropped.

0 commit comments

Comments
 (0)