You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: examples/server/README.md
+5-6Lines changed: 5 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -692,7 +692,10 @@ Given a ChatML-formatted json description in `messages`, it returns the predicte
692
692
693
693
### GET `/slots`: Returns the current slots processing state
694
694
695
-
This endpoint can be disabled with `--no-slots`
695
+
> [!WARNING]
696
+
> This endpoint is intended fordebugging and may be modifiedin future versions. For security reasons, we strongly advise against enabling it in production environments.
697
+
698
+
This endpoint is disabled by default and can be enabled with `--slots`
696
699
697
700
If query param `?fail_on_no_slot=1` is set, this endpoint will respond with status code 503 if there is no available slots.
698
701
@@ -709,6 +712,7 @@ Example:
709
712
"grammar": "",
710
713
"id": 0,
711
714
"ignore_eos": false,
715
+
"is_processing": false,
712
716
"logit_bias": [],
713
717
"min_p": 0.05000000074505806,
714
718
"mirostat": 0,
@@ -741,7 +745,6 @@ Example:
741
745
"temperature"
742
746
],
743
747
"seed": 42,
744
-
"state": 1,
745
748
"stop": [
746
749
"\n"
747
750
],
@@ -755,10 +758,6 @@ Example:
755
758
]
756
759
```
757
760
758
-
Possible values for`slot[i].state` are:
759
-
- `0`: SLOT_STATE_IDLE
760
-
- `1`: SLOT_STATE_PROCESSING
761
-
762
761
### GET `/metrics`: Prometheus compatible metrics exporter
763
762
764
763
This endpoint is only accessible if`--metrics` is set.
0 commit comments