Commit 08365ea

Update
1 parent da18f28 commit 08365ea

1 file changed: +10 -0 lines changed

protobuf/model_config.proto

Lines changed: 10 additions & 0 deletions
@@ -1659,6 +1659,16 @@ message ModelEnsembling
 //@@ The models and the input / output mappings used within the ensemble.
 //@@
 repeated Step step = 1;
+
+//@@ .. cpp:var:: uint32 max_inflight_responses
+//@@
+//@@ The maximum number of concurrent inflight responses from ensemble
+//@@ steps to downstream consumers. This limit prevents unbounded memory
+//@@ growth when decoupled models produce responses faster than downstream
+//@@ models can consume them. Default value is 0, which indicates that no
+//@@ limit is enforced (unlimited).
+//@@
+uint32 max_inflight_responses = 2;
 }

 //@@
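
For illustration, the new field could be set in a model's configuration roughly as follows. This is a minimal sketch, not taken from the commit: it assumes `ModelEnsembling` is populated through the `ensemble_scheduling` block of a `config.pbtxt`, and the model names, tensor names, and the limit of 8 are all hypothetical. The ensemble's own `input` and `output` declarations are omitted for brevity.

```
# Hypothetical ensemble config fragment; all names and values are illustrative.
name: "example_ensemble"
platform: "ensemble"
ensemble_scheduling {
  # Added by this commit. 0 (the default) means no limit is enforced.
  max_inflight_responses: 8
  step [
    {
      # Assumed to be a decoupled model that may emit many responses per request.
      model_name: "decoupled_producer"
      model_version: -1
      input_map { key: "INPUT" value: "ENSEMBLE_INPUT" }
      output_map { key: "OUTPUT" value: "intermediate" }
    },
    {
      # Assumed to consume responses more slowly than the producer emits them.
      model_name: "slow_consumer"
      model_version: -1
      input_map { key: "INPUT" value: "intermediate" }
      output_map { key: "OUTPUT" value: "ENSEMBLE_OUTPUT" }
    }
  ]
}
```

Because the field is a proto3 `uint32`, leaving it out is equivalent to setting it to 0, so existing ensemble configurations keep their current unlimited behavior.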
