You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
and [gRPC](https://github.com/triton-inference-server/client#grpc-options)
200
195
client options
201
196
- Send input data (e.g. a jpeg image) directly to Triton in the [body of an HTTP
202
-
request without any additional metadata](https://github.com/triton-inference-server/server/blob/main/docs/protocol/extension_binary_data.md#raw-binary-request)
197
+
request without any additional metadata](https://github.com/triton-inference-server/server/blob/r25.01/docs/protocol/extension_binary_data.md#raw-binary-request)
203
198
204
199
### Extend Triton
205
200
@@ -208,7 +203,7 @@ designed for modularity and flexibility
208
203
209
204
-[Customize Triton Inference Server container](docs/customization_guide/compose.md) for your use case
help="This flag sets the Python version for RHEL platform of Triton Inference Server to be built. Default: the latest supported version.",
2699
2728
)
2729
+
parser.add_argument(
2730
+
"--build-secret",
2731
+
action="append",
2732
+
required=False,
2733
+
nargs=2,
2734
+
metavar=("key", "value"),
2735
+
help="Add build secrets in the form of <key> <value>. These secrets are used during the build process for vllm. The secrets are passed to the Docker build step as `--secret id=<key>`. The following keys are expected and their purposes are described below:\n\n"
2736
+
" - 'req': A file containing a list of dependencies for pip (e.g., requirements.txt).\n"
2737
+
" - 'vllm_index_url': The index URL for the pip install.\n"
2738
+
" - 'pytorch_triton_url': The location of the PyTorch wheel to download.\n"
2739
+
" - 'build_public_vllm': A flag (default is 'true') indicating whether to build the public VLLM version.\n\n"
2740
+
"Ensure that the required environment variables for these secrets are set before running the build.",
2741
+
)
2700
2742
FLAGS=parser.parse_args()
2701
2743
2702
2744
ifFLAGS.imageisNone:
@@ -2723,6 +2765,8 @@ def enable_all():
2723
2765
FLAGS.override_backend_cmake_arg= []
2724
2766
ifFLAGS.extra_backend_cmake_argisNone:
2725
2767
FLAGS.extra_backend_cmake_arg= []
2768
+
ifFLAGS.build_secretisNone:
2769
+
FLAGS.build_secret= []
2726
2770
2727
2771
# if --enable-all is specified, then update FLAGS to enable all
2728
2772
# settings, backends, repo-agents, caches, file systems, endpoints, etc.
0 commit comments