1 file changed: +6 -6 lines.

@@ -1,5 +1,5 @@
 <!--
-# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
+# Copyright 2023-2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
 #
 # Redistribution and use in source and binary forms, with or without
 # modification, are permitted provided that the following conditions
@@ -30,8 +30,8 @@
 
 **LATEST RELEASE: You are currently on the main branch which tracks
 under-development progress towards the next release. The current release branch
-is [r23.12](https://github.com/triton-inference-server/vllm_backend/tree/r23.12)
-and which corresponds to the 23.12 container release on
+is [r24.01](https://github.com/triton-inference-server/vllm_backend/tree/r24.01)
+and which corresponds to the 24.01 container release on
 [NVIDIA GPU Cloud (NGC)](https://catalog.ngc.nvidia.com/orgs/nvidia/containers/tritonserver).**
 
 # vLLM Backend
@@ -96,9 +96,9 @@ A sample command to build a Triton Server container with all options enabled is
     --endpoint=grpc
     --endpoint=sagemaker
     --endpoint=vertex-ai
-    --upstream-container-version=23.12
-    --backend=python:r23.12
-    --backend=vllm:r23.12
+    --upstream-container-version=24.01
+    --backend=python:r24.01
+    --backend=vllm:r24.01
 ```
 
 ### Option 3. Add the vLLM Backend to the Default Triton Container
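
For context, the flags bumped in the last hunk are the version-pinning arguments of Triton's container build script. Below is a minimal sketch of the updated tail of the README's sample command, assuming the `build.py` script from the triton-inference-server/server repository that the README's build instructions refer to; the flags preceding the `--endpoint` options are elided here, just as they are in the diff:

```bash
# Sketch only: the leading flags of the README's full sample command are
# omitted. --upstream-container-version selects the 24.01 NGC base image,
# and the --backend flags pin the python and vllm backends to their
# matching r24.01 release branches.
./build.py -v \
    --endpoint=grpc \
    --endpoint=sagemaker \
    --endpoint=vertex-ai \
    --upstream-container-version=24.01 \
    --backend=python:r24.01 \
    --backend=vllm:r24.01
```

The invariant this change preserves is that the three version flags move in lockstep: the upstream container version (24.01) and the two backend branch tags (r24.01) must all refer to the same release.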
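The release pointer updated in the first hunk tells readers where to get artifacts matching this release. A hedged example of consuming the r24.01 release named above; the branch URL is taken verbatim from the diff, while the NGC image tag is an assumption based on Triton's usual `<yy.mm>-vllm-python-py3` naming convention:

```bash
# Check out the vLLM backend source at the release branch named in the README
# (URL taken verbatim from the diff above):
git clone -b r24.01 https://github.com/triton-inference-server/vllm_backend

# Assumed NGC image tag, following Triton's <yy.mm>-vllm-python-py3 convention:
docker pull nvcr.io/nvidia/tritonserver:24.01-vllm-python-py3
```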