fix: Correct triton_container_version by yinggeh · Pull Request #7935 · triton-inference-server/server

yinggeh · 2025-01-14T22:02:49Z

Thanks for submitting a PR to Triton!
Please go the the Preview tab above this description box and select the appropriate sub-template:

If you already created the PR, please replace this message with one of

and fill it out.

* Add response statistics * Add L0_response_statistics * Enable http vs grpc statistics comparison * Add docs for response statistics protocol * Add more comments for response statistics test * Remove model name from config * Improve docs wordings * [Continue] Improve docs wordings * [Continue] Add more comments for response statistics test * [Continue 2] Improve docs wordings * Fix typo * Remove mentioning decoupled from docs * [Continue 3] Improve docs wordings * [Continue 4] Improve docs wordings Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> --------- Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>

* Switch to Python model for busyop test * Clean up * Address comment * Remove unused import

* Add cancellation into response statistics * Add test for response statistics cancel * Remove debugging print * Use is None comparison * Fix docs * Use default args None * Refactor RegisterModelStatistics()

* Modify "header_forward_pattern" to match headers case-insensitively. Add unit tests. * fix indentation * fix pre-comiit errors * Update doc * Update copyright * Add test case for "(?-i)", which disables regex case-insensitive mode. * fix pre-commit * Name each test. Remove support of disabling --http-header-forward-pattern case-insensitive mode on http python client. * Update .md file. * fix typo * Reformat args. * Fix pre-commit * Fix test name issue. * Fix pre-commit. * Update md file and copyright.

* Update README and versions for 2.43.0 / 24.02 * Update Dockefile to reduce image size. * Update path in patch file for model generation Update README.md post-24.02

* patching git repository parameterization from production branch 1 * Fix go package directory name * pre-commit fixes * pre-commit fixes --------- Co-authored-by: kyle <kmcgill@kmcgill-ubuntu.nvidia.com>

* Enhance bound check for shm offset * Add test for enhance bound check for shm offset * Fix off by 1 on max offset * Improve comments * Improve comment and offset * Separate logic between computation and validation

…6017) * Allow non-decoupled model to send response and FINAL flag separately * Update copyright * Defer sending error until FINAL flag is seen to avoid invalid reference * Move timestamp capture location * Delay time-point of response complete timestamp in GPRC and SageMaker endpoint * Move location of RESPONSE_COMPLETE timestamp capture to better align with the meaning.

Added a test case to check for optional/required input params in a request and appropriate response from server. Includes addition of 3 simple models with a combination of required/optional input params

Add flag to enable compile of OpenAI support in PA

* Test Correlation Id string support for BLS

* Add AsyncIO HTTP compression test * Improve command line option handling

* Update Docerkfile to install genai * Change the installation script * install both build and hatch * Update name --------- Co-authored-by: Elias Bermudez <dbermudez@nvidia.com>

* Added TRITONSERVER_InferenceTraceSetContext logic

…odes (#6992) * Add documentation for mapping between Triton Errors and HTTP status codes * formatting * Update README.md

* Update README and versions for 2.44.0 / 24.03 (#6971) * Update README and versions for 2.44.0 / 24.03 * Mchornyi 24.03 (#6972) * Current location is dropped in 12.4 * Update Dockerfile.win10.min * Change to triton_sample_folder (#6973) --------- Co-authored-by: kyle <kmcgill@kmcgill-ubuntu.nvidia.com> Co-authored-by: Misha Chornyi <99709299+mc-nv@users.noreply.github.com> * Specify path for PyTorch model extension library (#7025) * Update README.md 2.44.0 / 24.03 (#7032) * Update README.md post-24.03 --------- Co-authored-by: Kyle McGill <101670481+nv-kmcgill53@users.noreply.github.com> Co-authored-by: kyle <kmcgill@kmcgill-ubuntu.nvidia.com>

…ts) (#7855)

…7849)

Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>

Co-authored-by: Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com> Co-authored-by: Kyle McGill <kmcgill@nvidia.com>

)

Co-authored-by: Ryan McCormick <rmccormick@nvidia.com> Co-authored-by: Kyle McGill <101670481+nv-kmcgill53@users.noreply.github.com> Co-authored-by: Suman Tatiraju <167138127+statiraju@users.noreply.github.com> Co-authored-by: Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com> Co-authored-by: Suman Tatiraju <statiraju@nvidia.com>

… yinggeh-fix-triton-container-version

mc-nv and others added 30 commits February 16, 2024 12:11

Update 'main' to track development of 2.44.0 / 24.03 (#6892)

8a2a229

Fix busyop test for L0_memory_growth (#6900)

21a7fc5

* Switch to Python model for busyop test * Clean up * Address comment * Remove unused import

Add cancellation into response statistics (#6904)

60872b9

* Add cancellation into response statistics * Add test for response statistics cancel * Remove debugging print * Use is None comparison * Fix docs * Use default args None * Refactor RegisterModelStatistics()

Install required pip pkgs (#6906)

8d8b607

Add note on --cache-config spacing and fix typos (#6929)

551978b

Remove ignore files that are not in use by repository (#6893)

246f46c

Update README and versions for 2.43.0 / 24.02 (#6886)

1dcf2cf

* Update README and versions for 2.43.0 / 24.02 * Update Dockefile to reduce image size. * Update path in patch file for model generation Update README.md post-24.02

Set ONNX Runtime version 1.17.2

9be77f1

Expose tritonserver args in values.yaml (#5582)

19b02a2

Parameterize git repository (#6934)

d0f332b

* patching git repository parameterization from production branch 1 * Fix go package directory name * pre-commit fixes * pre-commit fixes --------- Co-authored-by: kyle <kmcgill@kmcgill-ubuntu.nvidia.com>

Enhance bound check for shm offset (#6914)

c2299d5

* Enhance bound check for shm offset * Add test for enhance bound check for shm offset * Fix off by 1 on max offset * Improve comments * Improve comment and offset * Separate logic between computation and validation

Add test for max queue delay timeout prompt response (#6938)

25266a5

Test improved input validation errors (#6933)

b012bd0

Added a test case to check for optional/required input params in a request and appropriate response from server. Includes addition of 3 simple models with a combination of required/optional input params

Update Dockerfile.sdk with OpenAI support (#6941)

52a1cd2

Add flag to enable compile of OpenAI support in PA

Test Correlation Id string support for BLS (#6963)

b2e6e7e

* Test Correlation Id string support for BLS

Update 'main' to track development of 2.45.0 / 24.04 (#6974)

9786e40

Add AsyncIO HTTP compression test (#6975)

e92abf2

* Add AsyncIO HTTP compression test * Improve command line option handling

Install genai-pa into SDK container (#6942)

8139431

* Update Docerkfile to install genai * Change the installation script * install both build and hatch * Update name --------- Co-authored-by: Elias Bermudez <dbermudez@nvidia.com>

extend existing tests with more parameters (#6951)

5c6e487

Exposing trace context to python backend (#6985)

9f16eef

* Added TRITONSERVER_InferenceTraceSetContext logic

Add documentation for mapping between Triton Errors and HTTP status c…

8b36aa8

…odes (#6992) * Add documentation for mapping between Triton Errors and HTTP status codes * formatting * Update README.md

Remove hatch version (#7009)

afaa6f4

Update vLLM to 0.3.2 for gemma support (#6918)

fdbfb27

Add missing copyright for L0_trace (#6996)

2be127b

fix sphinx warnings (#7030)

df753d7

Add meetup invite banner (#7049)

a844eda

kthui and others added 23 commits December 11, 2024 14:05

test: Fix requested output deleting extra outputs (#7866)

11af829

Update generated Dockerfile (#7876)

fc0fe6b

build: Adding b64 dependency to relevant targets (fix L0_build_varian…

e8a6090

…ts) (#7855)

fix: Handle dict type for content field in Chat Completions endpoint (#…

fedcfac

…7849)

ci: Fix Windows CI Errors (#7837)

587f877

Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>

docs: Re-structure User Guides for Discoverability (#7807)

9758344

Co-authored-by: Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com> Co-authored-by: Kyle McGill <kmcgill@nvidia.com>

perf: Upgrade vLLM version to 0.6.3.post1 (#7858)

c775239

build: Add python build package for building core python bindings (#7898

071f2a2

)

docs: Update OpenAI README for 24.12 release (#7899)

d547d80

Add environment variable to compose.py (#7909)

50cfd6e

Switch to docker volumes in model generation (#7910)

8a5cc8f

ci: Fix OpenVINO models (#7904)

1e4d838

fix: Fix scalar model generation for L0_scalar_io (#7920)

827078e

Fixing typo in script (#7923)

02fafea

test: Validate request correlation ID data type (#7919)

958636d

build: Extend TRT Plugin Handling to Support Windows (#7924)

bfc7f1f

fix: Fix package placeholder file name (#7926)

d3ff71a

fix: Fix copyrights for new files added to documentation (#7921)

65ef9c8

ci: Fix error-masking bug and improve debugability in L0_trace (#7930)

6100d7f

ci: Stabilize L0_pinned_memory flakiness (#7929)

2af0c22

Install python build package inside Winbase build container (#7934)

9199c76

Update build.py

87344e2

yinggeh requested a review from mc-nv January 14, 2025 22:02

yinggeh self-assigned this Jan 14, 2025

Update build.py

9ecf20d

pvijayakrish force-pushed the yinggeh-fix-triton-container-version branch from 3b99c7b to 9ecf20d Compare January 15, 2025 17:13

Merge branch 'main' of github.com:triton-inference-server/server into…

3038595

… yinggeh-fix-triton-container-version

yinggeh closed this Jan 15, 2025

yinggeh deleted the yinggeh-fix-triton-container-version branch January 15, 2025 23:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Correct triton_container_version#7935

fix: Correct triton_container_version#7935
yinggeh wants to merge 3495 commits intomainfrom
yinggeh-fix-triton-container-version

yinggeh commented Jan 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

20 participants

Conversation

yinggeh commented Jan 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

20 participants