Commit dd4b565

update docs for deploying via trt-llm
1 parent f53793b commit dd4b565

23 files changed: +69 -20 lines changed

11-embeddings-reranker-classification-tensorrt/BEI-allenai-llama-3.1-tulu-3-8b-reward-model-fp8/README.md

Lines changed: 3 additions & 1 deletion
@@ -102,7 +102,9 @@ environment_variables: {}
 external_package_dirs: []
 model_metadata:
   example_model_input:
-    inputs: Baseten is a fast inference provider
+    inputs:
+    - - Baseten is a fast inference provider
+    - - Classify this separately.
     raw_scores: true
     truncate: true
     truncation_direction: Right

11-embeddings-reranker-classification-tensorrt/BEI-allenai-llama-3.1-tulu-3-8b-reward-model-fp8/config.yaml

Lines changed: 3 additions & 1 deletion
@@ -3,7 +3,9 @@ environment_variables: {}
 external_package_dirs: []
 model_metadata:
   example_model_input:
-    inputs: Baseten is a fast inference provider
+    inputs:
+    - - Baseten is a fast inference provider
+    - - Classify this separately.
     raw_scores: true
     truncate: true
     truncation_direction: Right

11-embeddings-reranker-classification-tensorrt/BEI-baseten-example-meta-llama-3-70b-instructforsequenceclassification-fp8/README.md

Lines changed: 3 additions & 1 deletion
@@ -102,7 +102,9 @@ environment_variables: {}
 external_package_dirs: []
 model_metadata:
   example_model_input:
-    inputs: Baseten is a fast inference provider
+    inputs:
+    - - Baseten is a fast inference provider
+    - - Classify this separately.
     raw_scores: true
     truncate: true
     truncation_direction: Right

11-embeddings-reranker-classification-tensorrt/BEI-baseten-example-meta-llama-3-70b-instructforsequenceclassification-fp8/config.yaml

Lines changed: 3 additions & 1 deletion
@@ -3,7 +3,9 @@ environment_variables: {}
 external_package_dirs: []
 model_metadata:
   example_model_input:
-    inputs: Baseten is a fast inference provider
+    inputs:
+    - - Baseten is a fast inference provider
+    - - Classify this separately.
     raw_scores: true
     truncate: true
     truncation_direction: Right

11-embeddings-reranker-classification-tensorrt/BEI-mixedbread-ai-mxbai-rerank-base-v2-reranker-fp8/README.md

Lines changed: 3 additions & 1 deletion
@@ -102,7 +102,9 @@ environment_variables: {}
 external_package_dirs: []
 model_metadata:
   example_model_input:
-    inputs: Baseten is a fast inference provider
+    inputs:
+    - - Baseten is a fast inference provider
+    - - Classify this separately.
     raw_scores: true
     truncate: true
     truncation_direction: Right

11-embeddings-reranker-classification-tensorrt/BEI-mixedbread-ai-mxbai-rerank-base-v2-reranker-fp8/config.yaml

Lines changed: 3 additions & 1 deletion
@@ -3,7 +3,9 @@ environment_variables: {}
 external_package_dirs: []
 model_metadata:
   example_model_input:
-    inputs: Baseten is a fast inference provider
+    inputs:
+    - - Baseten is a fast inference provider
+    - - Classify this separately.
     raw_scores: true
     truncate: true
     truncation_direction: Right

11-embeddings-reranker-classification-tensorrt/BEI-mixedbread-ai-mxbai-rerank-large-v2-reranker-fp8/README.md

Lines changed: 3 additions & 1 deletion
@@ -102,7 +102,9 @@ environment_variables: {}
 external_package_dirs: []
 model_metadata:
   example_model_input:
-    inputs: Baseten is a fast inference provider
+    inputs:
+    - - Baseten is a fast inference provider
+    - - Classify this separately.
     raw_scores: true
     truncate: true
     truncation_direction: Right

11-embeddings-reranker-classification-tensorrt/BEI-mixedbread-ai-mxbai-rerank-large-v2-reranker-fp8/config.yaml

Lines changed: 3 additions & 1 deletion
@@ -3,7 +3,9 @@ environment_variables: {}
 external_package_dirs: []
 model_metadata:
   example_model_input:
-    inputs: Baseten is a fast inference provider
+    inputs:
+    - - Baseten is a fast inference provider
+    - - Classify this separately.
     raw_scores: true
     truncate: true
     truncation_direction: Right

11-embeddings-reranker-classification-tensorrt/BEI-papluca-xlm-roberta-base-language-detection-classification/README.md

Lines changed: 3 additions & 1 deletion
@@ -101,7 +101,9 @@ environment_variables: {}
 external_package_dirs: []
 model_metadata:
   example_model_input:
-    inputs: Baseten is a fast inference provider
+    inputs:
+    - - Baseten is a fast inference provider
+    - - Classify this separately.
     raw_scores: true
     truncate: true
     truncation_direction: Right

11-embeddings-reranker-classification-tensorrt/BEI-papluca-xlm-roberta-base-language-detection-classification/config.yaml

Lines changed: 3 additions & 1 deletion
@@ -3,7 +3,9 @@ environment_variables: {}
 external_package_dirs: []
 model_metadata:
   example_model_input:
-    inputs: Baseten is a fast inference provider
+    inputs:
+    - - Baseten is a fast inference provider
+    - - Classify this separately.
     raw_scores: true
     truncate: true
     truncation_direction: Right
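
Note on the new input shape: every hunk in this commit replaces a single string under `inputs` with a YAML sequence of sequences, which parses to a list of one-element lists. The sketch below builds the equivalent request body in Python. The helper name `build_classify_payload` is hypothetical and not part of the repository, and no endpoint or client call is assumed; the extra fields are copied from the example input in the diff. How the server treats the outer and inner lists is not stated in the commit, although the second example string suggests each outer entry is classified on its own.

```python
import json
from typing import List, Sequence, Union


def build_classify_payload(inputs: Union[str, Sequence]) -> dict:
    """Hypothetical helper: wrap inputs in the nested-list shape from the diff.

    A bare string becomes [[string]]; in a sequence, each string item is
    wrapped in its own inner list, and items that are already sequences
    are kept as lists.
    """
    if isinstance(inputs, str):
        batch: List[list] = [[inputs]]
    else:
        batch = [
            [item] if isinstance(item, str) else list(item)
            for item in inputs
        ]
    # The remaining fields mirror example_model_input in the changed files.
    return {
        "inputs": batch,
        "raw_scores": True,
        "truncate": True,
        "truncation_direction": "Right",
    }


payload = build_classify_payload(
    [
        "Baseten is a fast inference provider",
        "Classify this separately.",
    ]
)
print(json.dumps(payload, indent=2))
```

Passing the old single-string form to the helper yields `[["..."]]`, so callers written against the previous example can be migrated without changing their own data.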
