llm-d
diff --git a/README.md b/README.md
Lines changed: 54 additions & 0 deletions
diff --git a/pkg/common/config.go b/pkg/common/config.go
Lines changed: 4 additions & 0 deletions
diff --git a/pkg/common/config_test.go b/pkg/common/config_test.go
Lines changed: 6 additions & 0 deletions
@@ -362,3 +362,57 @@ curl -X POST http://localhost:8000/v1/chat/completions \
     ]
   }'
 ```
+
+## Response generation
+
+The `/v1/completions` and `/v1/chat/completions` endpoints produce responses based on simulator configurations and the specific request parameters.
+
+### Echo mode
+In `echo` mode, responses always mirror the request content.
+For `/v1/completions`, the `prompt` field is returned.
+For `/v1/chat/completions`, the last message is returned.
+
+The `max_tokens`, `max_completion_tokens` and `ignore_eos` parameters are ignored in this mode.
+
+### Random mode
+In `random` mode, the `max_tokens`, `max_completion_tokens` and `ignore_eos` request fields are used during response generation.
+
+#### Use predefined texts for response generation
+The simulator can generate responses from a predefined list of sentences.
+If `max_tokens` or `max_completion_tokens` is specified, the response length is calculated using a histogram with six buckets and the following probabilities: 20%, 30%, 20%, 5%, 10%, 15%.
+For a maximum length ≤ 120, bucket sizes are equal.
+For a maximum length > 120, all buckets except the fourth are of size 20;
+the fourth bucket covers the remaining range.
+After the buckets are set, response length is sampled according to these probabilities.
+
+
+Examples: <br>
+max-len = 120: the buckets are 1-20, 21-40, 41-60, 61-80, 81-100, 101-120. <br>
+max-len = 200: the buckets are 1-20, 21-40, 41-60, 61-160, 161-180, 181-200. <br>
+
+If the maximum response length is not specified, it defaults to `<model length>-<input-length>`.
+In this case, response length is sampled from a Gaussian distribution with mean 40 and standard deviation 20.
+
+
+After determining the response length:
+
+A random sentence from the predefined list is chosen and trimmed if it exceeds the required length.
+If the sentence is shorter, additional random sentences are concatenated until the required token count is met.
+
+If `ignore_eos` is true, the response always reaches the maximum allowed length.
+
+The `finish_reason` is set to LENGTH if the response length equals the maximum; otherwise, it is set to STOP.
+
+
+#### Use responses dataset for response generation
+If `dataset-url` is set on the command line, the dataset is downloaded to the location specified by `dataset-path`.
+
+If a valid dataset exists in the `dataset-path`, it is used for response selection.
+The request prompt is hashed, and this value is matched against dataset entries.
+If all matches are longer than the maximum response length, a random match is selected and then trimmed.
+
+If `ignore_eos` is true and no match meets the required length, the response is completed with random tokens from the predefined list.
+
+If the prompt hash is not present in the dataset, a random response of length ≤ maximum is selected;
+if all responses are longer, a random response is chosen and trimmed.
+
@@ -660,6 +660,10 @@ func (c *Configuration) validate() error {
 		return errors.New("dataset-path is required when dataset-url is set")
 	}
 
+	if c.Mode == ModeEcho && (c.DatasetPath != "" || c.DatasetURL != "") {
+		return errors.New("dataset cannot be defined in echo mode")
+	}
+
 	return nil
 }
 
 
@@ -532,6 +532,12 @@ var _ = Describe("Simulator configuration", func() {
 				"--config", "../../manifests/config.yaml"},
 			expectedError: "fake metrics request-max-generation-tokens cannot contain negative values",
 		},
+		{
+			name: "invalid echo mode with dataset",
+			args: []string{"random", "--model", "test", "--dataset-path", "my/path",
+				"--mode", "echo"},
+			expectedError: "dataset cannot be defined in echo mode",
+		},
 	}
 
 	for _, test := range invalidTests {