Skip to content

Commit ea8985d

Browse files
Update datasets.md
Added recommended dataset profiles for four usecases
1 parent 678adea commit ea8985d

File tree

1 file changed

+6
-0
lines changed

1 file changed

+6
-0
lines changed

docs/datasets.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -39,6 +39,12 @@ GuideLLM supports several types of datasets, each with its own advantages and us
3939

4040
Synthetic datasets allow you to generate data on the fly with customizable parameters. This is useful for controlled experiments, stress testing, and simulating specific scenarios. For example, you might want to evaluate how a model handles long prompts or generates outputs with specific characteristics.
4141

42+
For different use cases, here are the recommendeded dataset profiles that can be passed in as arguments in GuideLLM:
43+
- Chat: --data "prompt_tokens=512,output_tokens=256"
44+
- RAG: --data "prompt_tokens=4096,output_tokens=512"
45+
- Summarization: --data "prompt_tokens=1024,output_tokens=256"
46+
- Code Generation: --data "prompt_tokens=512,output_tokens=512"
47+
4248
#### Example Commands
4349

4450
```bash

0 commit comments

Comments
 (0)