@@ -29,20 +29,31 @@ lighteval vllm \
2929 " leaderboard|truthfulqa:mc|0|0"
3030```
3131
32- Available arguments for ` vllm ` can be found in the ` VLLMModelConfig ` :
33-
34- - ** pretrained** (str): HuggingFace Hub model ID name or the path to a pre-trained model to load.
35- - ** gpu_memory_utilisation** (float): The fraction of GPU memory to use.
36- - ** revision** (str): The revision of the model.
37- - ** dtype** (str, None): The data type to use for the model.
38- - ** tensor_parallel_size** (int): The number of tensor parallel units to use.
39- - ** data_parallel_size** (int): The number of data parallel units to use.
40- - ** max_model_length** (int): The maximum length of the model.
41- - ** swap_space** (int): The CPU swap space size (GiB) per GPU.
42- - ** seed** (int): The seed to use for the model.
43- - ** trust_remote_code** (bool): Whether to trust remote code during model loading.
44- - ** add_special_tokens** (bool): Whether to add special tokens to the input sequences.
45- - ** multichoice_continuations_start_space** (bool): Whether to add a space at the start of each continuation in multichoice generation.
32+ ## Use a config file
33+
34+ For more advanced configurations, you can use a config file for the model.
35+ An example of a config file is shown below and can be found at ` examples/model_configs/vllm_model_config.yaml ` .
36+
37+ ``` bash
38+ lighteval vllm \
39+ " examples/model_configs/vllm_model_config.yaml" \
40+ " leaderboard|truthfulqa:mc|0|0"
41+ ```
42+
43+ ``` yaml
44+ model : # Model specific parameters
45+ base_params :
46+ model_args : " pretrained=HuggingFaceTB/SmolLM-1.7B,revision=main,dtype=bfloat16" # Model args that you would pass in the command line
47+ generation : # Generation specific parameters
48+ temperature : 0.3
49+ repetition_penalty : 1.0
50+ frequency_penalty : 0.0
51+ presence_penalty : 0.0
52+ seed : 42
53+ top_k : 0
54+ min_p : 0.0
55+ top_p : 0.9
56+ ` ` `
4657
4758> [!WARNING]
4859> In the case of OOM issues, you might need to reduce the context size of the
0 commit comments