Lighteval lets you evaluate LLMs through Hugging Face's Inference Providers, on supported providers such as Black Forest Labs, Cerebras, Fireworks AI, Nebius, Together AI, and many more.
## Quick use

> [!WARNING]
> Do not forget to set your Hugging Face API key.
> You can set it using the `HF_TOKEN` environment variable or by using the `huggingface-cli` command.
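
As a quick sanity check before launching an evaluation, the presence of the token can be verified from Python. This is a minimal sketch, not part of Lighteval: the helper name `resolve_hf_token` is hypothetical, and it only inspects the `HF_TOKEN` environment variable (it does not look at credentials stored by `huggingface-cli`).

```python
import os


def resolve_hf_token(env=None):
    """Return the value of HF_TOKEN, or None if it is unset or blank.

    `env` defaults to os.environ; passing a plain dict makes the
    helper easy to exercise without touching the real environment.
    """
    env = os.environ if env is None else env
    token = env.get("HF_TOKEN", "").strip()
    return token or None


if __name__ == "__main__":
    if resolve_hf_token() is None:
        print("HF_TOKEN is not set; export it before running the evaluation.")
    else:
        print("HF_TOKEN found.")
```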

            help="config file path for the inference provider model, or a comma separated string of model args (model_name={},provider={},generation={temperature: 0.6})"
        ),
    ],
    tasks: Annotated[str, Argument(help="Comma-separated list of tasks to evaluate on.")],
    # === Common parameters ===
    system_prompt: Annotated[
        Optional[str], Option(help="Use system prompt for evaluation.", rich_help_panel=HELP_PANEL_NAME_4)
    ] = None,
    dataset_loading_processes: Annotated[
        int, Option(help="Number of processes to use for dataset loading.", rich_help_panel=HELP_PANEL_NAME_1)
    ] = 1,
    custom_tasks: Annotated[
        Optional[str], Option(help="Path to custom tasks directory.", rich_help_panel=HELP_PANEL_NAME_1)
    ] = None,
    num_fewshot_seeds: Annotated[
        int, Option(help="Number of seeds to use for few-shot evaluation.", rich_help_panel=HELP_PANEL_NAME_1)
    ] = 1,
    # === saving ===
    output_dir: Annotated[
        str, Option(help="Output directory for evaluation results.", rich_help_panel=HELP_PANEL_NAME_2)
    ] = "results",
    push_to_hub: Annotated[
        bool, Option(help="Push results to the huggingface hub.", rich_help_panel=HELP_PANEL_NAME_2)
    ] = False,
    push_to_tensorboard: Annotated[
        bool, Option(help="Push results to tensorboard.", rich_help_panel=HELP_PANEL_NAME_2)
    ] = False,
    public_run: Annotated[
        bool, Option(help="Push results and details to a public repo.", rich_help_panel=HELP_PANEL_NAME_2)
    ] = False,
    results_org: Annotated[
        Optional[str], Option(help="Organization to push results to.", rich_help_panel=HELP_PANEL_NAME_2)
    ] = None,
    save_details: Annotated[
        bool, Option(help="Save detailed, sample per sample, results.", rich_help_panel=HELP_PANEL_NAME_2)
    ] = False,
    # === debug ===
    max_samples: Annotated[
        Optional[int], Option(help="Maximum number of samples to evaluate on.", rich_help_panel=HELP_PANEL_NAME_3)
    ] = None,
    job_id: Annotated[
        int, Option(help="Optional job id for future reference.", rich_help_panel=HELP_PANEL_NAME_3)
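
The `help` text of the first argument describes a comma-separated string of model args such as `model_name={},provider={},generation={temperature: 0.6}`. To illustrate how a string of that shape can be broken into key/value pairs, here is a minimal sketch. It is not Lighteval's own parser: `split_model_args` is a hypothetical helper, the model and provider values are placeholders, and the assumption that commas inside braces belong to the nested value is mine.

```python
def split_model_args(arg_string):
    """Split 'k=v,k2={a: 1, b: 2}' into a dict of raw string values.

    Commas inside braces are treated as part of the nested value,
    so only top-level commas separate the key/value pairs.
    """
    parts, current, depth = [], [], 0
    for ch in arg_string:
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
        if ch == "," and depth == 0:
            # Top-level comma: close the current key=value pair.
            parts.append("".join(current))
            current = []
        else:
            current.append(ch)
    if current:
        parts.append("".join(current))

    result = {}
    for part in parts:
        key, sep, value = part.partition("=")
        if not sep:
            raise ValueError(f"expected key=value, got {part!r}")
        result[key.strip()] = value.strip()
    return result


# Placeholder values, for illustration only.
args = split_model_args(
    "model_name=org/model,provider=some-provider,generation={temperature: 0.6}"
)
print(args["generation"])
```

The nested `generation` value is left as a raw string here; turning it into a dictionary would need a further parsing step that depends on the exact syntax the tool accepts.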