You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
|`score_columns`| Alternative: separate columns for each metric (e.g., `accuracy`, `helpfulness`) instead of a dict |`score_columns=["accuracy", "helpfulness"]`|
202
+
|`prompt_column`| Name of the prompt column in your dataframe (default: `"prompt"`) |`prompt_column="input"`|
203
+
|`model_column`| Name of the model column for single_model (default: `"model"`) |`model_column="llm_name"`|
204
+
|`model_response_column`| Name of the model response column for single_model (default: `"model_response"`) |`model_response_column="output"`|
205
+
|`question_id_column`| Name of the question_id column (default: `"question_id"` if column exists) |`question_id_column="qid"`|
143
206
144
207
### Side-by-Side Comparisons
145
208
@@ -159,6 +222,12 @@ Use the React frontend or other visualization tools to explore your results.
159
222
|--------|-------------|---------|
160
223
|`score`| Winner and metrics |`{"winner": "model_a", "helpfulness_a": 4.2, "helpfulness_b": 3.8}`|
161
224
|`score_columns`| Alternative: separate columns for each metric with `_a` and `_b` suffixes (e.g., `accuracy_a`, `accuracy_b`) |`score_columns=["accuracy_a", "accuracy_b", "helpfulness_a", "helpfulness_b"]`|
225
+
|`prompt_column`| Name of the prompt column in your dataframe (default: `"prompt"`) |`prompt_column="query"`|
226
+
|`model_a_column`| Name of the model_a column (default: `"model_a"`) |`model_a_column="model_1"`|
227
+
|`model_b_column`| Name of the model_b column (default: `"model_b"`) |`model_b_column="model_2"`|
228
+
|`model_a_response_column`| Name of the model_a_response column (default: `"model_a_response"`) |`model_a_response_column="response_1"`|
229
+
|`model_b_response_column`| Name of the model_b_response column (default: `"model_b_response"`) |`model_b_response_column="response_2"`|
230
+
|`question_id_column`| Name of the question_id column (default: `"question_id"` if column exists) |`question_id_column="qid"`|
0 commit comments