You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A conversation is a python dictionary of a list of messages (which include content, role, and optionally context). The following is an example of a two-turn conversation.
99
+
A conversation is a Python dictionary of a list of messages (which include content, role, and optionally context). The following is an example of a two-turn conversation.
100
100
101
101
```json
102
102
{"conversation":
@@ -172,7 +172,7 @@ Here's an example of the result:
172
172
173
173
### Risk and safety evaluators
174
174
175
-
When you use AI-assisted risk and safety metrics, a GPT model isn't required. Instead of `model_config`, provide your `azure_ai_project` information. This accesses the Azure AI Studio safety evaluations back-end service, which provisions an GPT model specific to harms evaluation that can generate content risk severity scores and reasoning to enable the safety evaluators.
175
+
When you use AI-assisted risk and safety metrics, a GPT model isn't required. Instead of `model_config`, provide your `azure_ai_project` information. This accesses the Azure AI Studio safety evaluations back-end service, which provisions a GPT model specific to harms evaluation that can generate content risk severity scores and reasoning to enable the safety evaluators.
176
176
177
177
#### Region support
178
178
@@ -366,7 +366,7 @@ assistant: {{response}}
366
366
output:
367
367
```
368
368
369
-
You can create your own prompty-based evaluator and run it on a row of data:
369
+
You can create your own Prompty-based evaluator and run it on a row of data:
370
370
371
371
```python
372
372
withopen("apology.prompty") as fin:
@@ -548,7 +548,7 @@ result = evaluate(
548
548
549
549
If you have a list of queries that you'd like to run then evaluate, the `evaluate()` also supports a `target` parameter, which can send queries to an application to collect answers then run your evaluators on the resulting query and response.
550
550
551
-
A target can be any callable class in your directory. In this case we have a python script `askwiki.py` with a callable class `askwiki()` that we can set as our target. Given a dataset of queries we can send into our simple `askwiki` app, we can evaluate the relevance of the outputs. Ensure you specify the proper column mapping for your data in `"column_mapping"`. You can use `"default"` to specify column mapping for all evaluators.
551
+
A target can be any callable class in your directory. In this case we have a Python script `askwiki.py` with a callable class `askwiki()` that we can set as our target. Given a dataset of queries we can send into our simple `askwiki` app, we can evaluate the relevance of the outputs. Ensure you specify the proper column mapping for your data in `"column_mapping"`. You can use `"default"` to specify column mapping for all evaluators.
0 commit comments