
Commit 6517503

Merge pull request #471 from needuv/patch-1
Update Evaluation SDK Docs
2 parents: 0e60c84 + 1356f51

2 files changed (+11, -12 lines)


articles/ai-studio/how-to/develop/evaluate-sdk.md

Lines changed: 9 additions & 10 deletions
````diff
@@ -87,17 +87,16 @@ When using AI-assisted performance and quality metrics, you must specify a GPT m
 You can run the built-in evaluators by importing the desired evaluator class. Ensure that you set your environment variables.
 ```python
 import os
-from promptflow.core import AzureOpenAIModelConfiguration
 
 # Initialize Azure OpenAI Connection with your environment variables
-model_config = AzureOpenAIModelConfiguration(
-    azure_endpoint=os.environ.get("AZURE_OPENAI_ENDPOINT"),
-    api_key=os.environ.get("AZURE_OPENAI_API_KEY"),
-    azure_deployment=os.environ.get("AZURE_OPENAI_DEPLOYMENT"),
-    api_version=os.environ.get("AZURE_OPENAI_API_VERSION"),
-)
+model_config = {
+    "azure_endpoint": os.environ.get("AZURE_OPENAI_ENDPOINT"),
+    "api_key": os.environ.get("AZURE_OPENAI_API_KEY"),
+    "azure_deployment": os.environ.get("AZURE_OPENAI_DEPLOYMENT"),
+    "api_version": os.environ.get("AZURE_OPENAI_API_VERSION"),
+}
 
-from azure.ai.evaluation.evaluators import RelevanceEvaluator
+from azure.ai.evaluation import RelevanceEvaluator
 
 # Initialzing Relevance Evaluator
 relevance_eval = RelevanceEvaluator(model_config)
@@ -131,7 +130,7 @@ azure_ai_project = {
     "project_name": "<project_name>",
 }
 
-from azure.ai.evaluation.evaluators import ViolenceEvaluator
+from azure.ai.evaluation import ViolenceEvaluator
 
 # Initializing Violence Evaluator with project information
 violence_eval = ViolenceEvaluator(azure_ai_project)
@@ -329,7 +328,7 @@ After logging your custom evaluator to your AI Studio project, you can view it i
 After you spot-check your built-in or custom evaluators on a single row of data, you can combine multiple evaluators with the `evaluate()` API on an entire test dataset. In order to ensure the `evaluate()` can correctly parse the data, you must specify column mapping to map the column from the dataset to key words that are accepted by the evaluators. In this case, we specify the data mapping for `ground_truth`.
 
 ```python
-from azure.ai.evaluation.evaluate import evaluate
+from azure.ai.evaluation import evaluate
 
 result = evaluate(
     data="data.jsonl", # provide your data here
````

articles/ai-studio/how-to/develop/simulator-interaction-data.md

Lines changed: 2 additions & 2 deletions
````diff
@@ -306,8 +306,8 @@ The `AdversarialSimulator` supports a range of scenarios, hosted in the service,
 | Text Rewrite | `ADVERSARIAL_REWRITE` |1000 |Hateful and unfair content, Sexual content, Violent content, Self-harm-related content, Direct Attack (UPIA) Jailbreak |
 | Ungrounded Content Generation | `ADVERSARIAL_CONTENT_GEN_UNGROUNDED` |496 | Groundedness |
 | Grounded Content Generation | `ADVERSARIAL_CONTENT_GEN_GROUNDED` |475 |Groundedness |
-| Protected Material | `ADVERSARIAL_PROTECTED_MATERIAL` | 200 | Protected Material |
-|Indirect Attack (XPIA) Jailbreak | `ADVERSARIAL_INDIRECT_JAILBREAK` | 200 | Indirect Attack (XPIA) Jailbreak|
+| Protected Material | `ADVERSARIAL_PROTECTED_MATERIAL` | 306 | Protected Material |
+|Indirect Attack (XPIA) Jailbreak | `ADVERSARIAL_INDIRECT_JAILBREAK` | 100 | Indirect Attack (XPIA) Jailbreak|
 
 ### Simulating jailbreak attacks
````
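
The simulator doc change only updates the dataset limits for two scenarios (Protected Material: 200 → 306, Indirect Attack (XPIA) Jailbreak: 200 → 100). For context, a hedged sketch of selecting one of these scenario enums with the `AdversarialSimulator` follows; the echo-style `callback` target and the exact call signature are assumptions based on the surrounding article, not part of this diff:

```python
import asyncio
from azure.identity import DefaultAzureCredential
from azure.ai.evaluation.simulator import AdversarialSimulator, AdversarialScenario

azure_ai_project = {
    "subscription_id": "<subscription_id>",
    "resource_group_name": "<resource_group_name>",
    "project_name": "<project_name>",
}

# Hypothetical stand-in target; replace with a call into your own application.
async def callback(messages, stream=False, session_state=None, context=None):
    last_user_message = messages["messages"][-1]["content"]
    reply = {"role": "assistant", "content": f"Echo: {last_user_message}"}
    messages["messages"].append(reply)
    return {
        "messages": messages["messages"],
        "stream": stream,
        "session_state": session_state,
        "context": context,
    }

async def main():
    simulator = AdversarialSimulator(
        azure_ai_project=azure_ai_project, credential=DefaultAzureCredential()
    )
    # The table above caps ADVERSARIAL_PROTECTED_MATERIAL at 306 prompts,
    # so keep max_simulation_results at or below that limit.
    outputs = await simulator(
        scenario=AdversarialScenario.ADVERSARIAL_PROTECTED_MATERIAL,
        target=callback,
        max_simulation_results=10,
    )
    print(outputs)

asyncio.run(main())
```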
