
Commit 84623e1

Merge branch 'main' of https://github.com/Azure/azureml-examples into vaibhj/sparkPipeline
2 parents 701483a + d5adb29 commit 84623e1

File tree

27 files changed: +1359 -50 lines changed

Lines changed: 33 additions & 0 deletions
@@ -0,0 +1,33 @@
# Distillation with CLI (Summarization)

## 1. Create the Job

Ensure you have the proper setup.
1. Run `az version` and check that the `ml` extension is installed. The `ml` version should be greater than or equal to 2.32.0.
2. If the `ml` extension is not installed, run `az extension add -n ml` (see the example below).
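
For reference, a minimal sketch of the setup commands (the `az extension update` call is only needed if an older `ml` version is already installed):

```text
az version
az extension add -n ml
az extension update -n ml
```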

Run the distillation CLI command, pointing to the .yaml file in this folder, and fill in the Azure ML IDs needed:

```text
az ml job create --file distillation_summarization.yaml --workspace-name [YOUR_AZURE_WORKSPACE] --resource-group [YOUR_AZURE_RESOURCE_GROUP] --subscription [YOUR_AZURE_SUBSCRIPTION]
```
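
For illustration only, with hypothetical placeholder values filled in, the call might look like:

```text
az ml job create --file distillation_summarization.yaml --workspace-name my-workspace --resource-group my-resource-group --subscription 00000000-0000-0000-0000-000000000000
```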

**Note:** To see how the train and validation files were created, see section 2 of this [notebook](/sdk/python/foundation-models/system/distillation/summarization/distillation_summarization.ipynb).

## 2. Deploy to Endpoint

Once the distilled model is ready, you can deploy it through the UI or the CLI.

### UI Deployment
1. Navigate to the `model` tab in [ml studio](https://ml.azure.com) or to the `Finetuning` tab in the [ai platform](https://ai.azure.com).
2. If using ml studio, locate the model using the `name` of the `registered_model` in the yaml file used to create this job, then select Deploy to deploy a serverless endpoint. If using the ai platform, search for the name of the job, which in this example is `Distillation-summarization-llama`, click on that name, and select Deploy to deploy a serverless endpoint.

### CLI Deployment
Fill out the serverless_endpoint.yaml file in this folder. The necessary information can be found by:
1. Navigating to the `model` tab in [ml studio](https://ml.azure.com).
2. Selecting the model whose `name` matches the `registered_model` name in the yaml file used to create this job. In this example, the name to use is `llama-summarization-distilled`.
3. Using the model's `asset_id` to fill out the `model_id` in the yaml.

With the information filled out, run the command:

```text
az ml serverless-endpoint create -f serverless_endpoint.yaml
```
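
Once the endpoint is created, you need its URI and key to send requests. A minimal sketch, assuming the `show` and `get-credentials` subcommands available in recent `ml` extension versions and using the `name` from serverless_endpoint.yaml:

```text
az ml serverless-endpoint show -n llama-summarization-distilled
az ml serverless-endpoint get-credentials -n llama-summarization-distilled
```
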
Lines changed: 59 additions & 0 deletions
@@ -0,0 +1,59 @@
type: distillation

name: "Distillation-summarization-llama"
description: "Distill student model using a teacher model"
experiment_name: "Distillation-summarization"

# Data Generation Properties
data_generation_type: label_generation
data_generation_task_type: summarization

# Input data
training_data:
  type: uri_file
  path: ./train_summarization.jsonl
validation_data:
  type: uri_file
  path: ./validation_summarization.jsonl

# Teacher model serverless endpoint information
teacher_model_endpoint_connection:
  type: serverless
  name: Meta-Llama-3-1-405B-Instruct-vkn
  endpoint: https://Meta-Llama-3-1-405B-Instruct-vkn.westus3.models.ai.azure.com/chat/completions
  api_key: EXAMPLE_API_KEY

# Model ID
student_model: azureml://registries/azureml-meta/models/Meta-Llama-3.1-8B-Instruct/versions/2

# Output distilled model
outputs:
  registered_model:
    type: mlflow_model
    name: llama-summarization-distilled

# Teacher model related properties (OPTIONAL)
teacher_model_settings:
  inference_parameters:
    temperature: 0.8
    max_tokens: 1024
    top_p: 0.95
  endpoint_request_settings:
    request_batch_size: 10
    min_endpoint_success_ratio: 0.7

# System prompt settings (OPTIONAL)
prompt_settings:
  enable_chain_of_density: true
  max_len_summary: 80

# For finetuning (OPTIONAL)
hyperparameters:
  learning_rate_multiplier: "0.2"
  n_epochs: "5"
  batch_size: "2"

# Resource for Data Generation Step (OPTIONAL)
resources:
  instance_type: Standard_D2_v2
Lines changed: 2 additions & 0 deletions
@@ -0,0 +1,2 @@
name: llama-summarization-distilled
model_id: azureml://locations/{AI_PROJECT_LOCATION}/workspaces/{WORKSPACE_ID}/models/llama-summarization-distilled/versions/{VERSION}

cli/foundation-models/system/distillation/summarization/validation_summarization.jsonl

Lines changed: 400 additions & 0 deletions
Large diffs are not rendered by default.

cli/jobs/automl-standalone-jobs/cli-automl-classification-task-bankmarketing/cli-automl-classification-task-bankmarketing.yml

Lines changed: 1 addition & 1 deletion
@@ -30,7 +30,7 @@ test_data:
 limits:
   timeout_minutes: 180
   max_trials: 40
-  max_concurrent_trials: 5
+  # max_concurrent_trials: 5
   trial_timeout_minutes: 20
   enable_early_termination: true
   exit_score: 0.92

sdk/python/dev-requirements.txt

Lines changed: 1 addition & 1 deletion
@@ -10,4 +10,4 @@ tensorflow==2.9.3
 tensorflow-hub==0.15.0
 transformers==4.34.0
 keras==2.9
-jupyter-client==7.4.9
+jupyter-client==7.4.9

sdk/python/foundation-models/meta-llama3/langchain.ipynb

Lines changed: 3 additions & 3 deletions
@@ -4,9 +4,9 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-   "# Use LangChain with Meta Llama 3 in Azure AI and Azure ML\n",
+   "# Use LangChain with Meta Llama 3 family of models in Azure AI and Azure ML\n",
    "\n",
-   "You can use Meta-Llama-3 models deployed in Azure AI and Azure ML with `langchain` to create more sophisticated intelligent applications. Use `langchain_community` package with the Azure Machine Learning integration."
+   "You can use Meta-Llama-3 family of models deployed in Azure AI and Azure ML with `langchain` to create more sophisticated intelligent applications. Use `langchain_community` package with the Azure Machine Learning integration."
   ]
  },
  {
@@ -20,7 +20,7 @@
    "* Register for a valid Azure account with subscription \n",
    "* Make sure you have access to [Azure AI Studio](https://learn.microsoft.com/en-us/azure/ai-studio/what-is-ai-studio?tabs=home)\n",
    "* Create a project and resource group\n",
-   "* Select Meta-Llama-3 models from Model catalog. This example assumes you are deploying `Meta-Llama-3-70B-Instruct`.\n",
+   "* Select one of the Meta-Llama-3 family of models from the Model catalog. This example assumes you are deploying `Meta-Llama-3-70B-Instruct`.\n",
    "\n",
    " > Notice that some models may not be available in all the regions in Azure AI and Azure Machine Learning. On those cases, you can create a workspace or project in the region where the models are available and then consume it with a connection from a different one. To learn more about using connections see [Consume models with connections](https://learn.microsoft.com/en-us/azure/ai-studio/how-to/deployments-connections)\n",
    "\n",

sdk/python/foundation-models/system/distillation/README.md

Lines changed: 2 additions & 0 deletions
@@ -47,9 +47,11 @@ We currently support numerous task types for model distillation. To view example
 - [Conversation](./conversation/distillation_conversational_task.ipynb)
 - [NLU QA (Natural Language Understanding Question and Answer)](./nlu_qa/distillation_nlu_qa_task.ipynb)
 - [Math](./math/distillation_math.ipynb)
+- [Summarization](./summarization/distillation_summarization.ipynb)
 
 ### CLI Examples
 - [NLI (Natural Language Interpretation)](/cli/foundation-models/system/distillation/nli/README.md)
 - [Conversation](/cli/foundation-models/system/distillation/conversation/README.md)
 - [NLU QA (Natural Language Understanding Question and Answer)](/cli/foundation-models/system/distillation/nlu_qa/README.md)
 - [Math](/cli/foundation-models/system/distillation/math/README.md)
+- [Summarization](/cli/foundation-models/system/distillation/summarization/README.md)

sdk/python/foundation-models/system/distillation/conversation/distillation_conversational_task.ipynb

Lines changed: 1 addition & 1 deletion
@@ -349,7 +349,7 @@
    "\n",
    "# If validation data was registered to workspace already, navigate to the Data tab, select the data to use and use the 'Named asset URI'.\n",
    "# Example of the format is seen below\n",
-   "# train_data = \"azureml:validation_conversation_quora:1\""
+   "# valid_data = \"azureml:validation_conversation_quora:1\""
    ]
   },
   {

sdk/python/foundation-models/system/distillation/math/distillation_math.ipynb

Lines changed: 1 addition & 1 deletion
@@ -349,7 +349,7 @@
    "valid_data = Input(type=AssetTypes.URI_FILE, path=valid_data_path)\n",
    "\n",
    "# If validation data was registered to workspace already, navigate to the Data tab, select the data to use and use the 'Named asset URI'\n",
-   "# train_data = \"azureml:math_valid_multi_arith:1\""
+   "# valid_data = \"azureml:math_valid_multi_arith:1\""
    ]
   },
   {
