azure-sdk
diff --git a/‎common/config/rush/pnpm-lock.yaml
Lines changed: 31 additions & 7 deletions b/‎common/config/rush/pnpm-lock.yaml
Lines changed: 31 additions & 7 deletions
diff --git a/‎sdk/openai/openai/CHANGELOG.md
Lines changed: 7 additions & 6 deletions b/‎sdk/openai/openai/CHANGELOG.md
Lines changed: 7 additions & 6 deletions
diff --git a/‎sdk/openai/openai/README.md
Lines changed: 61 additions & 7 deletions b/‎sdk/openai/openai/README.md
Lines changed: 61 additions & 7 deletions
diff --git a/‎sdk/openai/openai/assets.json
Lines changed: 1 addition & 1 deletion b/‎sdk/openai/openai/assets.json
Lines changed: 1 addition & 1 deletion
diff --git a/‎sdk/openai/openai/assets/audio/countdown.flac
347 KB b/‎sdk/openai/openai/assets/audio/countdown.flac
347 KB
diff --git a/‎sdk/openai/openai/assets/audio/countdown.m4a
162 KB b/‎sdk/openai/openai/assets/audio/countdown.m4a
162 KB
diff --git a/‎sdk/openai/openai/assets/audio/countdown.mp3
471 KB b/‎sdk/openai/openai/assets/audio/countdown.mp3
471 KB
diff --git a/‎sdk/openai/openai/assets/audio/countdown.mp4
134 KB b/‎sdk/openai/openai/assets/audio/countdown.mp4
134 KB
diff --git a/‎sdk/openai/openai/assets/audio/countdown.mpeg
600 KB b/‎sdk/openai/openai/assets/audio/countdown.mpeg
600 KB
diff --git a/‎sdk/openai/openai/assets/audio/countdown.mpga
471 KB b/‎sdk/openai/openai/assets/audio/countdown.mpga
471 KB
@@ -1,17 +1,18 @@
 # Release History
 
-## 1.0.0-beta.6 (Unreleased)
+## 1.0.0-beta.6 (2023-09-21)
 
 ### Features Added
 
-### Breaking Changes
+- Introduces speech to text and translation capabilities for a wide variety of audio file formats.
+  - Adds `getAudioTranscription` and `getAudioTranslation` methods for transcribing and translating audio files. The result can be either a simple JSON structure with just a `text` field or a more detailed JSON structure containing the text alongside additional information. In addition, VTT (Web Video Text Tracks), SRT (SubRip Text), and plain text formats are also supported. The type of the result depends on the `format` parameter if specified, otherwise, a simple JSON output is assumed. The methods could take as input an optional text prompt to guide the model's style or continue a previous audio segment. The language of the prompt should match that of the audio file.
+  - The available model at the time of this release supports the following list of audio file formats: m4a, mp3, wav, ogg, flac, webm, mp4, mpga, mpeg, and oga.
 
 ### Bugs Fixed
 
-- Return `usage` information when available.
-- Return `error` information in `ContentFilterResults` when available.
-
-### Other Changes
+- Returns `usage` information when available.
+- Fixes a bug where errors weren't properly being thrown from the streaming methods.
+- Returns `error` information in `ContentFilterResults` when available.
 
 ## 1.0.0-beta.5 (2023-08-25)
 
 
@@ -6,10 +6,12 @@ non-Azure OpenAI inference endpoint, making it a great choice for even non-Azure
 
 Use the client library for Azure OpenAI to:
 
-* [Create a completion for text][msdocs_openai_completion]
-* [Create a chat completion with ChatGPT][msdocs_openai_chat_completion]
+* [Create a completion for text][get_completions_sample]
+* [Create a chat completion with ChatGPT][list_chat_completion_sample]
 * [Create a text embedding for comparisons][msdocs_openai_embedding]
-* [Use your own data with Azure OpenAI][msdocs_openai_custom_data]
+* [Use your own data with Azure OpenAI][byod_sample]
+* [Generate images][get_images_sample]
+* [Transcribe and Translate audio files][transcribe_audio_sample]
 
 Azure OpenAI is a managed service that allows developers to deploy, tune, and generate content from OpenAI models on Azure resources.
 
@@ -20,6 +22,7 @@ Checkout the following examples:
 - [Summarize Text](#summarize-text-with-completion)
 - [Generate Images](#generate-images-with-dall-e-image-generation-models)
 - [Analyze Business Data](#analyze-business-data)
+- [Transcribe and Translate audio files](#transcribe-and-translate-audio-files)
 
 Key links:
 
@@ -140,6 +143,10 @@ async function main(){
     console.log(choice.text);
   }
 }
+
+main().catch((err) => {
+  console.error("The sample encountered an error:", err);
+});
 ```
 
 ## Examples
@@ -179,6 +186,10 @@ async function main(){
     }
   }
 }
+
+main().catch((err) => {
+  console.error("The sample encountered an error:", err);
+});
 ```
 
 ### Generate Multiple Completions With Subscription Key
@@ -212,6 +223,10 @@ async function main(){
     console.log(`Chatbot: ${completion}`);
   }
 }
+
+main().catch((err) => {
+  console.error("The sample encountered an error:", err);
+});
 ```
 
 ### Summarize Text with Completion
@@ -254,6 +269,9 @@ async function main(){
   console.log(`Summarization: ${completion}`);
 }
 
+main().catch((err) => {
+  console.error("The sample encountered an error:", err);
+});
 ```
 ### Generate images with DALL-E image generation models
 
@@ -276,6 +294,10 @@ async function main() {
     console.log(`Image generation result URL: ${image.url}`);
   }
 }
+
+main().catch((err) => {
+  console.error("The sample encountered an error:", err);
+});
 ```
 
 ### Analyze Business Data
@@ -285,7 +307,7 @@ This example generates chat responses to input chat questions about your busines
 
 ```javascript
 const { OpenAIClient } = require("@azure/openai");
-const { DefaultAzureCredential } = require("@azure/identity")
+const { DefaultAzureCredential } = require("@azure/identity");
 
 async function main(){
   const endpoint = "https://myaccount.openai.azure.com/";
@@ -323,6 +345,36 @@ async function main(){
     }
   }
 }
+
+main().catch((err) => {
+  console.error("The sample encountered an error:", err);
+});
+```
+
+### Transcribe and translate audio files
+
+The speech to text and translation capabilities of Azure OpenAI can be used to transcribe and translate a wide variety of audio file formats. The following example shows how to use the `getAudioTranscription` method to transcribe audio into the language the audio is in. You can also translate and transcribe the audio into English using the `getAudioTranslation` method.
+
+The audio file can be loaded into memory using the NodeJS file system APIs. In the browser, the file can be loaded using the `FileReader` API and the output of `arrayBuffer` instance method can be passed to the `getAudioTranscription` method.
+
+```js
+const { OpenAIClient, AzureKeyCredential } = require("@azure/openai");
+const fs = require("fs/promises");
+
+async function main() {
+  console.log("== Transcribe Audio Sample ==");
+
+  const client = new OpenAIClient(endpoint, new AzureKeyCredential(azureApiKey));
+  const deploymentName = "whisper-deployment";
+  const audio = await fs.readFile("< path to an audio file >");
+  const result = await client.getAudioTranscription(deploymentName, audio);
+
+  console.log(`Transcription: ${result.text}`);
+}
+
+main().catch((err) => {
+  console.error("The sample encountered an error:", err);
+});
 ```
 
 ## Troubleshooting
@@ -340,9 +392,11 @@ setLogLevel("info");
 For more detailed instructions on how to enable logs, you can look at the [@azure/logger package docs](https://github.com/Azure/azure-sdk-for-js/tree/main/sdk/core/logger).
 
 <!-- LINKS -->
-[msdocs_openai_completion]: https://github.com/Azure/azure-sdk-for-js/blob/main/sdk/openai/openai/samples/v1-beta/javascript/completions.js
-[msdocs_openai_chat_completion]: https://github.com/Azure/azure-sdk-for-js/blob/main/sdk/openai/openai/samples/v1-beta/javascript/listChatCompletions.js
-[msdocs_openai_custom_data]: https://github.com/Azure/azure-sdk-for-js/blob/main/sdk/openai/openai/samples-dev/bringYourOwnData.ts
+[get_completions_sample]: https://github.com/Azure/azure-sdk-for-js/blob/main/sdk/openai/openai/samples/v1-beta/javascript/completions.js
+[list_chat_completion_sample]: https://github.com/Azure/azure-sdk-for-js/blob/main/sdk/openai/openai/samples/v1-beta/javascript/listChatCompletions.js
+[byod_sample]: https://github.com/Azure/azure-sdk-for-js/blob/main/sdk/openai/openai/samples/v1-beta/javascript/bringYourOwnData.js
+[get_images_sample]: https://github.com/Azure/azure-sdk-for-js/blob/main/sdk/openai/openai/samples/v1-beta/javascript/getImages.js
+[transcribe_audio_sample]: https://github.com/Azure/azure-sdk-for-js/tree/openai/add-whisper/sdk/openai/openai/samples-dev/audioTranscription.ts
 [msdocs_openai_embedding]: https://learn.microsoft.com/azure/cognitive-services/openai/concepts/understand-embeddings
 [azure_openai_completions_docs]: https://learn.microsoft.com/azure/cognitive-services/openai/how-to/completions
 [defaultazurecredential]: https://github.com/Azure/azure-sdk-for-js/tree/main/sdk/identity/identity#defaultazurecredential
 
@@ -2,5 +2,5 @@
   "AssetsRepo": "Azure/azure-sdk-assets",
   "AssetsRepoPrefixPath": "js",
   "TagPrefix": "js/openai/openai",
-  "Tag": "js/openai/openai_353545d522"
+  "Tag": "js/openai/openai_85d9317957"
 }
Original file line number	Diff line number	Diff line change
`@@ -2,5 +2,5 @@`
`2`	`2`	`"AssetsRepo": "Azure/azure-sdk-assets",`
`3`	`3`	`"AssetsRepoPrefixPath": "js",`
`4`	`4`	`"TagPrefix": "js/openai/openai",`
`5`		`- "Tag": "js/openai/openai_353545d522"`
	`5`	`+ "Tag": "js/openai/openai_85d9317957"`
`6`	`6`	`}`