articles/azure-video-indexer/audio-effects-detection-overview.md
# Audio effects detection
Audio effects detection is an Azure Video Indexer feature that detects insights on various acoustic events and classifies them into acoustic categories, such as laughter, crowd reactions, alarms, and sirens.
When working on the website, the instances are displayed in the Insights tab. They can also be generated in a categorized list in a JSON file that includes the category ID, type, name, and instances per category, together with the specific timeframes and confidence score.
This article discusses audio effects detection and the key considerations for making use of this technology responsibly. There are many things you need to consider when deciding how to use and implement an AI-powered feature:
* Will this feature perform well in my scenario? Before deploying audio effects detection into your scenario, test how it performs using real-life data and make sure it can deliver the accuracy you need.
* Are we equipped to identify and respond to errors? AI-powered products and features won't be 100% accurate, so consider how you'll identify and respond to any errors that may occur.
To display the JSON file, do the following:
1. Select Download -> Insights (JSON).
1. Copy the `audioEffects` element, under `insights`, and paste it into an online JSON viewer.
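For illustration, an `audioEffects` element has roughly the following shape. The field names and values here (`Siren`, `instances`, `confidence`, the timestamp fields) are assumptions based on the description in this article, not a verbatim API response:

```json
{
  "audioEffects": [
    {
      "id": 1,
      "type": "Siren",
      "instances": [
        {
          "confidence": 0.87,
          "start": "0:00:00",
          "end": "0:00:03"
        }
      ]
    }
  ]
}
```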
During the audio effects detection procedure, audio in a media file is processed as follows:

|Component|Definition|
|---|---|
|Source file|The user uploads the source file for indexing.|
|Segmentation|The audio is analyzed, non-speech audio is identified and then split into short overlapping intervals.|
|Classification|An AI process analyzes each segment and classifies its contents into event categories such as crowd reaction or laughter. A probability list is then created for each event category according to department-specific rules.|
|Confidence level|The estimated confidence level of each audio effect is calculated as a range of 0 to 1. The confidence score represents the certainty in the accuracy of the result. For example, an 82% certainty is represented as a 0.82 score.|
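As an illustration of how the confidence score might be consumed downstream, here's a minimal Python sketch that filters detected effects by a confidence threshold. The JSON field names (`audioEffects`, `type`, `instances`, `confidence`) are assumptions based on this article's description, not a verbatim API contract:

```python
import json

# Illustrative insights payload. The field names ("audioEffects", "type",
# "instances", "confidence") are assumptions based on the article's
# description, not a verbatim Video Indexer response.
INSIGHTS = json.loads("""
{
  "audioEffects": [
    {"id": 1, "type": "Siren",
     "instances": [{"confidence": 0.87, "start": "0:00:00", "end": "0:00:03"}]},
    {"id": 2, "type": "Laughter",
     "instances": [{"confidence": 0.45, "start": "0:01:10", "end": "0:01:12"}]}
  ]
}
""")

def effects_above(insights: dict, threshold: float) -> list:
    """Return (type, start, end) for each instance at or above the threshold."""
    hits = []
    for effect in insights.get("audioEffects", []):
        for instance in effect.get("instances", []):
            if instance["confidence"] >= threshold:
                hits.append((effect["type"], instance["start"], instance["end"]))
    return hits

# Keep only detections the model is at least 80% certain about.
print(effects_above(INSIGHTS, 0.8))
```

Because the score is probabilistic, a threshold like this is a policy decision: higher values trade recall for precision.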
## Example use cases
- Companies with a large video archive can improve accessibility by transcribing nonspeech effects, offering more context for hearing-impaired audiences.

- Improved efficiency when creating raw data for content creators. For example, in media and entertainment, important moments in promos and trailers such as laughter, crowd reactions, gunshots, or explosions can be identified.
- Detecting and classifying gunshots, explosions, and glass shattering in a smart-city system, or in other public environments that include cameras and microphones, to offer fast and accurate detection of violent incidents.
## Considerations and limitations when choosing a use case
- Avoid using very short or low-quality audio. Audio effects detection provides probabilistic and partial data on detected nonspeech audio events. For accuracy, audio effects detection requires at least 2 seconds of clear nonspeech audio. Voice commands or singing aren't supported.

- Avoid using audio with loud background music, or music with repetitive and/or linearly scanned frequency. Audio effects detection is designed for nonspeech audio only and therefore can't classify events in loud music. Music with repetitive and/or linearly scanned frequency may be incorrectly classified as an alarm or siren.
- Carefully consider the methods of usage in law enforcement and similar institutions. To promote more accurate probabilistic data, review the following:

    - Audio effects can be detected in nonspeech segments only.
    - The duration of a nonspeech section should be at least 2 seconds.
    - Low-quality audio might impact the detection results.
    - Events in loud background music aren't classified.
    - Music with repetitive and/or linearly scanned frequency might be incorrectly classified as an alarm or siren.
    - Knocking on a door or slamming a door might be labeled as a gunshot or explosion.
    - Prolonged shouting or sounds of physical human effort might be incorrectly classified.
    - A group of people laughing might be classified as both laughter and crowd.
    - Only natural, nonsynthetic gunshot and explosion sounds are supported.
When used responsibly and carefully, Azure Video Indexer is a valuable tool for many industries. To respect the privacy and safety of others, and to comply with local and global regulations, we recommend the following:
- Always respect an individual’s right to privacy, and only ingest audio for lawful and justifiable purposes.
- Don't purposely disclose inappropriate audio of young children or family members of celebrities, or other content that may be detrimental or pose a threat to an individual’s personal freedom.
- Commit to respecting and promoting human rights in the design and deployment of your analyzed audio.
- When using third-party materials, be aware of any existing copyrights or permissions required before distributing content derived from them.
- Always seek legal advice when using audio from unknown sources.
- Be aware of any applicable laws or regulations that exist in your area regarding processing, analyzing, and sharing audio containing people.
- Keep a human in the loop. Don't use any solution as a replacement for human oversight and decision-making.
- Fully examine and review the potential of any AI model you're using to understand its capabilities and limitations.
articles/azure-video-indexer/insights-overview.md
When a video is indexed, Azure Video Indexer analyzes the video and audio content by running 30+ AI models, generating rich insights. Insights contain an aggregated view of the data: transcripts, optical character recognition elements (OCRs), face, topics, emotions, etc. Once the video is indexed and analyzed, Azure Video Indexer produces a JSON content that contains details of the video insights. For example, each insight type includes instances of time ranges that show when the insight appears in the video.
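As a sketch of how such a JSON result might be traversed, the following Python assumes a simplified shape. The names used here (`insights`, `instances`, `start`, `end`) mirror the description above but are assumptions, not a verbatim API response:

```python
import json

# Simplified insights document; the field names are assumptions modeled on
# the article's description, not a verbatim Video Indexer response.
PAYLOAD = json.loads("""
{
  "insights": {
    "topics": [
      {"name": "Weather",
       "instances": [{"start": "0:00:05", "end": "0:00:20"},
                     {"start": "0:01:00", "end": "0:01:10"}]}
    ],
    "emotions": [
      {"type": "Joy", "instances": [{"start": "0:00:30", "end": "0:00:45"}]}
    ]
  }
}
""")

def time_ranges(insights: dict, insight_type: str) -> list:
    """Collect (start, end) pairs for every instance of one insight type."""
    ranges = []
    for item in insights.get(insight_type, []):
        for instance in item.get("instances", []):
            ranges.append((instance["start"], instance["end"]))
    return ranges

# When does the "topics" insight appear in the video?
print(time_ranges(PAYLOAD["insights"], "topics"))
```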
This article discusses Keywords and the key considerations for making use of this technology responsibly. There are many things you need to consider when deciding how to use and implement an AI-powered feature:
- Will this feature perform well in my scenario? Before deploying Keywords Extraction into your scenario, test how it performs using real-life data and make sure it can deliver the accuracy you need.
- Are we equipped to identify and respond to errors? AI-powered products and features won't be 100% accurate, so consider how you'll identify and respond to any errors that may occur.
## View the insight
Below are some considerations to keep in mind when using keywords extraction:
- When uploading a file, always use high-quality video content. The recommended maximum frame size is HD and the recommended frame rate is 30 FPS. A frame should contain no more than 10 people. When outputting frames from videos to AI models, only send around 2 or 3 frames per second. Processing 10 or more frames might delay the AI result.
- When uploading a file, always use high-quality audio and video content. At least 1 minute of spontaneous conversational speech is required to perform analysis. Audio effects are detected in non-speech segments only. The minimal duration of a non-speech section is 2 seconds. Voice commands and singing aren't supported.
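The frame-rate guidance above amounts to simple stride arithmetic. This hypothetical helper (not part of any Video Indexer SDK) shows one way to pick a sampling stride that downsamples a source to roughly 2-3 frames per second:

```python
def sampling_stride(source_fps: float, target_fps: float) -> int:
    """Frames to skip between samples so sampling approximates target_fps."""
    if source_fps <= 0 or target_fps <= 0:
        raise ValueError("frame rates must be positive")
    # Never return 0: a stride of at least 1 means "send every frame".
    return max(1, round(source_fps / target_fps))

# A 30 FPS source sampled down to ~3 FPS keeps every 10th frame,
# i.e. frame indices 0, 10, 20 within each second of video.
stride = sampling_stride(30, 3)
frames_in_one_second = list(range(0, 30, stride))
print(stride, frames_in_one_second)
```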
When used responsibly and carefully, Keywords is a valuable tool for many industries. To respect the privacy and safety of others, and to comply with local and global regulations, we recommend the following:
- Always respect an individual’s right to privacy, and only ingest media for lawful and justifiable purposes.
- Don't purposely disclose inappropriate media showing young children or family members of celebrities, or other content that may be detrimental or pose a threat to an individual’s personal freedom.
- Commit to respecting and promoting human rights in the design and deployment of your analyzed media.
- When using third-party materials, be aware of any existing copyrights or permissions required before distributing content derived from them.
- Always seek legal advice when using media from unknown sources.
- Always obtain appropriate legal and professional advice to ensure that your uploaded media is secured and has adequate controls to preserve the integrity of your content and to prevent unauthorized access.
- Provide a feedback channel that allows users and individuals to report issues with the service.
- Be aware of any applicable laws or regulations that exist in your area regarding processing, analyzing, and sharing media containing people.
- Keep a human in the loop. Don't use any solution as a replacement for human oversight and decision-making.
- Fully examine and review the potential of any AI model you're using to understand its capabilities and limitations.