Skip to content

Commit fc71adc

Browse files
committed
Merge branch 'master' of https://github.com/MicrosoftDocs/azure-docs-pr into two-azure-monitor-topics
2 parents 3af5b3d + 5ba4b12 commit fc71adc

File tree

62 files changed

+659
-381
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

62 files changed

+659
-381
lines changed

articles/cognitive-services/Computer-vision/concept-recognizing-text.md

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -22,10 +22,9 @@ Computer Vision provides a number of services that detect and extract printed or
2222

2323
The Read API detects text content in an image using our latest recognition models and converts the identified text into a machine-readable character stream. It's optimized for text-heavy images (such as documents that have been digitally scanned) and for images with a lot of visual noise. It will determine which recognition model to use for each line of text, supporting images with both printed and handwritten text. The Read API executes asynchronously because larger documents can take several minutes to return a result.
2424

25-
The Read operation maintains the original line groupings of recognized words in its output. Each line comes with bounding box coordinates, and each word within the line also has its own coordinates. If a word was recognized with low confidence, that information is conveyed as well. See the [Read API v2.0 reference docs](https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/operations/2afb498089f74080d7ef85eb) or [Read API v3.0 preview reference docs](https://go.microsoft.com/fwlink/?linkid=2118322) to learn more.
25+
The Read operation maintains the original line groupings of recognized words in its output. Each line comes with bounding box coordinates, and each word within the line also has its own coordinates. If a word was recognized with low confidence, that information is conveyed as well. See the [Read API v2.0 reference docs](https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/operations/2afb498089f74080d7ef85eb) or [Read API v3.0 reference docs](https://aka.ms/computer-vision-v3-ref) to learn more.
2626

27-
> [!NOTE]
28-
> This feature is only available for English and Spanish (v3.0 preview) text.
27+
The Read operation can recognize text in English, Spanish, German, French, Italian, Portuguese, and Dutch.
2928

3029
### Image requirements
3130

articles/cognitive-services/Computer-vision/language-support.md

Lines changed: 31 additions & 31 deletions
Original file line numberDiff line numberDiff line change
@@ -18,38 +18,38 @@ Some features of Computer Vision support multiple languages; any features not me
1818

1919
## Text recognition
2020

21-
Computer Vision can recognize text in many languages. Specifically, the [OCR](https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/operations/56f91f2e778daf14a499e1fc) API supports a variety of languages, whereas the [Read](https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/operations/2afb498089f74080d7ef85eb) API and [Recognize Text](https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/operations/587f2c6a154055056008f200) API only support English. See [Recognize printed and handwritten text](concept-recognizing-text.md) for more information on this functionality and the advantages of each API.
21+
Computer Vision can recognize text in many languages.The [OCR](https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/operations/56f91f2e778daf14a499e1fc) API and the [Read](https://aka.ms/computer-vision-v3-ref) API (v3.0) support several languages, and [Recognize Text](https://westus.dev.cognitive.microsoft.com/docs/services/5adf991815e1060e6355ad44/operations/587f2c6a154055056008f200) API only support English. See [Recognize printed and handwritten text](concept-recognizing-text.md) for more information on this functionality and the advantages of each API.
2222

23-
OCR automatically detects the language of the input material, so there is no need to specify a language code in the API call. However, language codes are always returned as the value of the `"language"` node in the JSON response.
23+
The OCR and Read APIs automatically detect the language of the input material, so there is no need to specify a language code. Additionally, the language code for the detected language is always returned in the JSON response.
2424

25-
|Language| Language code | OCR API |
26-
|:-----|:----:|:-----:|
27-
|Arabic | `ar`||
28-
|Chinese (Simplified) | `zh-Hans`||
29-
|Chinese (Traditional) | `zh-Hant`||
30-
|Czech | `cs` ||
31-
|Danish | `da` ||
32-
|Dutch | `nl` ||
33-
|English | `en` ||
34-
|Finnish | `fi` ||
35-
|French | `fr` ||
36-
|German | `de` ||
37-
|Greek | `el` ||
38-
|Hungarian | `hu` ||
39-
|Italian | `it` ||
40-
|Japanese | `ja` ||
41-
|Korean | `ko` ||
42-
|Norwegian | `nb` ||
43-
|Polish | `pl` ||
44-
|Portuguese | `pt` ||
45-
|Romanian | `ro` ||
46-
|Russian | `ru` ||
47-
|Serbian (Cyrillic) | `sr-Cyrl` ||
48-
|Serbian (Latin) | `sr-Latn` ||
49-
|Slovak | `sk` ||
50-
|Spanish | `es` ||
51-
|Swedish | `sw` ||
52-
|Turkish | `tr` ||
25+
|Language| Language code | OCR API | Read API |
26+
|:-----|:----:|:-----:|:---:|
27+
|Arabic | `ar`|| |
28+
|Chinese (Simplified) | `zh-Hans`|| |
29+
|Chinese (Traditional) | `zh-Hant`|| |
30+
|Czech | `cs` || |
31+
|Danish | `da` || |
32+
|Dutch | `nl` |||
33+
|English | `en` |||
34+
|Finnish | `fi` || |
35+
|French | `fr` |||
36+
|German | `de` |||
37+
|Greek | `el` || |
38+
|Hungarian | `hu` || |
39+
|Italian | `it` |||
40+
|Japanese | `ja` || |
41+
|Korean | `ko` || |
42+
|Norwegian | `nb` || |
43+
|Polish | `pl` || |
44+
|Portuguese | `pt` |||
45+
|Romanian | `ro` || |
46+
|Russian | `ru` || |
47+
|Serbian (Cyrillic) | `sr-Cyrl` || |
48+
|Serbian (Latin) | `sr-Latn` || |
49+
|Slovak | `sk` || |
50+
|Spanish | `es` |||
51+
|Swedish | `sw` || |
52+
|Turkish | `tr` || |
5353

5454
## Image analysis
5555

@@ -68,4 +68,4 @@ Some actions of the [Analyze - Image](https://westus.dev.cognitive.microsoft.com
6868
Get started using the Computer Vision features mentioned in this guide.
6969

7070
* [Analyze a local image (REST)](./quickstarts/csharp-analyze.md)
71-
* [Extract printed text (REST)](./quickstarts/csharp-print-text.md)
71+
* [Extract printed text (REST)](./quickstarts/csharp-print-text.md)

articles/cognitive-services/Computer-vision/toc.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -135,6 +135,8 @@
135135
href: https://docs.microsoft.com/cli/azure/cognitiveservices?view=azure-cli-latest#az_cognitiveservices_list
136136
- name: Azure PowerShell
137137
href: https://aka.ms/azure-powershell-cognitiveservices
138+
- name: Computer Vision API v3.0
139+
href: https://aka.ms/computer-vision-v3-ref
138140
- name: Computer Vision API v3.0-preview
139141
href: https://westus2.dev.cognitive.microsoft.com/docs/services/5d98695995feb7853f67d6a6
140142
- name: Computer Vision API v2.1

articles/cognitive-services/Speech-Service/how-to-configure-rhel-centos-7.md

Lines changed: 45 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ ldconfig -p | grep libstdc++
4040

4141
The output on vanilla RHEL/CentOS 7 (x64) is:
4242

43-
```
43+
```bash
4444
libstdc++.so.6 (libc6,x86-64) => /lib64/libstdc++.so.6
4545
```
4646

@@ -52,7 +52,7 @@ strings /lib64/libstdc++.so.6 | egrep "GLIBCXX_|CXXABI_"
5252

5353
The output should be:
5454

55-
```
55+
```bash
5656
...
5757
GLIBCXX_3.4.19
5858
...
@@ -67,7 +67,11 @@ The Speech SDK requires **CXXABI_1.3.9** and **GLIBCXX_3.4.21**. You can find th
6767
6868
## Example
6969

70-
This is a sample command that illustrates how to configure RHEL/CentOS 7 x64 for development (C++, C#, Java, Python) with the Speech SDK 1.10.0 or later:
70+
This is a sample command set that illustrates how to configure RHEL/CentOS 7 x64 for development (C++, C#, Java, Python) with the Speech SDK 1.10.0 or later:
71+
72+
### 1. General setup
73+
74+
First install all general dependencies:
7175

7276
```bash
7377
# Only run ONE of the following two commands
@@ -81,16 +85,53 @@ sudo yum update -y
8185
sudo yum groupinstall -y "Development tools"
8286
sudo yum install -y alsa-lib dotnet-sdk-2.1 java-1.8.0-openjdk-devel openssl python3
8387
sudo yum install -y gstreamer1 gstreamer1-plugins-base gstreamer1-plugins-good gstreamer1-plugins-bad-free gstreamer1-plugins-ugly-free
88+
```
8489

85-
# Build GCC 5.4.0 and runtimes and install them under /usr/local
90+
### 2. C/C++ compiler and runtime libraries
91+
92+
Install the prerequisite packages with this command:
93+
94+
```bash
8695
sudo yum install -y gmp-devel mpfr-devel libmpc-devel
96+
```
97+
98+
> [!NOTE]
99+
> The libmpc-devel package has been deprecated in the RHEL 7.8 update. If the output of the previous command includes a message
100+
>
101+
> ```bash
102+
> No package libmpc-devel available.
103+
> ```
104+
>
105+
> then the necessary files need to be installed from original sources. Run the following commands:
106+
>
107+
> ```bash
108+
> curl https://ftp.gnu.org/gnu/mpc/mpc-1.1.0.tar.gz -O
109+
> tar zxf mpc-1.1.0.tar.gz
110+
> mkdir mpc-1.1.0-build && cd mpc-1.1.0-build
111+
> ../mpc-1.1.0/configure --prefix=/usr/local --libdir=/usr/local/lib64
112+
> make -j$(nproc)
113+
> sudo make install-strip
114+
> ```
115+
116+
Next update the compiler and runtime libraries:
117+
118+
```bash
119+
# Build GCC 5.4.0 and runtimes and install them under /usr/local
87120
curl https://ftp.gnu.org/gnu/gcc/gcc-5.4.0/gcc-5.4.0.tar.bz2 -O
88121
tar jxf gcc-5.4.0.tar.bz2
89122
mkdir gcc-5.4.0-build && cd gcc-5.4.0-build
90123
../gcc-5.4.0/configure --enable-languages=c,c++ --disable-bootstrap --disable-multilib --prefix=/usr/local
91124
make -j$(nproc)
92125
sudo make install-strip
126+
```
127+
128+
If the updated compiler and libraries need to be deployed on several machines, you can simply copy them from under `/usr/local` to other machines. If only the runtime libraries are needed then the files in `/usr/local/lib64` will be enough.
129+
130+
### 3. Environment settings
93131

132+
Run the following commands to complete the configuration:
133+
134+
```bash
94135
# Set SSL cert file location
95136
# (this is required for any development/testing with Speech SDK)
96137
export SSL_CERT_FILE=/etc/pki/tls/certs/ca-bundle.crt

articles/cognitive-services/Speech-Service/how-to-use-audio-input-streams.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -20,14 +20,14 @@ The following steps are required when using audio input streams:
2020

2121
- Identify the format of the audio stream. The format must be supported by the Speech SDK and the Speech service. Currently, only the following configuration is supported:
2222

23-
Audio samples in PCM format, one channel, 16000 samples per second, 32000 bytes per second, two block align (16 bit including padding for a sample), 16 bits per sample.
23+
Audio samples in PCM format, one channel, 16 bits per sample, 8000 or 16000 samples per second (16000 or 32000 bytes per second), two block align (16 bit including padding for a sample).
2424

2525
The corresponding code in the SDK to create the audio format looks like this:
2626

2727
```csharp
2828
byte channels = 1;
2929
byte bitsPerSample = 16;
30-
int samplesPerSecond = 16000;
30+
int samplesPerSecond = 16000; // or 8000
3131
var audioFormat = AudioStreamFormat.GetWaveFormatPCM(samplesPerSecond, bitsPerSample, channels);
3232
```
3333

articles/cognitive-services/Speech-Service/how-to-use-codec-compressed-audio-input-streams.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ zone_pivot_groups: programming-languages-set-twelve
1717

1818
The Speech service SDK **Compressed Audio Input Stream** API provides a way to stream compressed audio to the Speech service using either a `PullStream` or `PushStream`.
1919

20-
Streaming compressed input audio is currently supported for C#, C++, Java on Windows (UWP applications aren't supported) and Linux (Ubuntu 16.04, Ubuntu 18.04, Debian 9, RHEL 8, CentOS 8). It is also supported for Java in Android and Objective-C in iOS platform.
20+
Streaming compressed input audio is currently supported for C#, C++, Java on Windows (UWP applications aren't supported) and Linux (Ubuntu 16.04, Ubuntu 18.04, Debian 9, RHEL 7/8, CentOS 7/8). It is also supported for Java in Android and Objective-C in iOS platform.
2121
* Speech SDK version 1.10.0 or later is required for RHEL 8 and CentOS 8
2222
* Speech SDK version 1.11.0 or later is required for for Windows.
2323

articles/cognitive-services/Speech-Service/includes/get-speech-sdk-java.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.author: trbye
88

99
:::row:::
1010
:::column span="3":::
11-
The Java SDK for Android is packaged as an <a href="https://developer.android.com/studio/projects/android-library" target="_blank">AAR (Android Library) <span class="docon docon-navigate-external x-hidden-focus"></span></a>, which includes the necessary libraries and required Android permissions. It's hosted in a Maven repository at `https://csspeechstorage.blob.core.windows.net/maven/` as package `com.microsoft.cognitiveservices.speech:client-sdk:1.11.0`.
11+
The Java SDK for Android is packaged as an <a href="https://developer.android.com/studio/projects/android-library" target="_blank">AAR (Android Library) <span class="docon docon-navigate-external x-hidden-focus"></span></a>, which includes the necessary libraries and required Android permissions. It's hosted in a Maven repository at `https://csspeechstorage.blob.core.windows.net/maven/` as package `com.microsoft.cognitiveservices.speech:client-sdk:1.12.0`.
1212
:::column-end:::
1313
:::column:::
1414
<br>
@@ -27,7 +27,7 @@ To consume the package from your Android Studio project, make the following chan
2727

2828
2. In the module-level *build.gradle* file, add the following to the `dependencies` section:
2929
```gradle
30-
implementation 'com.microsoft.cognitiveservices.speech:client-sdk:1.11.0'
30+
implementation 'com.microsoft.cognitiveservices.speech:client-sdk:1.12.0'
3131
```
3232

3333
The Java SDK is also part of the [Speech Devices SDK](../speech-devices-sdk.md).

articles/cognitive-services/Speech-Service/includes/get-speech-sdk-linux.md

Lines changed: 8 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,6 @@ ms.author: trbye
99
:::row:::
1010
:::column span="3":::
1111
The Speech SDK only supports **Ubuntu 16.04/18.04**, **Debian 9**, **Red Hat Enterprise Linux (RHEL) 7/8**, and **CentOS 7/8** on the following target architectures when used with Linux:
12-
- x64
1312
:::column-end:::
1413
:::column:::
1514
<br>
@@ -19,11 +18,13 @@ ms.author: trbye
1918
:::column-end:::
2019
:::row-end:::
2120

22-
> [!IMPORTANT]
23-
> When targeting Linux ARM64, and using C# - the .NET Core 3.x (dotnet-sdk-3.x package) is required. If you're targeting ARM32 or ARM64, Python is not supported.
21+
- x86 (Debian/Ubuntu), x64, ARM32 (Debian/Ubuntu), and ARM64 (Debian/Ubuntu) for C++ development
22+
- x64, ARM32 (Debian/Ubuntu), and ARM64 (Debian/Ubuntu) for Java
23+
- x64, ARM32 (Debian/Ubuntu), and ARM64 (Debian/Ubuntu) for .NET Core
24+
- x64 for Python
2425

25-
> [!NOTE]
26-
> The x86 architectures of Ubuntu 16.04, Ubuntu 18.04, and Debian 9 only support C++ development with the Speech SDK.
26+
> [!IMPORTANT]
27+
> For C# on Linux ARM64, the .NET Core 3.x (dotnet-sdk-3.x package) is required.
2728
2829
### System requirements
2930

@@ -55,10 +56,8 @@ sudo yum install alsa-lib openssl
5556
```
5657

5758
> [!IMPORTANT]
58-
> Follow the instructions on [how to configure RHEL/CentOS 7 for Speech SDK](~/articles/cognitive-services/speech-service/how-to-configure-rhel-centos-7.md).
59-
60-
> [!TIP]
61-
> On RHEL/CentOS 8, follow the instructions on [how to configure OpenSSL for Linux](../how-to-configure-openssl-linux.md).
59+
> - On RHEL/CentOS 7, follow the instructions on [how to configure RHEL/CentOS 7 for Speech SDK](~/articles/cognitive-services/speech-service/how-to-configure-rhel-centos-7.md).
60+
> - On RHEL/CentOS 8, follow the instructions on [how to configure OpenSSL for Linux](~/articles/cognitive-services/speech-service/how-to-configure-openssl-linux.md).
6261
6362
---
6463

articles/cognitive-services/Speech-Service/includes/get-speech-sdk-macos.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -38,7 +38,7 @@ platform :ios, '9.3'
3838
use_frameworks!
3939
4040
target 'MyApp' do
41-
pod 'MicrosoftCognitiveServicesSpeech', '~> 1.11.0'
41+
pod 'MicrosoftCognitiveServicesSpeech', '~> 1.12.0'
4242
end
4343
```
4444

articles/cognitive-services/Speech-Service/includes/how-to/compressed-audio-input/cpp/prerequisites.md

Lines changed: 6 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ gstreamer1.0-plugins-bad \
1818
gstreamer1.0-plugins-ugly
1919
```
2020

21-
# [REHL / CentOS](#tab/centos)
21+
# [RHEL/CentOS 7/8](#tab/centos)
2222

2323
```sh
2424
sudo yum install gstreamer1 \
@@ -29,6 +29,10 @@ gstreamer1-plugins-ugly-free
2929
```
3030

3131
> [!NOTE]
32-
> On RHEL / CentOS, follow the instructions on [how to configure OpenSSL for Linux](../../../../how-to-configure-openssl-linux.md).
32+
> - On RHEL/CentOS 7, follow the instructions on [how to configure RHEL/CentOS 7 for Speech SDK](~/articles/cognitive-services/speech-service/how-to-configure-rhel-centos-7.md).
33+
> - On RHEL/CentOS 8, follow the instructions on [how to configure OpenSSL for Linux](~/articles/cognitive-services/speech-service/how-to-configure-openssl-linux.md).
34+
35+
> [!IMPORTANT]
36+
> The Opus audio coding format is not supported by the Speech SDK on RHEL/CentOS 7.
3337
3438
---

0 commit comments

Comments
 (0)