Skip to content

Commit d5d4889

Browse files
committed
FunASR check in.
1 parent 071967c commit d5d4889

File tree

4 files changed

+132
-113
lines changed

4 files changed

+132
-113
lines changed

content/learning-paths/servers-and-cloud-computing/funASR/1_asr.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -10,9 +10,9 @@ layout: learningpathall
1010

1111
Automatic Speech Recognition [ASR](https://en.wikipedia.org/wiki/Speech_recognition), also known as speech-to-text, is a rapidly evolving field that empowers computers to understand and transcribe human speech.
1212

13-
This technology has become an integral part of our daily lives, powering a wide range of applications and services.  
13+
This technology has become deeply integrated into our daily lives, powering a wide range of applications and services we often take for granted, many of which are optimized for and run on Arm CPU architecture.
1414

15-
At its core, ASR involves converting spoken audio into written text. This seemingly simple task is actually quite complex, requiring sophisticated algorithms and models to accurately interpret the nuances of human speech, including variations in pronunciation, accents, and background noise.  
15+
At its core, ASR involves converting spoken audio into written text. While seemingly simple, this process is quite complex, requiring sophisticated algorithms and models to accurately interpret the nuances of human speech, including variations in pronunciation, accents, and background noise.
1616

1717
### Key Applications of ASR
1818

@@ -47,8 +47,8 @@ ASR is now used in a myriad of applications across various domains:
4747

4848
While the potential applications of ASR are vast and inspiring, it's important to acknowledge the inherent challenges in developing and deploying accurate and reliable ASR systems. These challenges stem from the complexities of human speech, environmental factors, and the intricacies of language itself. These challenges are particularly pronounced for Chinese ASR, which needs to address unique linguistic characteristics such as:
4949

50-
* **Tonal Variations:**
51-
Mandarin Chinese is a tonal language, where the meaning of a syllable changes depending on its tone. Accurately recognizing these tonal variations is crucial for understanding spoken Chinese.
50+
* **Complexities of Chinese Language:**
51+
Mandarin Chinese involves tonal variations where the meaning of a syllable changes depending on its tone, and punctuation is crucial to convey meaning and avoid ambiguity. Accurately recognizing these nuances is essential for understanding spoken Chinese.
5252

5353
* **Noise Robustness:**
5454
ASR systems need to be able to filter out background noise to accurately transcribe speech. This is particularly challenging in noisy environments like crowded streets or busy offices.
@@ -59,4 +59,4 @@ While the potential applications of ASR are vast and inspiring, it's important t
5959
* **Homophones:**
6060
Chinese has a high prevalence of homophones, words that sound alike but have different meanings. Disambiguating these homophones requires understanding the context and semantics of the spoken words.
6161

62-
Fortunately, advancements in AI and open-source technologies are paving the way for innovative solutions to address these challenges. In the following sections, we'll explore one such solution that leverages the power of ModelScope and Arm CPUs to enable efficient and accurate Chinese ASR.
62+
In the following sections, we'll explore one such solution that leverages the power of ModelScope and Arm CPUs to enable efficient and accurate Chinese ASR.

content/learning-paths/servers-and-cloud-computing/funASR/2_modelscope.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ layout: learningpathall
88

99
## Before you begin
1010

11-
To follow the instructions for this Learning Path, you will need an Arm server running Ubuntu 22.04 LTS or later version with at least 16 cores, 16GB of RAM, and 50GB of disk storage.
11+
To follow the instructions for this Learning Path, you will need an Arm server running Ubuntu 22.04 LTS or later version with at least 8 cores, 16GB of RAM, and 30GB of disk storage.
1212

1313
## Introduce ModelScope
1414
[ModelScope](https://github.com/modelscope/modelscope/) is an open-source platform that makes it easy to use AI models in your applications.
@@ -27,11 +27,9 @@ Key benefits of ModelScope include:
2727

2828

2929
## Arm CPU Acceleration
30-
ModelScope fully support Pytorch 1.8+ and other machine learing framework which can be efficiently deployed on Arm Neoverse CPUs, taking advantage of Arm's performance and power-efficiency characteristics.
30+
ModelScope fully supports Pytorch 1.8+ and other machine learning frameworks, which can be efficiently deployed on Arm Neoverse CPUs, taking advantage of Arm's performance and power-efficiency characteristics.
3131

32-
Arm provides optimized software and tools, such as the Kleidi, to accelerate AI inference on Arm-based platforms.
33-
34-
This makes Arm Neoverse CPUs an ideal choice for running ModelScope models in edge devices and other resource-constrained environments.
32+
Arm provides optimized software and tools, such as Kleidi, to accelerate AI inference on Arm-based platforms. This makes Arm Neoverse CPUs an ideal choice for running ModelScope models in edge devices and other resource-constrained environments.
3533

3634
You can learn more about [Faster PyTorch Inference using Kleidi on Arm Neoverse](https://community.arm.com/arm-community-blogs/b/servers-and-cloud-computing-blog/posts/faster-pytorch-inference-kleidi-arm-neoverse) from Arm community website.
3735

@@ -58,20 +56,23 @@ pip3 install numpy packaging addict datasets simplejson sortedcontainers transfo
5856

5957
```
6058
{{% notice Note %}}
61-
This learning path will use execute model in Arm Neoverse, so we only need install PyTorch CPU package.
59+
This learning path will execute models on Arm Neoverse, so we only need to install the PyTorch CPU package.
6260
{{% /notice %}}
6361

6462
## Create a sample example
6563

6664
After completing the installation, we will use an example related to Chinese semantic understanding to illustrate how to use ModelScope.
6765

68-
There is a fundamental difference between Chinese and English writing. The relationship between Chinese characters and their meanings is somewhat analogous to the difference between words and phrases in English. Some Chinese characters, like English words, have clear meanings on their own, such as "人" (person), "山" (mountain), and "水" (water).
66+
There is a fundamental difference between Chinese and English writing.
67+
The relationship between Chinese characters and their meanings is somewhat analogous to the difference between words and phrases in English.
68+
Some Chinese characters, like English words, have clear meanings on their own, such as “人” (person), “山” (mountain), and “水” (water).
6969

70-
However, more often, Chinese characters need to be combined with other characters to express more complete meanings, just like phrases in English. For example, "祝福" (blessing) can be broken down into "祝" (wish) and "福" (good fortune); "分享" (share) can be broken down into "分" (divide) and "享" (enjoy); "生成" (generate) is composed of "生" (produce) and "成" (become).
70+
However, more often, Chinese characters need to be combined with other characters to express more complete meanings, just like phrases in English.
71+
For example, “祝福” (blessing) can be broken down into “祝” (wish) and “福” (good fortune); “分享” (share) can be broken down into “分” (divide) and “享” (enjoy); “生成” (generate) is composed of “生” (produce) and “成” (become).
7172

7273
For computers to understand Chinese sentences, we need to understand the rules of Chinese characters, vocabulary, and grammar to accurately understand and express meaning.
7374

74-
Here ia a simple example using a general-domain Chinese word segmentation [model(https://www.modelscope.cn/models/iic/nlp_structbert_word-segmentation_chinese-base)], which can break down Chinese sentences into individual words, facilitating analysis and understanding by computers.
75+
Here ia a simple example using a general-domain Chinese [word segmentation model](https://www.modelscope.cn/models/iic/nlp_structbert_word-segmentation_chinese-base), which can break down Chinese sentences into individual words, facilitating analysis and understanding by computers.
7576

7677
```python
7778
from modelscope.pipelines import pipeline
@@ -128,4 +129,3 @@ The segmentation model has correctly identified the following words:
128129

129130

130131
The segmentation model has successfully identified the word boundaries and separated the sentence into meaningful units, which is essential for further natural language processing tasks like machine translation or sentiment analysis.
131-

0 commit comments

Comments
 (0)