Commit 093a6f9

[tn] fix 手机尾号2349 (mobile number ending in 2349) and 100兆 (#184)
1 parent bbb4cba commit 093a6f9

4 files changed: 6 additions, 0 deletions

tn/chinese/data/measure/units_zh.tsv

Lines changed: 1 addition & 0 deletions
@@ -239,3 +239,4 @@
 
 摄氏度
 
+

tn/chinese/rules/cardinal.py

Lines changed: 1 addition & 0 deletions
@@ -76,6 +76,7 @@ def build_tagger(self):
 # three or five or eleven phone numbers
 phone_digits = digits @ self.build_rule(cross('一', '幺'))
 phone = phone_digits**3 | phone_digits**5 | phone_digits**11
+phone |= accep("尾号") + (accep("是") | accep("为")).ques + phone_digits**4
 cardinal |= add_weight(phone, -1.0)
 
 tagger = insert('value: "') + cardinal + insert('"')
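
Note: the added rule accepts 尾号, then an optional 是 or 为, then exactly four digits read one by one, with 1 read as 幺. The sketch below mirrors that behaviour with a plain regular expression instead of the FST grammar used in cardinal.py. The names PHONE_DIGITS, TAIL_RE and read_tail_number are hypothetical, and the "no further digit follows" check is an assumption of this sketch, not something the commit states.

```python
import re

# Digit-by-digit readings; in phone-style reading, 1 is read as 幺 rather than 一.
PHONE_DIGITS = dict(zip("0123456789", "零幺二三四五六七八九"))

# 尾号, an optional 是 or 为, then exactly four ASCII digits.
# The negative lookahead (an assumption of this sketch) rejects longer digit runs.
TAIL_RE = re.compile(r"(尾号[是为]?)([0-9]{4})(?![0-9])")


def read_tail_number(text: str) -> str:
    """Hypothetical helper: verbalize the four digits that follow 尾号."""

    def repl(match: re.Match) -> str:
        prefix, digits = match.group(1), match.group(2)
        return prefix + "".join(PHONE_DIGITS[d] for d in digits)

    return TAIL_RE.sub(repl, text)
```

Run on the three inputs added to cardinal.txt in this commit, the helper should give the same readings as the expected outputs listed there.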

tn/chinese/test/data/cardinal.txt

Lines changed: 3 additions & 0 deletions
@@ -1,3 +1,6 @@
 110 => 幺幺零
 127.0.0.1 => 一二七点零点零点一
 010-64035547 => 零一零六四零三五五四七
+尾号1702 => 尾号幺七零二
+尾号是3385 => 尾号是三三八五
+尾号为2349 => 尾号为二三四九

tn/chinese/test/data/measure.txt

Lines changed: 1 addition & 0 deletions
@@ -8,3 +8,4 @@
 1 kg => 一千克
 2% => 百分之二
 10km/h => 每小时十公里
+100兆 => 一百兆
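
Note: each line in these test data files pairs a written form with its expected spoken form, separated by " => ". The sketch below shows one way such lines could be split into pairs. The function name parse_cases is hypothetical, and this is not necessarily how the repository's own test harness reads the files.

```python
from typing import Iterable, List, Tuple


def parse_cases(lines: Iterable[str]) -> List[Tuple[str, str]]:
    """Hypothetical helper: split 'written => spoken' lines into pairs."""
    cases = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # Split on the first separator only, so the spoken side stays intact.
        written, spoken = line.split(" => ", 1)
        cases.append((written, spoken))
    return cases
```

For example, parse_cases(["100兆 => 一百兆"]) yields a single pair whose written form is 100兆 and whose expected reading is 一百兆.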
