Commit 41dfc8d

Reorder 廣韻字頭, add 字頭原貌 and correct some data (#10)
- All 廣韻 entries are reordered according to 澤存堂本
  - This solves a long-standing issue with both 韻典 and 廣韻字音表's data: both data tables combine 字頭 from 廣韻全字表 and 釋義 from 宋本廣韻データ. However, 廣韻全字表 is based on 巾箱本 while 宋本廣韻データ is based on 澤存堂本, which creates mismatches that cause confusion
  - src/字序表.csv is the main result of this work. It lists 廣韻's entries in the correct order, with corresponding entry indices from 廣韻字音表, 宋本廣韻データ and 韻典 listed as reference
- Entries missing in poem's data are added back
  - This includes characters only representable with IDS, and several additions from other versions of 廣韻 (chiefly 廣韻校本)
  - The field 小韻內字序 is now 小韻字號, which may contain "-a1", "-a2" etc. for added entries that should be there but are missing in 澤存堂本
- Added 字頭原貌
  - 字頭原貌 is taken from poem's "字頭-原貌" field (only those marked with "校" but not with "部件換位" or "調整碼位", as the latter cases are for equivalent characters), plus our corrections (most are from 廣韻形聲考)
- 釋義補充 field replaced with 釋義參照
  - Instead of repeating the text, 釋義參照 simply indicates whether the entry has 釋義 that refers to the entry above, or shares 釋義 with the entry below
  - This also resolves 釋義補充-related issues in #7
- More errors in 字頭 & 釋義 are corrected
  - These were discovered when the new 字序表 was being made, and are still WIP
  - Some corrections are newly added according to 廣韻形聲考
- README updated with detailed description of the fields in 廣韻.csv
1 parent 436aad7 commit 41dfc8d

9 files changed: +54899 -29397 lines changed

DEVELOP.md

Lines changed: 14 additions & 8 deletions
@@ -2,18 +2,24 @@
 
 ## Sources
 
-- 廣韻(20170209).csv: From [廣韻字音表](https://zhuanlan.zhihu.com/p/20430939), created by poem.
-- rime-table-0b69606.tsv: From [切韻新韻圖](https://phesoca.com/rime-table/) by unt, built from git commit `0b69606`.
-- split.csv: Maintained here, ultimately also from 切韻新韻圖.
+_poem_'s 廣韻 data:
+
+- 廣韻(20170209).csv: From [廣韻字音表](https://zhuanlan.zhihu.com/p/20430939), created by _poem_
+
+Maintained by NK2028:
+
+- 小韻表.csv: 音韻地位 and 反切
+- split.csv: Details of 小韻s with multiple 音韻地位s
+- 字序表: Correct order of 廣韻's entries
+  - `poem_*` fields refer to _poem_'s 廣韻字音表
+  - `sbgy_*` fields refer to [宋本廣韻データ](https://kanji-database.sourceforge.net/dict/sbgy/index.html)
+  - `ytenx_*` fields refer to [韻典網](https://ytenx.org/)
+    - Data is taken from commit `d95d247` (2023-12-21), which differs from the current (as of Jan. 2025) deployed version (commit `3666370` 2020-03-23) by two 字頭s (小韻 1326 茅→芧, 小韻 2882 匕→𠤎)
+- patches.csv: Corrections to _poem_'s data
 
 ## Build
 
 ```sh
 python build.py
 python check.py
 ```
-
-## Remarks
-
-- For entries that poem's table marks 「應補」 ("should be added"): those given with a Unicode 字頭 can all be found at the end of the original table (their 小韻內字序 number carries .5); those without one (字頭 given as IDS or described in words) are still not included
-- Entries that poem's table marks 「應換序」 ("should be reordered") or 「順序應爲」 ("order should be") have not been corrected, and the 釋義補充 field also has problems (apparently stemming from differences in the base edition of the earlier 《廣韻全字表》 by 有女同車)
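
The `poem_*` / `sbgy_*` / `ytenx_*` naming in 字序表 lends itself to grouping columns by the source they point to. The following is only a minimal sketch: the three prefixes come from DEVELOP.md, but the concrete column names (`poem_id`, `sbgy_id`, `ytenx_id`, `字頭`) and the sample row are hypothetical stand-ins, since the actual header of the file is not shown here.

```python
import csv
import io
from collections import defaultdict

# Prefixes as documented for 字序表; the part after the underscore is not
# specified in the text, so the column names used below are hypothetical.
SOURCE_BY_PREFIX = {
    "poem_": "廣韻字音表",
    "sbgy_": "宋本廣韻データ",
    "ytenx_": "韻典網",
}


def group_fields_by_source(fieldnames):
    """Map each source name to the list of columns carrying its prefix."""
    groups = defaultdict(list)
    for name in fieldnames:
        for prefix, source in SOURCE_BY_PREFIX.items():
            if name.startswith(prefix):
                groups[source].append(name)
                break
        else:
            # No known prefix: treat it as a column of 字序表 itself.
            groups["字序表"].append(name)
    return dict(groups)


# Hypothetical two-line sample standing in for the real file.
SAMPLE = "字頭,poem_id,sbgy_id,ytenx_id\n東,1,1,1\n"
reader = csv.DictReader(io.StringIO(SAMPLE))
groups = group_fields_by_source(reader.fieldnames)
rows = list(reader)
```

With the real file, `io.StringIO(SAMPLE)` would presumably be replaced by an opened `src/字序表.csv` (UTF-8), and `groups` then tells which columns to consult when cross-checking an entry against a given source.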

README.md

Lines changed: 28 additions & 7 deletions
@@ -3,12 +3,33 @@
 A database of the Qieyun phonological system.
 
 - 韻書
-  - 王一:`王一.csv` (not completed)
-  - 王三:`王三.csv` (entries within each 小韻 still to be proofread)
-  - 廣韻澤存堂本`廣韻.csv`
+  - 王一:`王一.csv` (not completed)
+  - 王三:`王三.csv` (entries within each 小韻 still to be proofread)
+  - 廣韻 (澤存堂本, with corrections from 廣韻校本, 廣韻形聲考 etc.)`廣韻.csv`
 - 韻圖
-  - 韻鏡(嘉吉本):`韻鏡(嘉吉本).csv` (not completed)
-  - 韻鏡(古逸叢書本):`韻鏡(古逸叢書本).csv`
+  - 韻鏡(嘉吉本):`韻鏡(嘉吉本).csv` (not completed)
+  - 韻鏡(古逸叢書本):`韻鏡(古逸叢書本).csv`
 - 反切音韻地位
-  - 王三:`王三反切音韻地位表.csv` (rev. Ayaka & unt)
-  - 廣韻:`廣韻反切音韻地位表.csv` (beta)
+  - 王三:`王三反切音韻地位表.csv` (rev. Ayaka & unt)
+  - 廣韻:`廣韻反切音韻地位表.csv` (beta)
+
+## About fields in 韻書/廣韻.csv
+
+- 小韻號: May contain -a/-b/-c if a 小韻 has multiple 音韻地位s
+- 小韻字號: May contain -a1, -a2 etc for entries not present in 澤存堂本 but added back according to 廣韻校本
+- 反切: May contain annotations:
+  - Missing character (脫字): `[徒]候` (小韻 #3067 豆)
+  - Erroneous character (訛字): `士<七>演` (小韻 #1625 淺)
+  - 音韻地位 taken from another source: `姊宜⦉規⦊` (小韻 #133 厜)
+  - Replaced with a near-equivalent character, 反切 result changes: `符咸(䒦)` (小韻 #1155 凡)
+  - Replaced with a similar-sounding character, 反切 result changes: `式之(脂)` (小韻 #157 尸)
+  - Replaced with an equivalent character, 反切 result unchanged: `甫⦅府⦆妄` (小韻 #2918 放)
+  - Replaced with a homophone, 反切 result unchanged: `呼東⦅紅⦆` (小韻 #32 烘)
+  - Combined use: `以沼⦅小⦆<水>` (小韻 #1692a 鷕)
+- 字頭原貌 & 字頭: The character for the entry
+  - A non-empty 字頭原貌 indicates a correction of the character
+  - Additionally, an empty 字頭 indicates this entry in 澤存堂本 is erroneous and should be removed
+- 字頭說明: contains notes about some of the corrections or removals
+- 釋義參照:
+  - `` if 釋義 refers to the entry above ("同上", "俗", "古文" etc.)
+  - `` if it shares 釋義 with the entry below ("並上同", "並古文" etc.)
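
The bracket annotations listed for the 反切 field can be separated mechanically. Below is a minimal sketch, not the repository's own parser: it only classifies each bracketed span by the shape of its brackets, using the README examples as input. The group names (`missing`, `erroneous`, and so on) and the function `tokenize_fanqie` are made up for illustration, and the sketch takes no position on whether a bracketed character is the original reading or the corrected one, since the README excerpt does not say.

```python
import re

# One alternative per bracket pair shown in the README examples.
# The group names are illustrative labels, not names used by the project.
TOKEN = re.compile(
    r"\[(?P<missing>[^\]]*)\]"             # [徒]  : 脫字
    r"|<(?P<erroneous>[^>]*)>"             # <七>  : 訛字
    r"|⦉(?P<other_source>[^⦊]*)⦊"          # ⦉規⦊ : 音韻地位 from another source
    r"|\((?P<result_changes>[^)]*)\)"      # (䒦)  : replacement, result changes
    r"|⦅(?P<result_unchanged>[^⦆]*)⦆"      # ⦅府⦆ : replacement, result unchanged
    r"|(?P<plain>.)"                       # any other single character
)


def tokenize_fanqie(field):
    """Split a 反切 field into (kind, text) pairs, one per character or bracket span."""
    return [(m.lastgroup, m.group(m.lastgroup)) for m in TOKEN.finditer(field)]


tokens_dou = tokenize_fanqie("[徒]候")
tokens_combined = tokenize_fanqie("以沼⦅小⦆<水>")
```

Because `(…)` is shared by two of the listed cases (near-equivalent and similar-sounding replacements), the sketch cannot tell them apart; distinguishing them would need information beyond the bracket shape.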
