@@ -19,8 +19,10 @@ List of these files (see https://www.unicode.org/Public/UCD/latest/ucd/):
1919
2020Process:
2121* The “source of truth” is the Unihan database maintained by the CJK/Unihan group, including data maintained by Michel.
22- * The CJK/Unihan group posts data files into an internal location.
23- * KenW vets these files and posts them to https://www.unicode.org/Public/draft/UCD/ucd/ .
22+ * The CJK/Unihan group maintains the data files in the Unicode-internal unihan-tools repo
23+ and creates GitHub releases with the /Public data files.
24+ * These include RSIndex.txt and RSIndex.pdf which are published in the charts folder, not in the ucd folder.
25+ * An infrastructure person copies these files to /Public/draft/ucd, /Public/draft/charts, /Public/{version}/... as appropriate.
2426* A unicodetools GitHub contributor fetches these files, preprocesses the contents of Unihan.zip,
2527 and creates a pull request as for “regular” data files.
2628 (The processed data files go into .../unicodetools/data/ucd/dev/Unihan.)
@@ -48,6 +50,7 @@ Changes are made in a GitHub pull request.
4850* Updated files could be shared in various ways including via email or via private FTP areas.
4951* Updated files should be based on the latest (or fairly recent) data in the unicodetools repo.
5052* Updated files should not be posted directly to https://www.unicode.org/Public/ ...
53+ * We work with an infra person to publish whole UCD/alpha/beta/final data file drops into /Public .
5154
5255Pull request cycle:
5356* One commit for manual or contributed data changes.
@@ -81,8 +84,8 @@ https://github.com/unicode-org/unicodetools/tree/main/unicodetools/data/emoji/de
8184
8285Certain snapshots of the .../dev/ files are copied into https://www.unicode.org/Public/draft/
8386for Unicode alpha, beta, and final releases, and more as appropriate.
84- * UCD files go into https://www.unicode.org/Public/draft/UCD /
85- * UCA files go into https://www.unicode.org/Public/draft/UCA /
87+ * UCD files go into https://www.unicode.org/Public/draft/ucd /
88+ * UCA files go into https://www.unicode.org/Public/draft/uca /
8689* emoji files go into https://www.unicode.org/Public/draft/emoji/
8790* etc.
8891* Inside “draft” there are no folder levels with version numbers.
@@ -104,18 +107,16 @@ script from an up-to-date repo workspace.
104107The script copies the set of the .../dev/ data files for an alpha snapshot
105108from a unicodetools workspace to a target folder with the layout of https://www.unicode.org/Public/draft/ .
106109
107- Send the resulting zip file to Rick for posting to https://www.unicode.org/Public/draft/ .
108- Ask Rick to add other files that are not tracked in the unicodetools repo:
109- * Unihan.zip to .../draft/UCD/ucd
110-
111- TODO: Figure out new process & people replacing Rick in 2025.
110+ Send the resulting zip file to an infra person for posting to https://www.unicode.org/Public/draft/ .
111+ Ask the infra person to add other files that are not tracked in the unicodetools repo:
112+ * Unihan.zip to .../draft/ucd
112113
113114Note: No version/delta infixes in names of data files.
114115We simply use the “draft” folder and the file-internal time stamps for versioning.
115116
116117### Publish an alpha snapshot
117118
118- For the alpha review, publish (at least) the UCD and emoji files, and the charts.
119+ For the alpha review, publish (at least) the UCD and emoji files, the IDNA files, and the charts.
119120
120121Review/edit the pub/* .sh scripts and advance the version numbers and copyright years.
121122
@@ -124,10 +125,10 @@ script from an up-to-date repo workspace.
124125The script copies the set of the .../dev/ data files for an alpha snapshot
125126from a unicodetools workspace to a target folder with the layout of https://www.unicode.org/Public/draft/ .
126127
127- Send the resulting zip file to Rick for posting to https://www.unicode.org/Public/draft/ .
128- Ask Rick to add other files that are not tracked in the unicodetools repo:
129- * Unihan.zip to .../draft/UCD/ ucd
130- * alpha charts to .../draft/UCD/ charts
128+ Send the resulting zip file to an infra person for posting to https://www.unicode.org/Public/draft/ .
129+ Ask the infra person to add other files that are not tracked in the unicodetools repo:
130+ * Unihan.zip to .../draft/ucd
131+ * alpha charts to .../draft/charts
131132
132133Note: No version/delta infixes in names of data files.
133134We simply use the “draft” folder and the file-internal time stamps for versioning.
@@ -141,11 +142,11 @@ script from an up-to-date repo workspace.
141142The script copies the set of the .../dev/ data files for a beta snapshot
142143from a unicodetools workspace to a target folder with the layout of https://www.unicode.org/Public/draft/ .
143144
144- Send the resulting zip file to Rick for posting to https://www.unicode.org/Public/draft/ .
145- Ask Rick to add other files that are not tracked in the unicodetools repo:
146- * Unihan.zip to .../draft/UCD/ ucd
147- * UCDXML files to .../draft/UCD/ ucdxml
148- * beta charts to .../draft/UCD/ charts
145+ Send the resulting zip file to an infra person for posting to https://www.unicode.org/Public/draft/ .
146+ Ask the infra person to add other files that are not tracked in the unicodetools repo:
147+ * Unihan.zip to .../draft/ucd
148+ * UCDXML files to .../draft/ucdxml
149+ * beta charts to .../draft/charts
149150
150151### Publish a release snapshot
151152
@@ -158,19 +159,19 @@ Verify the final set of files in the draft folder.
158159Run the [ pub/copy-final.sh] ( https://github.com/unicode-org/unicodetools/blob/main/pub/copy-final.sh )
159160script from an up-to-date repo workspace.
160161
161- Send the resulting zip file to Rick for posting to https://www.unicode.org/Public/ (not .../Public/draft/).
162- Ask Rick to add other files that are not tracked in the unicodetools repo:
162+ Send the resulting zip file to an infra person for posting to https://www.unicode.org/Public/ (not .../Public/draft/).
163+ Ask the infra person to add other files that are not tracked in the unicodetools repo:
163164* Unihan.zip to .../{version}/ucd
164165* UCDXML files to .../{version}/ucdxml
165166* final charts to .../{version}/charts
166167
167- This script works much like the beta script, except it:
168- * assembles all of the files for Public/ in their release folder structure,
169- rather than for Public/draft/
170- * creates a zipped/{version} folder with UCD.zip
168+ TODO: Starting with 17.0, the folder structure of /Public/draft is the same as that of /Public/{version} .
169+ Consider moving the final files from /Public/draft to /Public/{version} rather than running another script.
171170
172171### Before a release
173172
173+ TODO: Review this section, and merge it into the previous one.
174+
174175When the data files are supposed to be final, about a week or two before the release:
175176
176177Verify once more that the unicodetools repo .../dev/ files match the released/published files.
@@ -182,7 +183,7 @@ https://github.com/unicode-org/unicodetools/releases/tag/final-15.1-20230908
182183### After a release
183184
184185Copy a snapshot of the unicodetools repo .../dev/ files to a versioned unicodetools folder;
185- for example: .../unicodetools/data/ucd/16 .0.0/ .
186+ for example: .../unicodetools/data/ucd/17 .0.0/ .
186187(We no longer append a “-Update” suffix to the folder name.)
187188List: emoji, idna, security, uca, ucd, ucdxml
188189Watch for different naming conventions: emoji versions use only two fields, not three.
0 commit comments