Commit 1d1b687
macchiati and stenshamn authored
CLDR-18940 Release page cleanup (#5089)
Co-authored-by: Fredrik <[email protected]>
1 parent 70e82ed commit 1d1b687

1 file changed: +29 -23 lines changed

docs/site/downloads/cldr-48.md

Lines changed: 29 additions & 23 deletions
@@ -24,10 +24,11 @@ This data is also a factor in determining which languages are supported on mobil
 ### Changes
 
 The most significant changes in this release are:
-- Updated for Unicode 17, including new names and search terms for new emoji, new sort-order, Han→Latn romanization additions for many characters.
+- Updated for Unicode 17, including new names and search terms for new emoji, new sort-order, Han → Latin romanization additions for many characters.
 - Updated to the latest external standards and data sources, such as the language subtag registry, UN M49 macro regions, ISO 4217 currencies, etc.
 - Many enhancements of the CLDR specification (LDML), including:
-- Further additions to the Message Format 2.0 spec.
+- **TBD A summary of the changes will be added for the spec-beta.**
+**In the meantime, see the [Modifications section](https://www.unicode.org/reports/tr35/dev/tr35-modifications.html).**
 - Many additions to language data including:
 - Likely Subtags, for deriving the likely script and region from the language (used in many processes).
 - Language populations in countries: significant updates to improve accuracy and maintainability.
@@ -77,11 +78,11 @@ For a full listing, see [Coverage Levels](https://unicode.org/cldr/charts/dev/su
 
 The following are the most significant changes to the specification (LDML).
 
-- **A summary of the changes will be added for the beta on Oct 1.**
-**In the meantime, see [Modifications section](https://www.unicode.org/reports/tr35/proposed.html#Modifications) for the detailed modifications.**
+- **TBD A summary of the changes will be added for the spec-beta.**
+**In the meantime, see the [Modifications section](https://www.unicode.org/reports/tr35/dev/tr35-modifications.html).**
 
 There are many more changes that are important to implementations, such as changes to certain identifier syntax and various algorithms.
-See the [Modifications section](https://www.unicode.org/reports/tr35/proposed.html#Modifications) of the specification for details.
+See the [Modifications section](https://www.unicode.org/reports/tr35/dev/tr35-modifications.html) of the specification for details.
 
 ## Data Changes
 
@@ -109,7 +110,7 @@ See the [Modifications section](https://www.unicode.org/reports/tr35/proposed.ht
 - Chinese
 - Gregorian
 - Recent or upcoming currency names were added (XCG, ZWG).
-- To match ISO, translations for the region Sark (CQ) was added.
+- To match ISO, added translations for the region Sark (CQ).
 - There are now combination formats for _relative_ dates + times, such as “tomorrow _at_ 12:30”.
 In some languages the use of a relative date such as “tomorrow” or “2 days ago” requires a _different_ combining pattern than for a fixed date like “March 20”.
 A new “relative” variant is introduced to allow for those languages.
@@ -122,7 +123,8 @@ The alphaNextToNumber patterns allow for a space between letter currency symbols
 - For the Etc/Unknown timezone, the `exemplarCity` name was changed from “Unknown City” to “Unknown Location” for clarity.
 - Rational number formats were added, allowing for formats like 5½.
 - Certain concentration units were reworked, for “parts per million”, “parts per billion”.
-- Additional units were added for scientific contexts (coulombs, farads, teslas, etc.) and for English systems (fortnights, imperial pints, etc.). However, translation of these English system names were not required.
+- Additional units were added for scientific contexts (coulombs, farads, teslas, etc.) and for English systems (fortnights, imperial pints, etc.).
+However, translation of these English system names was not required.
 - Additional guidance on translation was added, leading to refined translations or transcreations.
 
 #### Specific Locales
@@ -153,25 +155,30 @@ The explanations of usage are in the [Locale Changes](#locale-changes) section.
 - This “flattens” the rules into a format that is easier for implementations to use directly.
 
 #### supplementalData
-- `era` — the range of `code` values nows allows two letters before the first hyphen.
-- `languageData` — the `territories` attribute [`supplementalData.xml`](https://github.com/unicode-org/cldr/blob/main/common/supplemental/supplementalData.xml) was deprecated and data using it removed. The definition was unclear, and prone to misunderstanding — the more detailed data is in `territoryInfo`. ([CLDR-5708][])
+- `era` — the range of `code` values now allows two letters before the first hyphen.
+- `languageData` — the `territories` attribute [`supplementalData.xml`](https://github.com/unicode-org/cldr/blob/main/common/supplemental/supplementalData.xml) was deprecated and data using it removed.
+The definition was unclear, and prone to misunderstanding — the more detailed data is in `territoryInfo`. ([CLDR-5708][])
 - `usesMetazone` — adds two new attributes `stdOffset` and `dstOffset` so that implementations can use either "vanguard" or "rearguard" TZDB data sources.
 - `numberingSystem` — Unicode 17 data was added.
 
 #### ldmlBCP47
-- `type` — adds a new attibute `region`.
+- `type` — adds a new attribute `region`.
 - `keyboard3@conformsTo` — is updated to allow "48".
 
 ### BCP47 Data Changes
+
+For a full listing, see [BCP47 Delta].
+
 - `nu-tols` numbering system for Tolong Siki digits
 - One additional zone: America/Coyhaique = tz-clcxq
 - Seven region attributes for determining regions for timezones
 - Three additional aliases
 
-For a full listing, see [BCP47 Delta].
 
 ### Supplemental Data Changes
 
+For a full listing, see [Supplemental Delta].
+
 #### Identifiers
 - Added aliases/deprecations for languages (dek, mnk, nte).
 - Updated to the latest language subtag registry, with various additions and deprecations.
@@ -180,12 +187,13 @@ For a full listing, see [BCP47 Delta].
 - deprecated unit IDs permillion, portion, portion-per-1e9, 100-kilometer.
 
 #### Language Data
-- [language_script.tsv](https://github.com/unicode-org/cldr/blob/main/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/language_script.tsv) updated to include only one "Primary" writing system for languages that used to have multiple options ([CLDR-18114][]). Notable changes are:
-- Panjabi `pa` has the primary to Gurumukhi `Guru` because widespread usage is in the Gurumukhi script -- while most speakers are in Pakistan `PK`, written usage remains Gurumukhi.
+- [language_script.tsv](https://github.com/unicode-org/cldr/blob/main/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/language_script.tsv) updated to include only one "Primary" writing system for languages that used to have multiple options ([CLDR-18114][]).
+Notable changes are:
+- Punjabi `pa` has changed the primary script to Gurmukhi `Guru` because widespread usage is in the Gurmukhi script. While most speakers are in Pakistan `PK`, written usage remains Gurmukhi.
 - Azerbaijani `az` and Northern Kurdish `ku` primarily are used in Latin `Latn`.
 - Chinese languages `zh`, `hak`, and `nan` are matched to Simplified Han writing `Hans` -- except Cantonese `yue`, which is known for a preference in Traditional Han writing `Hant`.
 - Hassiniyya `mey` was missing significant data, it should be associated with the Arabic `Arab` writing system by default, not Latin `Latn`.
-- 5 new language distance values are added (for fallback to zh).
+- 5 new language distance values are added (for fallback to `zh`).
 - Substantial updates to Language Info: additional languages in countries; revised population values, writing percentages, literacy percentages, and official status values.
 
 #### Likely Subtags
@@ -203,7 +211,7 @@ For a full listing, see [BCP47 Delta].
 - Day periods for kok, scn, hi_Latn
 
 #### Plural Rules
-- additions for cv, ie, kok, sgs
+- Additions for cv, ie, kok, sgs
 
 #### Currencies
 - Updates to the latest ISO currencies
@@ -212,14 +220,12 @@ For a full listing, see [BCP47 Delta].
 - IS changed to firstDay=sun
 - ku_SY adding H and hB
 
-For a full listing, see [Supplemental Delta].
-
 ### Transforms
+For a full listing, see [Transforms Delta].
+
 - Fixed problem in Gujarati → Latin romanization, with ૰
 - Updated to latest Unicode 17 data for Han → Latin, with very many changes.
 
-For a full listing, see [Transforms Delta].
-
 ### Number Spellout Data Changes
 
 - The biggest change is to the format, which has been “flattened” for easier use by clients.
@@ -281,10 +287,10 @@ The following files are new in the release:
 
 ## Migration
 
-- Number patterns that did not have a specific numberSystem (such as latn or arab) had be deprecated for many releases, and were finally removed.
+- Number patterns that did not have a specific numberSystem (such as `latn` or `arab`) had been deprecated for many releases, and were finally removed.
 - Additionally, language and territory data in `languageData` and `territoryInfo` data received significant updates to improve accuracy and maintainability [CLDR-18087]
 - The likely language for Belarus changed to Russian [CLDR-14479]
-- **TBD Additional items plus future guidance will be added before the beta, on Oct 1.**
+- **TBD Additional items plus future guidance will be added before the spec-beta.**
 
 
 ### V49 advance warnings
@@ -305,8 +311,8 @@ These will be removed in the future, but note that they may be present in the ne
 
 ## Acknowledgments
 
-Many people have made significant contributions to CLDR and LDML;
-see the [Acknowledgments](/index/acknowledgments) page for a full listing.
+Many people have made significant contributions to CLDR and LDML.
+For a full listing, see the [Acknowledgments](https://www.unicode.org/reports/tr35/dev/tr35-acknowledgments.html).
 
 The Unicode [Terms of Use](https://unicode.org/copyright.html) apply to CLDR data;
 in particular, see [Exhibit 1](https://unicode.org/copyright.html#Exhibit1).