docs/site/downloads/cldr-48.md
29 additions & 23 deletions
@@ -24,10 +24,11 @@ This data is also a factor in determining which languages are supported on mobil
24
24
### Changes
25
25
26
26
The most significant changes in this release are:
27
-
- Updated for Unicode 17, including new names and search terms for new emoji, new sort-order, Han→Latn romanization additions for many characters.
27
+
- Updated for Unicode 17, including new names and search terms for new emoji, new sort-order, Han → Latin romanization additions for many characters.
28
28
- Updated to the latest external standards and data sources, such as the language subtag registry, UN M49 macro regions, ISO 4217 currencies, etc.
29
29
- Many enhancements of the CLDR specification (LDML), including:
30
-
- Further additions to the Message Format 2.0 spec.
30
+
- **TBD A summary of the changes will be added for the spec-beta.**
31
+
**In the meantime, see the [Modifications section](https://www.unicode.org/reports/tr35/dev/tr35-modifications.html).**
31
32
- Many additions to language data including:
32
33
- Likely Subtags, for deriving the likely script and region from the language (used in many processes).
33
34
- Language populations in countries: significant updates to improve accuracy and maintainability.
@@ -77,11 +78,11 @@ For a full listing, see [Coverage Levels](https://unicode.org/cldr/charts/dev/su
77
78
78
79
The following are the most significant changes to the specification (LDML).
79
80
80
-
- **A summary of the changes will be added for the beta on Oct 1.**
81
-
**In the meantime, see [Modifications section](https://www.unicode.org/reports/tr35/proposed.html#Modifications) for the detailed modifications.**
81
+
- **TBD A summary of the changes will be added for the spec-beta.**
82
+
**In the meantime, see the [Modifications section](https://www.unicode.org/reports/tr35/dev/tr35-modifications.html).**
82
83
83
84
There are many more changes that are important to implementations, such as changes to certain identifier syntax and various algorithms.
84
-
See the [Modifications section](https://www.unicode.org/reports/tr35/proposed.html#Modifications) of the specification for details.
85
+
See the [Modifications section](https://www.unicode.org/reports/tr35/dev/tr35-modifications.html) of the specification for details.
85
86
86
87
## Data Changes
87
88
@@ -109,7 +110,7 @@ See the [Modifications section](https://www.unicode.org/reports/tr35/proposed.ht
109
110
- Chinese
110
111
- Gregorian
111
112
- Recent or upcoming currency names were added (XCG, ZWG).
112
-
- To match ISO, translations for the region Sark (CQ) was added.
113
+
- To match ISO, added translations for the region Sark (CQ).
113
114
- There are now combination formats for _relative_ dates + times, such as “tomorrow _at_ 12:30”.
114
115
In some languages the use of a relative date such as “tomorrow” or “2 days ago” requires a _different_ combining pattern than for a fixed date like “March 20”.
115
116
A new “relative” variant is introduced to allow for those languages.
@@ -122,7 +123,8 @@ The alphaNextToNumber patterns allow for a space between letter currency symbols
122
123
- For the Etc/Unknown timezone, the `exemplarCity` name was changed from “Unknown City” to “Unknown Location” for clarity.
123
124
- Rational number formats were added, allowing for formats like 5½.
124
125
- Certain concentration units were reworked, for “parts per million”, “parts per billion”.
125
-
- Additional units were added for scientific contexts (coulombs, farads, teslas, etc.) and for English systems (fortnights, imperial pints, etc.). However, translation of these English system names were not required.
126
+
- Additional units were added for scientific contexts (coulombs, farads, teslas, etc.) and for English systems (fortnights, imperial pints, etc.).
127
+
However, translation of these English system names was not required.
126
128
- Additional guidance on translation was added, leading to refined translations or transcreations.
127
129
128
130
#### Specific Locales
@@ -153,25 +155,30 @@ The explanations of usage are in the [Locale Changes](#locale-changes) section.
153
155
- This “flattens” the rules into a format that is easier for implementations to use directly.
154
156
155
157
#### supplementalData
156
-
- `era` — the range of `code` values nows allows two letters before the first hyphen.
157
-
- `languageData` — the `territories` attribute [`supplementalData.xml`](https://github.com/unicode-org/cldr/blob/main/common/supplemental/supplementalData.xml) was deprecated and data using it removed. The definition was unclear, and prone to misunderstanding — the more detailed data is in `territoryInfo`. ([CLDR-5708][])
158
+
- `era` — the range of `code` values now allows two letters before the first hyphen.
159
+
- `languageData` — the `territories` attribute [`supplementalData.xml`](https://github.com/unicode-org/cldr/blob/main/common/supplemental/supplementalData.xml) was deprecated and data using it removed.
160
+
The definition was unclear, and prone to misunderstanding — the more detailed data is in `territoryInfo`. ([CLDR-5708][])
158
161
- `usesMetazone` — adds two new attributes `stdOffset` and `dstOffset` so that implementations can use either "vanguard" or "rearguard" TZDB data sources.
159
162
- `numberingSystem` — Unicode 17 data was added.
160
163
161
164
#### ldmlBCP47
162
-
- `type` — adds a new attibute `region`.
165
+
- `type` — adds a new attribute `region`.
163
166
- `keyboard3@conformsTo` — is updated to allow "48".
164
167
165
168
### BCP47 Data Changes
169
+
170
+
For a full listing, see [BCP47 Delta].
171
+
166
172
- `nu-tols` numbering system for Tolong Siki digits
167
173
- One additional zone: America/Coyhaique = tz-clcxq
168
174
- Seven region attributes for determining regions for timezones
169
175
- Three additional aliases
170
176
171
-
For a full listing, see [BCP47 Delta].
172
177
173
178
### Supplemental Data Changes
174
179
180
+
For a full listing, see [Supplemental Delta].
181
+
175
182
#### Identifiers
176
183
- Added aliases/deprecations for languages (dek, mnk, nte).
177
184
- Updated to the latest language subtag registry, with various additions and deprecations.
@@ -180,12 +187,13 @@ For a full listing, see [BCP47 Delta].
180
187
- deprecated unit IDs permillion, portion, portion-per-1e9, 100-kilometer.
181
188
182
189
#### Language Data
183
-
- [language_script.tsv](https://github.com/unicode-org/cldr/blob/main/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/language_script.tsv) updated to include only one "Primary" writing system for languages that used to have multiple options ([CLDR-18114][]). Notable changes are:
184
-
- Panjabi `pa` has the primary to Gurumukhi `Guru` because widespread usage is in the Gurumukhi script -- while most speakers are in Pakistan `PK`, written usage remains Gurumukhi.
190
+
- [language_script.tsv](https://github.com/unicode-org/cldr/blob/main/tools/cldr-code/src/main/resources/org/unicode/cldr/util/data/language_script.tsv) updated to include only one "Primary" writing system for languages that used to have multiple options ([CLDR-18114][]).
191
+
Notable changes are:
192
+
- Punjabi `pa` has changed the primary script to Gurmukhi `Guru` because widespread usage is in the Gurmukhi script. While most speakers are in Pakistan `PK`, written usage remains Gurmukhi.
185
193
- Azerbaijani `az` and Northern Kurdish `ku` primarily are used in Latin `Latn`.
186
194
- Chinese languages `zh`, `hak`, and `nan` are matched to Simplified Han writing `Hans` -- except Cantonese `yue`, which is known for a preference in Traditional Han writing `Hant`.
187
195
- Hassiniyya `mey` was missing significant data, it should be associated with the Arabic `Arab` writing system by default, not Latin `Latn`.
188
-
- 5 new language distance values are added (for fallback to zh).
196
+
- 5 new language distance values are added (for fallback to `zh`).
189
197
- Substantial updates to Language Info: additional languages in countries; revised population values, writing percentages, literacy percentages, and official status values.
190
198
191
199
#### Likely Subtags
@@ -203,7 +211,7 @@ For a full listing, see [BCP47 Delta].
203
211
- Day periods for kok, scn, hi_Latn
204
212
205
213
#### Plural Rules
206
-
- additions for cv, ie, kok, sgs
214
+
- Additions for cv, ie, kok, sgs
207
215
208
216
#### Currencies
209
217
- Updates to the latest ISO currencies
@@ -212,14 +220,12 @@ For a full listing, see [BCP47 Delta].
212
220
- IS changed to firstDay=sun
213
221
- ku_SY adding H and hB
214
222
215
-
For a full listing, see [Supplemental Delta].
216
-
217
223
### Transforms
224
+
For a full listing, see [Transforms Delta].
225
+
218
226
- Fixed problem in Gujarati → Latin romanization, with ૰
219
227
- Updated to latest Unicode 17 data for Han → Latin, with very many changes.
220
228
221
-
For a full listing, see [Transforms Delta].
222
-
223
229
### Number Spellout Data Changes
224
230
225
231
- The biggest change is to the format, which has been “flattened” for easier use by clients.
@@ -281,10 +287,10 @@ The following files are new in the release:
281
287
282
288
## Migration
283
289
284
-
- Number patterns that did not have a specific numberSystem (such as latn or arab) had be deprecated for many releases, and were finally removed.
290
+
- Number patterns that did not have a specific numberSystem (such as `latn` or `arab`) had been deprecated for many releases, and were finally removed.
285
291
- Additionally, language and territory data in `languageData` and `territoryInfo` data received significant updates to improve accuracy and maintainability [CLDR-18087]
286
292
- The likely language for Belarus changed to Russian [CLDR-14479]
287
-
- **TBD Additional items plus future guidance will be added before the beta, on Oct 1.**
293
+
- **TBD Additional items plus future guidance will be added before the spec-beta.**
288
294
289
295
290
296
### V49 advance warnings
@@ -305,8 +311,8 @@ These will be removed in the future, but note that they may be present in the ne
305
311
306
312
## Acknowledgments
307
313
308
-
Many people have made significant contributions to CLDR and LDML;
309
-
see the [Acknowledgments](/index/acknowledgments) page for a full listing.
314
+
Many people have made significant contributions to CLDR and LDML.
315
+
For a full listing, see the [Acknowledgments](https://www.unicode.org/reports/tr35/dev/tr35-acknowledgments.html).
310
316
311
317
The Unicode [Terms of Use](https://unicode.org/copyright.html) apply to CLDR data;
312
318
in particular, see [Exhibit 1](https://unicode.org/copyright.html#Exhibit1).