Skip to content

Commit e69785d

Browse files
authored
CLDR-18463 Update translation.md (#4602)
1 parent f62119e commit e69785d

File tree

1 file changed

+67
-19
lines changed

1 file changed

+67
-19
lines changed

docs/site/translation.md

Lines changed: 67 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -33,38 +33,60 @@ Most of the following are relevant to locales at the Modern Coverage Level.
3333

3434
### New emoji
3535

36-
TBD - New emoji will be added the week of April 21st.
36+
New emoji will be added the week of April 21st.
37+
38+
### Core Data
39+
There are new Alphabetic Information items.
40+
- `numbers-auxiliary` — If there are are characters used in numbers that are not customarily used, but may occur, add them here instead of in `auxiliary`.
41+
- `punctuation-auxiliary` — If there are punctuation characters that are not customarily used, but may occur, add them here instead of in `auxiliary`.
42+
- `punctuation-person` — If there are punctuation characters that are customarily used in people's names in standard documents, add them here.
43+
This should be a small list such as “.” or “-”.
44+
Do **not** include ‘fanciful’ characters such as emoji or [kaomoji](https://en.wikipedia.org/wiki/Kaomoji).
3745

3846
### Locale display names
3947

4048
#### More language names
4149

42-
As new locales reach Basic Coverage, their language names have been added for locales targeting modern coverage: TBD
50+
As new locales reach Basic Coverage, their language names have been added for locales targeting modern coverage. These include: **TBD**
4351

4452
#### Languages whose English name changed
4553

46-
- tkl: English name changed to Tokelauan. [CLDR-11231](https://unicode-org.atlassian.net/browse/CLDR-11231)
54+
- tkl: English name changed to Tokelauan.
55+
<!-- Let's not add ticket numbers, except for Known Issues — there is already enough for vetters to read, and the tickets will often be confusing to people or take more time to read and puzzle out than is worth it. Fine to leave them in the document, but commented out, for our usage.
56+
[CLDR-11231](https://unicode-org.atlassian.net/browse/CLDR-11231) -->
57+
58+
#### Core/Extensions
59+
60+
There is a new mechanism for better menu names. When you see a **Code** with `-core` or `-extension`, please read [Core and Extensions](/displaynames/languagelocale-names#core-and-extensions).
4761

4862
#### Scripts
4963

5064
There are 5 new scripts for Unicode 17. Currently the names are in English: Beria Erfe, Chisoi, Sidetic, Tai Yo, Tolong Siki.
51-
Coverage for other languages is at comprehensive; if there is a need to have coverage at lower level in some locale,
52-
please file a ticket. [CLDR-18283](https://unicode-org.atlassian.net/browse/CLDR-18283)
65+
Coverage for other languages is at comprehensive.
66+
<!-- I don't think we want to encourage this: ... if there is a need to have coverage at lower level in some locale,
67+
please file a ticket. [CLDR-18283](https://unicode-org.atlassian.net/browse/CLDR-18283) -->
5368

54-
#### ISO 8601 calendar name
69+
#### ISO 8601 calendar
5570

5671
This is a variant of the Gregorian calendar whose formats always use year-month-day ordering and a 24-hour time cycle.
5772
The English name has changed to reflect that (and also added a variant); locales should update accordingly:
58-
-calendar-iso8601: Gregorian (Year First)
59-
-calendar-iso8601-variant: ISO 8601 Order
73+
- calendar-iso8601: Gregorian (Year First)
74+
- calendar-iso8601-variant: ISO 8601 Order
75+
76+
Please go though the ISO8601 fields. You should change separators to match what is acceptable in your language. However, do not change the ordering of the elements, which should be strictly the following order (for any that occur in a particular pattern):
77+
* year - month - day - day-of-week - hour - minute - second
6078

61-
[CLDR-18447](https://unicode-org.atlassian.net/browse/CLDR-18447)
79+
Also avoid changing the width of numeric fields (like `dd`).
80+
81+
<!-- Let's not add ticket numbers — there is already enough for vetters to read, and the tickets will often be confusing to people or take more time to read and puzzle out than is worth it. Fine to leave them here, commented out, for our usage.
82+
[CLDR-18447](https://unicode-org.atlassian.net/browse/CLDR-18447) -->
6283

6384
### DateTime formats
6485

6586
#### New “relative” variant for date-time combining pattern
6687

67-
There is a new “-relative” variant for [Date-Time Combined Formats](/translation/date-time/date-time-patterns#date-time-combined-formats). [CLDR-18350](https://unicode-org.atlassian.net/browse/CLDR-18350)
88+
There is a new “-relative” variant for [Date-Time Combined Formats](/translation/date-time/date-time-patterns#date-time-combined-formats).
89+
<!-- [CLDR-18350](https://unicode-org.atlassian.net/browse/CLDR-18350) -->
6890

6991
Before CLDR 48, there were two variants:
7092
- A “standard” variant for combining date with time, typically without literal text. In English this was “{1}, {0}”
@@ -82,11 +104,13 @@ fvfixed date with single time (as was the case in CLDR 47 and earlier), then for
82104

83105
#### New `gmtUnknownFormat`
84106

85-
Normally time zones formatted using UTC offset (like xxxx) use the `gmtFormat` pattern (“GMT{0}” in root). The new `gmtUnknownFormat` is used when formatting time zones using a UTC offset for cases when the offset or zone is unknown. The root value “GMT+?” need not be changed if it works for your locale; hoever it should be consistent with the `gmtFormat` and `gmtZeroFormat` in your locale. See [Time Zones and City names](translation/time-zones-and-city-names) [CLDR-18236](https://unicode-org.atlassian.net/browse/CLDR-18236)
107+
Normally time zones formatted using UTC offset (like xxxx) use the `gmtFormat` pattern (“GMT{0}” in root). The new `gmtUnknownFormat` is used when formatting time zones using a UTC offset for cases when the offset or zone is unknown. The root value “GMT+?” need not be changed if it works for your locale; however it should be consistent with the `gmtFormat` and `gmtZeroFormat` in your locale. See [Time Zones and City names](translation/time-zones-and-city-names)
108+
<!-- [CLDR-18236](https://unicode-org.atlassian.net/browse/CLDR-18236) -->
86109

87110
#### “Unknown City” → “Unknown Location”
88111

89-
For zone `Etc/Unknown`, the exemplarCity name was changed in English from “Unknown City” to “Unknown Location”; other locales should update accordingly. [CLDR-18262](https://unicode-org.atlassian.net/browse/CLDR-18262)
112+
For zone `Etc/Unknown`, the exemplarCity name was changed in English from “Unknown City” to “Unknown Location”; other locales should update accordingly.
113+
<!-- [CLDR-18262](https://unicode-org.atlassian.net/browse/CLDR-18262) -->
90114

91115
#### Changes to the root and/or English names of many exemplar cities and some metazones
92116

@@ -96,37 +120,61 @@ For zone `Etc/Unknown`, the exemplarCity name was changed in English from “Unk
96120

97121
#### Currency patterns alphaNextToNumber, noCurrency
98122

123+
<!-- this is unnecessary for vetters to worry about.
99124
There actually added in CLDR 42 per (CLDR-14336)[https://unicode-org.atlassian.net/browse/CLDR-14336]. However, they were not properly set up for coverage and inheritance, and were not presented to many vetters. These issue were corrected in CLDR 47 per [CLDR-17879](https://unicode-org.atlassian.net/browse/CLDR-17879), which adjusted the data for some locales (and made it draft="provisional"). Many vetters will see these for the first time in CLDR 48.
125+
-->
100126
- The `alphaNextToNumber` patterns should be used when currency symbol is alphabetic, such as “USD”; in this case the m=pattern may add a space to offset the currency symbol from the numeric value, if the standard pattern does not already include a space.
127+
- **Note that some currency units may only be alphabetic at the start or end, such as CA$ or $CA.
128+
This pattern will be used if an alphabetic character would end up being adjacent to a number in the regular pattern.
129+
So suppose that the regular pattern is "¤#,##0" and this pattern is "¤ #,##0":
130+
$CA would use this pattern ("$CA 123"), but CA$ would just use the regular pattern to get "CA$123".**
101131
- The `alphaNextToNumber` patterns should be used when the currency amount should be formatted without a currency symbol, as in a table of values all using the same currency. This pattern must not include the currency symbol pattern character ‘¤’.
102132

103133
For more information see [Number and currency patterns](/translation/number-currency-formats/number-and-currency-patterns).
104134

105135
#### Rational formats
106136

107-
These describe the formatting of rational fractions such as ¾ or combinations of integers and fractions such as 5½. [CLDR-17570](https://unicode-org.atlassian.net/browse/CLDR-17570)
137+
These describe the formatting of rational fractions such as ¾ or combinations of integers and fractions such as 5½.
138+
<!-- [CLDR-17570](https://unicode-org.atlassian.net/browse/CLDR-17570) -->
108139

109-
Here are the the English values and a short description of their purpose; for more information see (TBD):
140+
Here are the the English values and a short description of their purpose:
110141
- `rationalFormats-rationalPattern`: “{0}⁄{1}” - The format for a rational fraction with arbitrary numerator and denominator; the English pattern uses the Unicode character ‘⁄’ U+2044 FRACTION SLASH which causes composition of fractions such as 22⁄7.
111142
- `rationalFormats-integerAndRationalPattern`: “{0} {1}” - The format for combining an integer with a rational fraction composed using the pattern above; the English pattern uses U+202F NARROW NO-BREAK SPACE (NNBSP) to produce a small no-break space.
112-
- `rationalFormats-integerAndRationalPattern-superSub`: “{0}⁠{1}” - The format for combining an integer with a rational fraction using (TBD); the English pattern uses U+2060 WORD JOINER, a zero-width no-break space.
143+
- `rationalFormats-integerAndRationalPattern-superSub`: “{0}⁠{1}” - The format for combining an integer with a rational fraction using composed using the pattern above; the English pattern uses U+2060 WORD JOINER, a zero-width no-break space.
113144
- `rationalFormats-rationalUsage`: “sometimes” - An indication of the extent to which rational fractions are used in the locale; may be one of “never”, “sometimes”, ... (TBD)
114145

146+
If an integer and fraction (5½) is best expressed in your language with a space between them (5 ½),
147+
then copy the pattern from integerAndRationalPattern to integerAndRationalPattern-superSub.
148+
However, you **cannot** do the reverse.
149+
Some fonts and rendering systems don't properly handle the fraction slash, and the user would see something like 51/2 (fifty-one halves).
150+
So in that case, implementations must have the integerAndRationalPattern with a space in it to fall back on,
151+
unless they have verified that the font / rendering system supports superscripting the numerator.
152+
153+
See [Rational Formatting](https://cldr.unicode.org/translation/number-currency-formats/number-and-currency-patterns#rational-formatting) for more information.
154+
115155
### Units
116156

117157
#### Rework certain concentration units
118158

119-
The keys for two units changed (the translations can probably remain the same) and there is one new unit that is used for constructing certain other kinds of concentration units. [CLDR-18274](https://unicode-org.atlassian.net/browse/CLDR-18274):
159+
The keys for two units changed (the translations can probably remain the same) and there is one new unit that is used for constructing certain other kinds of concentration units:
160+
<!-- [CLDR-18274](https://unicode-org.atlassian.net/browse/CLDR-18274) -->
120161
- key `permillion` changed to `concentr-part-per-1e6`; English values remain “parts per million”, “{0} part per million”, etc.
121162
- key `portion-per-1e9` changed to `concentr-part-per-1e9`; English values remain “parts per billion”, “{0} part per billion”, etc.
122163
- new key `part` used for constructing arbitrary concentrations such as “parts per 100,000”; English values “parts”, “{0} part”, etc.
123164

165+
**Please check over the values for `concentr-part-per-1e6` and `concentr-part-per-1e9` in your locale.
166+
Some languages had used the equivalent of "millionths" instead of the equivalent of "parts per million".**
167+
124168
#### Many new units in English
125169

126-
Mnny new units were added in English. In general these are very specific and vetters will not be
170+
Mnny new units were added in English.
171+
Vetters will not be asked to translate the non-metric units for other languages.
172+
However, the metric ones are used in scientific contexts, and will need to be translated.
173+
However, the case inflections (accusative, dative, etc) will not be requested.
174+
<!-- In general these are very specific and vetters will not be
127175
asked to translate them for other locales, so coverage will be comprehensive. If some of these units
128176
would be useful in particualr locales, please file a ticekt and the coverage can be adjusted.
129-
[CLDR-18215](https://unicode-org.atlassian.net/browse/CLDR-18215)
177+
[CLDR-18215](https://unicode-org.atlassian.net/browse/CLDR-18215) -->
130178

131179
The units (English names) are:
132180
- angle: steradians
@@ -137,7 +185,7 @@ The units (English names) are:
137185
- energy: becquerels, British thermal units [IT], calories [IT], grays, sieverts
138186
- force: kilograms-force
139187
- length: chains, rods; jo [JP], ken [JP], ri [JP], rin [JP], shaku [cloth, JP], >shaku [JP], sun [JP] (Japanese units)
140-
- magnetic: teslas , webers
188+
- magnetic: teslas, webers
141189
- mass: slugs; fun [JP] (Japanese unit)
142190
- temperature: rankines
143191
- volume: metric fluid ounces; cups Imperial, pints Imperial; cup [JP], koku [JP], kosaji [JP], osaji [JP], sai [JP], shaku [volume, JP], to [JP] (Japanese units)

0 commit comments

Comments
 (0)