| <a name="ufield" href="#ufield">`ufield`</a><br/>(Also known as `keyword`) | `= ukey (sep uvalue)? ;` |
519
-
| <a name="ukey" href="#ukey">`ukey`</a><br/>(Also known as `key`) | `= alphanum alpha ;`<br/>(Note that this is narrower than in [[RFC6067](https://www.ietf.org/rfc/rfc6067.txt)], so that it is disjoint with tkey.) | [`validity`](#Key_Type_Definitions)<br/>[`latest-data`](https://github.com/unicode-org/cldr/blob/maint/maint-47/common/bcp47) |
519
+
| <a name="ukey" href="#ukey">`ukey`</a><br/>(Also known as `key`) | `= alphanum alpha ;` | [`validity`](#Key_Type_Definitions)<br/>[`latest-data`](https://github.com/unicode-org/cldr/blob/maint/maint-47/common/bcp47) <br/>(Note that this is narrower than in [[RFC6067](https://www.ietf.org/rfc/rfc6067.txt)], so that it is disjoint with `tkey`.) |
520
520
| <a name="uvalue" href="#uvalue">`uvalue`</a><br/>(Also known as `type`) | `= alphanum{3,8}`<br/>` (sep alphanum{3,8})* ;` | [`validity`](#Key_Type_Definitions)<br/>[`latest-data`](https://github.com/unicode-org/cldr/blob/maint/maint-47/common/bcp47) |
@@ -575,8 +575,8 @@ A [`unicode_locale_id`](#unicode_locale_id) has _canonical syntax_ when:
575
575
* Any variants are in alphabetical order (eg, en-fonipa-scouse, not en-scouse-fonipa)
576
576
* Any extensions are in alphabetical order by their singleton (eg, en-t-xxx-u-yyy, not en-u-yyy-t-xxx)
577
577
* All attributes are sorted in alphabetical order.
578
-
* All keywords and tfields are sorted by alphabetical order of their keys, within their respective extensions.
579
-
* Any type or tfield value "true" is removed.
578
+
* All `ufield`s and `tfield`s are sorted by alphabetical order of their keys, within their respective extensions.
579
+
* Any `ufield` or `tfield` value "true" is removed.
580
580
581
581
For example, the canonical form of "en-u-foo-bar-nu-thai-ca-buddhist-kk-true" is "en-u-bar-foo-ca-buddhist-kk-nu-thai". The attributes `"foo"` and `"bar"` in this example are provided only for illustration; no attribute subtags are defined by the current CLDR specification.
582
582
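The three rules above that concern the `u` extension (sorted attributes, `ufield`s sorted by key, and removal of the value "true") can be sketched in a few lines of Python. This is only an illustrative sketch, not a conformant implementation: the function name `canonicalize_u_extension` is hypothetical, the input is assumed to be a well-formed tag using `-` as the separator, and the sketch does not reorder variants or other extensions, nor does it handle private-use (`-x-`) sequences.

```python
def canonicalize_u_extension(tag: str) -> str:
    """Sort attributes and ufields of the -u- extension and drop "true" values.

    Hypothetical helper for illustration; assumes a well-formed tag.
    """
    subtags = tag.split("-")
    lowered = [s.lower() for s in subtags]
    if "u" not in lowered:
        return tag  # no -u- extension, nothing to do in this sketch

    start = lowered.index("u")
    prefix = subtags[:start]          # language, script, region, variants
    rest = lowered[start + 1:]        # extension subtags use lowercase

    # The -u- extension ends at the next singleton (one-character subtag).
    end = next((i for i, s in enumerate(rest) if len(s) == 1), len(rest))
    body, suffix = rest[:end], rest[end:]

    # Attributes come first; the first two-character subtag is the first key.
    i = 0
    attributes = []
    while i < len(body) and len(body[i]) != 2:
        attributes.append(body[i])
        i += 1

    # Group the remaining subtags into key -> list of value subtags.
    fields = {}
    key = None
    for s in body[i:]:
        if len(s) == 2:
            key = s
            fields[key] = []
        else:
            fields[key].append(s)

    out = prefix + ["u"] + sorted(attributes)
    for key in sorted(fields):
        value = fields[key]
        out.append(key)
        if value != ["true"]:         # a value of "true" is removed
            out.extend(value)
    return "-".join(out + suffix)


print(canonicalize_u_extension("en-u-foo-bar-nu-thai-ca-buddhist-kk-true"))
# en-u-bar-foo-ca-buddhist-kk-nu-thai
```

Applied to the example string from the text, the sketch yields the canonical form given there.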
@@ -943,7 +943,7 @@ These subtags are all in lowercase (that is the canonical casing for these subta
943
943
944
944
**The -u- Extension.** The syntax of 'u' extension subtags is defined by the rule `unicode_locale_extensions` in [Unicode locale identifier](#Unicode_locale_identifier), except the separator of subtags `sep` must be always hyphen '-' when the extension is used as a part of BCP 47 language tag.
945
945
946
-
A 'u' extension may contain multiple `attribute`s or `keyword`s as defined in [Unicode locale identifier](#Unicode_locale_identifier). The canonical syntax is defined as in [Canonical Unicode Locale Identifiers](#Canonical_Unicode_Locale_Identifiers).
946
+
A 'u' extension may contain multiple `attribute`s or `ufield`s as defined in [Unicode locale identifier](#Unicode_locale_identifier). The canonical syntax is defined as in [Canonical Unicode Locale Identifiers](#Canonical_Unicode_Locale_Identifiers).
947
947
948
948
_See also [Unicode Extensions for BCP 47](https://cldr.unicode.org/index/bcp47-extension) on the CLDR site._
949
949
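As a rough illustration of the `ukey` and `uvalue` productions (`alphanum alpha` and `alphanum{3,8} (sep alphanum{3,8})*`), the following Python sketch checks candidate strings with regular expressions. The helper names `is_ukey` and `is_uvalue` are hypothetical, `sep` is assumed to be the hyphen required in a BCP 47 language tag, and input is lowercased before matching. The checks are purely syntactic; whether a key or type is actually valid depends on the CLDR data.

```python
import re

# ukey = alphanum alpha ;
UKEY = re.compile(r"[0-9a-z][a-z]")

# uvalue = alphanum{3,8} (sep alphanum{3,8})* ;  with sep assumed to be "-"
UVALUE = re.compile(r"[0-9a-z]{3,8}(?:-[0-9a-z]{3,8})*")


def is_ukey(s: str) -> bool:
    """Return True if s has the syntactic shape of a ukey."""
    return UKEY.fullmatch(s.lower()) is not None


def is_uvalue(s: str) -> bool:
    """Return True if s has the syntactic shape of a uvalue."""
    return UVALUE.fullmatch(s.lower()) is not None


print(is_ukey("nu"), is_ukey("m0"))               # True False
print(is_uvalue("islamic-civil"), is_uvalue("th"))  # True False
```

Because a `ukey` must end in a letter, a string such as `m0` (a letter followed by a digit) is rejected, which is consistent with the note that `ukey` is kept disjoint from `tkey`.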
@@ -1017,8 +1017,8 @@ The BCP 47 form for keys and types is the canonical form, and recommended. Other
1017
1017
<td><code>standard</code></td>
1018
1018
<td>The default ordering for each language. For root it is based on the [<a href="#DUCET">DUCET</a>] (Default Unicode Collation Element Table): see <i><a href="tr35-collation.md#Root_Collation">Root Collation</a></i>. Each other locale is based on that, except for appropriate modifications to certain characters for that language.</td></tr>
1019
1019
<tr><td><code>search</code></td>
1020
-
<td>A special collation type dedicated for string search—it is not used to determine the relative order of two strings, but only to determine whether they should be considered equivalent for the specified strength, using the string search matching rules appropriate for the language. Compared to the normal collator for the language, this may add or remove primary equivalences, may make additional characters ignorable or change secondary equivalences, and may modify contractions to allow matching within them, depending on the desired behavior. For example, in Czech, the distinction between ‘a’ and ‘á’ is secondary for normal collation, but primary for search; a search for ‘a’ should never match ‘á’ and vice versa. A search collator is normally used with strength set to PRIMARY or SECONDARY (should be SECONDARY if using “asymmetric” search as described in the [<a href="https://www.unicode.org/reports/tr41/#UTS10">UCA</a>] section Asymmetric Search). The search collator in root supplies matching rules that are appropriate for most languages (and which are different than the root collation behavior); language-specific search collators may be provided to override the matching rules for a given language as necessary.</td></tr>
1021
-
<tr><td colspan="2"><p>Other keywords provide additional choices for certain locales; <i>they only have effect in certain locales.</i></p></td></tr>
1020
+
<td>A special collation type dedicated for string search — it is not used to determine the relative order of two strings, but only to determine whether they should be considered equivalent for the specified strength, using the string search matching rules appropriate for the language. Compared to the normal collator for the language, this may add or remove primary equivalences, may make additional characters ignorable or change secondary equivalences, and may modify contractions to allow matching within them, depending on the desired behavior. For example, in Czech, the distinction between ‘a’ and ‘á’ is secondary for normal collation, but primary for search; a search for ‘a’ should never match ‘á’ and vice versa. A search collator is normally used with strength set to PRIMARY or SECONDARY (should be SECONDARY if using “asymmetric” search as described in the [<a href="https://www.unicode.org/reports/tr41/#UTS10">UCA</a>] section Asymmetric Search). The search collator in root supplies matching rules that are appropriate for most languages (and which are different than the root collation behavior); language-specific search collators may be provided to override the matching rules for a given language as necessary.</td></tr>
1021
+
<tr><td colspan="2"><p>Other ufields provide additional choices for certain locales; <i>they only have effect in certain locales.</i></p></td></tr>
1022
1022
<tr><td colspan="2">…</td></tr>
1023
1023
<tr><td><code>phonetic</code></td>
1024
1024
<td>Requests a phonetic variant if available, where text is sorted based on pronunciation. It may interleave different scripts, if multiple scripts are in common use.</td></tr>