Skip to content

Commit 8fb77e4

Browse files
authored
Fix includes in table for CharUnicodeInfo (#5047)
1 parent d552b5e commit 8fb77e4

File tree

1 file changed

+20
-45
lines changed

1 file changed

+20
-45
lines changed

xml/System.Globalization/CharUnicodeInfo.xml

Lines changed: 20 additions & 45 deletions
Original file line numberDiff line numberDiff line change
@@ -50,10 +50,11 @@
5050
<remarks>
5151
<format type="text/markdown"><![CDATA[
5252
53-
## Remarks
54-
The [Unicode Standard](https://go.microsoft.com/fwlink/?linkid=37123) defines a number of Unicode character categories. For example, a character might be categorized as an uppercase letter, a lowercase letter, a decimal digit number, a letter number, a paragraph separator, a math symbol, or a currency symbol. Your application can use the character category to govern string-based operations, such as parsing or extracting substring with regular expressions. The <xref:System.Globalization.UnicodeCategory> enumeration defines the possible character categories.
53+
## Remarks
54+
55+
The [Unicode Standard](https://home.unicode.org/) defines a number of Unicode character categories. For example, a character might be categorized as an uppercase letter, a lowercase letter, a decimal digit number, a letter number, a paragraph separator, a math symbol, or a currency symbol. Your application can use the character category to govern string-based operations, such as parsing or extracting substring with regular expressions. The <xref:System.Globalization.UnicodeCategory> enumeration defines the possible character categories.
5556
56-
You use the <xref:System.Globalization.CharUnicodeInfo> class to obtain the <xref:System.Globalization.UnicodeCategory> value for a specific character. The <xref:System.Globalization.CharUnicodeInfo> class defines methods that return the following Unicode character values:
57+
Use the <xref:System.Globalization.CharUnicodeInfo> class to obtain the <xref:System.Globalization.UnicodeCategory> value for a specific character. The <xref:System.Globalization.CharUnicodeInfo> class defines methods that return the following Unicode character values:
5758
5859
- The specific category to which a character or surrogate pair belongs. The value returned is a member of the <xref:System.Globalization.UnicodeCategory> enumeration.
5960
@@ -63,21 +64,19 @@
6364
6465
- Decimal digit value. Applies only to characters that represent decimal digits in the decimal (base 10) system. A decimal digit can be one of ten digits, from zero through nine. These characters are members of the <xref:System.Globalization.UnicodeCategory.DecimalDigitNumber?displayProperty=nameWithType> category.
6566
66-
In addition, the <xref:System.Globalization.CharUnicodeInfo> class is used internally by a number of other .NET Framework types and methods that rely on character classification. These include:
67+
In addition, the <xref:System.Globalization.CharUnicodeInfo> class is used internally by a number of other .NET types and methods that rely on character classification. These include:
6768
6869
- The <xref:System.Globalization.StringInfo> class, which works with textual elements instead of single characters in a string.
6970
7071
- The overloads of the <xref:System.Char.GetUnicodeCategory%2A?displayProperty=nameWithType> method, which determine the category to which a character or surrogate pair belongs.
7172
72-
- The [character classes](/dotnet/standard/base-types/character-classes-in-regular-expressions) recognized by <xref:System.Text.RegularExpressions.Regex>, the .NET Framework's regular expression engine.
73+
- The [character classes](/dotnet/standard/base-types/character-classes-in-regular-expressions) recognized by <xref:System.Text.RegularExpressions.Regex>, .NET's regular expression engine.
7374
74-
When using this class in your applications, keep in mind the following programming considerations for using the <xref:System.Char> type. The type can be difficult to use, and strings are generally preferable for representing linguistic content.
75+
When using this class in your applications, keep in mind the following programming considerations for using the <xref:System.Char> type. The type can be difficult to use, and strings are generally preferable for representing linguistic content.
7576
7677
- A <xref:System.Char> object does not always correspond to a single character. Although the <xref:System.Char> type represents a single 16-bit value, some characters (such as grapheme clusters and surrogate pairs) consist of two or more UTF-16 code units. For more information, see "Char Objects and Unicode Characters" in the <xref:System.String> class.
7778
78-
- The notion of a "character" is also flexible. A character is often thought of as a glyph, but many glyphs require multiple code points. For example, ä can be represented either by two code points ("a" plus U+0308, which is the combining diaeresis), or by a single code point ("ä" or U+00A4). Some languages have many letters, characters, and glyphs that require multiple code points, which can cause confusion in linguistic content representation. For example, there is a ΰ (U+03B0, Greek small letter upsilon with dialytika and tonos), but there is no equivalent capital letter. Uppercasing such a value simply retrieves the original value.
79-
80-
79+
- The notion of a "character" is also flexible. A character is often thought of as a glyph, but many glyphs require multiple code points. For example, ä can be represented either by two code points ("a" plus U+0308, which is the combining diaeresis), or by a single code point ("ä" or U+00A4). Some languages have many letters, characters, and glyphs that require multiple code points, which can cause confusion in linguistic content representation. For example, there is a ΰ (U+03B0, Greek small letter upsilon with dialytika and tonos), but there is no equivalent capital letter. Uppercasing such a value simply retrieves the original value.
8180
8281
## Examples
8382
The following code example shows the values returned by each method for different types of characters.
@@ -89,43 +88,19 @@
8988
]]></format>
9089
</remarks>
9190
<block subset="none" type="usage">
92-
<para>Recognized characters and the specific categories to which they belong are defined by the Unicode standard and can change from one version of the Unicode Standard to another. Categorization of characters in a particular version of the .NET Framework is based on a single version of the Unicode Standard regardless of the underlying operating system on which the .NET Framework is running. The following table lists versions of the .NET Framework since the [!INCLUDE[net_v40_long](~/includes/net-v40-long-md.md)] and the versions of the Unicode Standard used to classify characters.
93-
94-
<list type="table"><listheader><term> .NET Framework version
95-
96-
</term><description> Version of the Unicode Standard
97-
98-
</description></listheader><item><term> [!INCLUDE[net_v40_long](~/includes/net-v40-long-md.md)]
99-
100-
</term><description> [The Unicode Standard, Version 5.0.0](https://www.unicode.org/versions/Unicode5.0.0/)
101-
102-
</description></item><item><term> [!INCLUDE[net_v45](~/includes/net-v45-md.md)]
103-
104-
</term><description> [The Unicode Standard, Version 5.0.0](https://www.unicode.org/versions/Unicode5.0.0/)
105-
106-
</description></item><item><term> [!INCLUDE[net_v451](~/includes/net-v451-md.md)]
107-
108-
</term><description> [The Unicode Standard, Version 5.0.0](https://www.unicode.org/versions/Unicode5.0.0/)
109-
110-
</description></item><item><term> [!INCLUDE[net_v452](~/includes/net-v452-md.md)]
111-
112-
</term><description> [The Unicode Standard, Version 5.0.0](https://www.unicode.org/versions/Unicode5.0.0/)
113-
114-
</description></item><item><term> [!INCLUDE[net_v46](~/includes/net-v46-md.md)]
115-
116-
</term><description> [The Unicode Standard, Version 6.3.0](https://www.unicode.org/versions/Unicode6.3.0/)
117-
118-
</description></item><item><term> [!INCLUDE[net_v461](~/includes/net-v461-md.md)]
119-
120-
</term><description> [The Unicode Standard, Version 6.3.0](https://www.unicode.org/versions/Unicode6.3.0/)
121-
122-
</description></item><item><term> [!INCLUDE[net_v462](~/includes/net-v462-md.md)]
123-
124-
</term><description> [The Unicode Standard, Version 8.0.0](https://www.unicode.org/versions/Unicode8.0.0/)
125-
126-
</description></item></list>
91+
<para>Recognized characters and the specific categories to which they belong are defined by the Unicode standard and can change from one version of the Unicode Standard to another. Categorization of characters in a particular version of .NET Framework is based on a single version of the Unicode Standard, regardless of the underlying operating system on which .NET Framework is running. The following table lists versions of .NET Framework since .NET Framework 4 and the versions of the Unicode Standard used to classify characters.
92+
93+
| .NET Framework version | Unicode Standard version |
94+
| - | - |
95+
| [!INCLUDE[net_v40_long](~/includes/net-v40-long-md.md)] | [5.0.0](https://www.unicode.org/versions/Unicode5.0.0/) |
96+
| [!INCLUDE[net_v45](~/includes/net-v45-md.md)] | [5.0.0](https://www.unicode.org/versions/Unicode5.0.0/) |
97+
| [!INCLUDE[net_v451](~/includes/net-v451-md.md)] | [5.0.0](https://www.unicode.org/versions/Unicode5.0.0/) |
98+
| [!INCLUDE[net_v452](~/includes/net-v452-md.md)] | [5.0.0](https://www.unicode.org/versions/Unicode5.0.0/) |
99+
| [!INCLUDE[net_v46](~/includes/net-v46-md.md)] | [6.3.0](https://www.unicode.org/versions/Unicode6.3.0/) |
100+
| [!INCLUDE[net_v461](~/includes/net-v461-md.md)] | [6.3.0](https://www.unicode.org/versions/Unicode6.3.0/) |
101+
| [!INCLUDE[net_v462](~/includes/net-v462-md.md)] | [8.0.0](https://www.unicode.org/versions/Unicode8.0.0/) |
127102

128-
Each version of the Unicode standard includes information on changes to the Unicode character database since the previous version. The Unicode character database is used by the <see cref="T:System.Globalization.CharUnicodeInfo" /> class for categorizing characters.</para>
103+
Each version of the Unicode standard includes information on changes to the Unicode character database since the previous version. The Unicode character database is used by the <see cref="T:System.Globalization.CharUnicodeInfo" /> class for categorizing characters.</para>
129104
</block>
130105
<altmember cref="T:System.Globalization.UnicodeCategory" />
131106
<altmember cref="N:System.Text" />

0 commit comments

Comments
 (0)