Skip to content

Conversation

@jowilco
Copy link
Contributor

@jowilco jowilco commented Oct 23, 2025

See https://github.com/unicode-org/sah/issues/675 and L2/25-187.

[184-C5] Consensus: Provisionally assign 5 code points U+18CD6..U+18CDA in the Khitan Small Script block for characters used in Jurchen Small Script as described in L2/25-164. [Ref. 1.2 in L2/25-187]

[184-C6] Consensus: Update the representative glyph of U+18C3E KHITAN SMALL SCRIPT CHARACTER-18C3E as described in L2/25-152, for Unicode Version 18.0. [Ref. 1.2 in L2/25-187]

Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR adds support for 5 Jurchen Small Script characters (U+18CD6..U+18CDA) provisionally assigned to the Khitan Small Script block, as per UTC decisions 184-C5 and 184-C6. The characters are integrated into the Unicode 18.0.0 release with appropriate properties and documentation.

Key Changes:

  • Added 5 new Jurchen Small Script characters (18CD6-18CDA) to the Khitan Small Script block
  • Updated character range from 18CD5 to 18CDA across all property files
  • Added property comparison test file for validating the new character additions

Reviewed Changes

Copilot reviewed 18 out of 18 changed files in this pull request and generated no comments.

Show a summary per file
File Description
231.txt Test configuration for comparing properties of newly added Jurchen characters with existing Khitan characters
DerivedName.txt Extended character range to include 18CDA and updated total code points
DerivedLineBreak.txt Updated line break properties for the extended range
DerivedGeneralCategory.txt Updated general category assignments for the new characters
DerivedEastAsianWidth.txt Extended East Asian Width property ranges
DerivedCombiningClass.txt Updated combining class properties
DerivedBidiClass.txt Extended bidirectional class assignments
WordBreakProperty.txt Updated word break property ranges
SentenceBreakProperty.txt Extended sentence break property ranges
VerticalOrientation.txt Updated vertical orientation properties
UnicodeData.txt Added character data entries for the 5 new characters
Scripts.txt Extended Khitan_Small_Script range and updated count
PropList.txt Updated Ideographic property ranges
NamesList.txt Added documentation and annotations for Jurchen characters
LineBreak.txt Extended line break property ranges
EastAsianWidth.txt Updated East Asian Width assignments
DerivedCoreProperties.txt Extended multiple core properties (Alphabetic, ID_Start, etc.)
DerivedAge.txt Assigned Age 18.0 to the new characters
Comments suppressed due to low confidence (1)

Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.

Copy link
Contributor

@roozbehp roozbehp left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants