Emoji 18.0 alpha data by nedley · Pull Request #1279 · unicode-org/unicodetools

nedley · 2026-01-31T00:22:55Z

https://github.com/unicode-org/utc-release-management/issues/264
(From ESC)

[185-C31] Consensus: Accept nine (9) new emoji characters with the following character names, based on Section 1 of document L2/25-230, for Unicode Version 18.0:

    1F6D9 LIGHTHOUSE
    1FA8B METEOR
    1FA8C ERASER
    1FA8D NET WITH HANDLE
    1FACC MONARCH BUTTERFLY
    1FADD PICKLE
    1FAEB FACE WITH SQUINTING EYES
    1FAF9 LEFTWARDS THUMB SIGN
    1FAFA RIGHTWARDS THUMB SIGN

[186-C2] Consensus: Change the character approved for Unicode Version 18.0 at U+1FAEB from FACE WITH SQUINTING EYES to CRACKING FACE, based on Section 1 of document L2/26-008 and L2/26-048.

I tried to keep track of the steps I took but as always I start getting a little woozy at some point such that some of the later steps may be slightly out of order. The steps themselves are below but basically what it amounts to is I keep re-running GenerateEmoji.java until it stops throwing errors and the data files themselves stop changing, meaning they have been completely bootstrapped.

Set "emoji-beta" as described in Emoji.java
Update candidateData.txt with new emoji with Status=Draft Candidate
Update docRegistry.txt as appropriate
Run GenerateEmoji.java, adding new emoji to data files as E0.0
Run GenerateEmoji.java again, complains about ordering
Add new emoji to emojiOrdering.txt based on candidateData.txt
Run CandidateData.java, add result to proposalData.txt
Run GenerateEmoji.java one last time, updates new emoji with correct version

nedley · 2026-01-31T02:28:42Z

@eggrobin Apparently this is my first attempt at adding characters rather than simply reviewing. Do I need to manually edit the UCD files or is there a more streamlined process?

nedley · 2026-01-31T03:25:58Z

I tried adding the new characters to dev/UnicodeData.txt and running MakeUnicodeFiles.java with cleanAndCopy, that seems to have almost done what I want but it also made some unexpected changes to DerivedAge.txt…

eggrobin · 2026-01-31T23:09:31Z

Do I need to manually edit the UCD files or is there a more streamlined process?

The process is described here (here I expect it would basically mean add to UnicodeData, add to Scripts, and as you found, regenerate UCD with MakeUnicodeFiles).

eggrobin · 2026-01-31T23:11:05Z

it also made some unexpected changes to DerivedAge.txt…

What changes? (Remember to clear the stupid BIN cache which never works, see #1125. I will go right ahead and fix this issue, this is utterly ridiculous.)

nedley · 2026-02-02T18:13:11Z

What changes?

Some characters ended up with the wrong age, e.g. the ARCHAIC SHRII characters had 16.0

the stupid BIN cache which never works

That was my problem! Will push up something shortly.

nedley · 2026-02-02T19:27:00Z

Where did the Derived files go? Whoops…

nedley · 2026-02-02T19:29:56Z

So, rather than fixing all of the dumb merge conflicts in extracted/ I deleted the files, assuming they would be entirely rebuilt. Let that be a lesson, I suppose.

unicodetools/data/ucd/dev/extracted/DerivedJoiningType.txt

nedley · 2026-02-02T20:07:12Z

For some reason the extracted/ files still don’t have the new characters. Help me @eggrobin, you’re my only hope.

eggrobin · 2026-02-02T21:00:42Z

I ran the following commands (Windows, in-source; paths and syntactic details will probably differ for you).

mvn compile exec:java '-Dexec.mainClass="org.unicode.text.UCD.MakeUnicodeFiles"'  '-Dexec.args="-c"' -am -pl unicodetools  "-DCLDR_DIR=..\cldr\"  "-DUNICODETOOLS_GEN_DIR=Generated"  "-DUNICODETOOLS_REPO_DIR=."
git commit -am "Regenerate UCD"
n compile exec:java '-Dexec.mainClass="org.unicode.tools.GenerateLinkData"' -am -pl unicodetools  "-DCLDR_DIR=..\cldr\"  "-DUNICODETOOLS_GEN_DIR=Generated"  "-DUNICODETOOLS_REPO_DIR=."
git add *LinkTerm.txt
git commit -m "And regenerate LinkTerm too"

I need to make MakeUnicodeFiles regenerate link data so I don’t need to have those last three lines, this is getting annoying.

P.-S.: GitHub actions seems to be in a bad mood and isn’t giving us runners to run the tests.

nedley · 2026-02-02T22:24:00Z

P.-S.: GitHub actions seems to be in a bad mood and isn’t giving us runners to run the tests.

They’re probably just upset with all the dumb stuff I made them check.

… enough)

eggrobin · 2026-02-02T23:44:02Z

GitHub Actions stopped sulking and complained about a couple of things. Hopefully fixed now.

nedley requested a review from eggrobin January 31, 2026 00:22

nedley marked this pull request as draft January 31, 2026 01:50

nedley force-pushed the ned/emoji_18_alpha branch from 954f0d6 to a1199e6 Compare January 31, 2026 01:52

nedley marked this pull request as ready for review February 2, 2026 18:40

nedley added 2 commits February 2, 2026 10:54

Emoji 18.0 alpha data

64be36b

Compare hands against one in the same block

ddc3823

nedley force-pushed the ned/emoji_18_alpha branch 2 times, most recently from baf7bca to b502bd6 Compare February 2, 2026 19:26

nedley force-pushed the ned/emoji_18_alpha branch from b502bd6 to 29f7fea Compare February 2, 2026 19:29

nedley commented Feb 2, 2026

View reviewed changes

unicodetools/data/ucd/dev/extracted/DerivedJoiningType.txt Show resolved Hide resolved

Update UCD

a1b326c

nedley force-pushed the ned/emoji_18_alpha branch from 29f7fea to a1b326c Compare February 2, 2026 20:02

eggrobin added 2 commits February 2, 2026 21:38

Regenerate UCD

99687a6

And regenerate LinkTerm too

9c4c7cc

eggrobin added data-for-new pipeline-18.0 labels Feb 2, 2026

eggrobin added 3 commits February 3, 2026 00:40

typo in code point for 😀 (the extended \N{:} escapes cannot come soon…

8f65087

… enough)

lb=EB for hands

8c1a60e

Regenerate UCD

c84132a

eggrobin previously approved these changes Feb 3, 2026

View reviewed changes

Merge remote-tracking branch 'la-vache/main' into ned/emoji_18_alpha

6c2bb95

eggrobin dismissed their stale review via 6c2bb95 February 3, 2026 11:41

eggrobin approved these changes Feb 3, 2026

View reviewed changes

eggrobin merged commit 803d843 into main Feb 3, 2026
16 checks passed

eggrobin deleted the ned/emoji_18_alpha branch February 3, 2026 12:06

Uh oh!

Comments

Conversation

nedley commented Jan 31, 2026 • edited by eggrobin Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nedley commented Jan 31, 2026

Uh oh!

nedley commented Jan 31, 2026

Uh oh!

eggrobin commented Jan 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eggrobin commented Jan 31, 2026

Uh oh!

nedley commented Feb 2, 2026

Uh oh!

nedley commented Feb 2, 2026

Uh oh!

nedley commented Feb 2, 2026

Uh oh!

Uh oh!

nedley commented Feb 2, 2026

Uh oh!

eggrobin commented Feb 2, 2026

Uh oh!

nedley commented Feb 2, 2026

Uh oh!

eggrobin commented Feb 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

nedley commented Jan 31, 2026 •

edited by eggrobin

Loading

eggrobin commented Jan 31, 2026 •

edited

Loading