Skip to content

Chinese Text Search Limitation with FTS5 Unicode61 Tokenizer #6148

@cjr96720

Description

@cjr96720
  • I have searched open and closed issues for duplicates
  • I am submitting a bug report for existing functionality that does not work as intended
  • This isn't a feature request or a discussion topic

Bug description

When searching for Chinese text in Signal iOS, there's a limitation with how search results are matched.

Steps to reproduce

For example, when searching for "你好" (hello):

  • Send a message "前言 你好" and another "前言你好"
  • Navigate to the search bar and search for the keyword "你好"
  • Result:
    • ✅ Messages containing "前言 你好" (with a space) are found
    • ❌ Messages containing "前言你好" (without a space) are NOT found

Actual result: Messages containing "前言你好" (without a space) are NOT found

Expected result: Messages containing "前言你好" (without a space) should be found.

Screenshots

Device info

Device: iPhone 15 Pro

iOS version: 26.1.0

Signal version: 7.84

Link to debug log

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions