[209_9] 修复代码模式的中文换行显示问题 #2635

notfoundzzz · 2026-01-21T06:43:44Z

如何测试

启动 Mogan
插入以下任意代码环境之一（或其他支持的代码环境）：
- \code
- \python-code
- \cpp-code
- \r-code
输入一行足够长、包含中文字符的内容，例如：

z中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中

在开头输入字符以触发不同位置的自动换行

期望结果：

中文字符不会被拆分
不再出现 <#XXXX> 或在 < 处断裂的异常显示
中文字符要么完整出现在上一行，要么完整出现在下一行

测试文档: TeXmacs/tests/tmu/209_9.tmu

2026/1/21

What

修复在代码模式下（包括 \code、\python-code、\cpp-code 等环境）
中文字符在自动换行时被错误拆分、显示为 <#XXXX> 的问题。

Why

代码模式在自动换行时直接按字符串下标切分字符串，
当断行位置落在 <#XXXX> 内部时，会破坏内部转义结构，
最终导致渲染失败并显示为 <#XXXX>。

关联issue #2605

How

在 verb_language_rep::hyphenate 与 prog_language_rep::hyphenate 中
引入断行边界保护机制：

将 <#...> 内部转义序列视为不可拆分的原子
若断行位置落在原子内部，则向左吸附到最近的合法边界
仅在合法边界处对字符串进行切分

Copilot

Pull request overview

This PR fixes a rendering bug where Chinese characters and other CJK text were incorrectly split during automatic line wrapping in code environments, causing them to display as <#XXXX> escape sequences instead of the actual characters.

Changes:

Added boundary protection logic to prevent splitting TeXmacs internal escape sequences (<#...>) during line wrapping
Applied the fix to both verb_language_rep (for \code environments) and prog_language_rep (for language-specific code environments like \python-code, \cpp-code, etc.)
Added documentation and test cases

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

File	Description
src/System/Language/verb_language.cpp	Added helper functions to detect and preserve `<#...>` escape sequences during hyphenation in verbatim/code mode
src/System/Language/prog_language.cpp	Added identical helper functions for programming language code environments
devel/209_9.md	Added developer documentation explaining the issue, fix, and testing approach
TeXmacs/tests/tmu/209_9.tmu	Added test document with code examples in multiple environments to verify the fix

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/System/Language/verb_language.cpp

src/System/Language/prog_language.cpp

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-01-22T05:50:40Z

src/System/Language/code_wrap.hpp

+  if (after >= n) return n;
+
+  int i   = 0;
+  int last= 0;


The initialization of last to 0 could lead to returning 0 when no valid split point is found before the requested after position. Consider whether this is the intended behavior. If the entire beginning of the string consists of an escape sequence and after falls within it, the function will return 0, which would place the entire string in the right output and leave left empty. This might cause issues with line breaking algorithms that expect at least some progress. Consider documenting this edge case behavior or initializing last to after to ensure forward progress in line breaking.

Suggested change

int last= 0;

// Initialize last to 'after' so we guarantee forward progress even if no

// valid atom boundary is found at or before 'after'.

int last= after;

Copilot · 2026-01-22T05:50:40Z

TeXmacs/tests/tmu/209_9.tmu

+  code模式示例：
+
+  <\cpp-code>
+    <code|<code*|z中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中>>


Line 9 contains nested code tags <code|<code*|...>> while lines 15, 21, and 27 only use <code*|...>. This inconsistency appears to be unintentional. Consider removing the outer <code| tag to match the pattern used in the other examples.

Suggested change

<code|<code*|z中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中>>

<code*|z中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中>

src/System/Language/code_wrap.hpp

[209_9] 修复代码模式的中文换行显示问题

a5941eb

notfoundzzz force-pushed the zhl/209_9/fix-code-chinese-lexing branch from cd44abf to a5941eb Compare January 21, 2026 06:48

yinyuscloor requested a review from Copilot January 21, 2026 06:55

Copilot started reviewing on behalf of yinyuscloor January 21, 2026 06:56 View session

Copilot AI reviewed Jan 21, 2026

View reviewed changes

src/System/Language/verb_language.cpp Outdated Show resolved Hide resolved

src/System/Language/prog_language.cpp Outdated Show resolved Hide resolved

提取辅助函数

679c276

notfoundzzz force-pushed the zhl/209_9/fix-code-chinese-lexing branch from 60d46a7 to 679c276 Compare January 21, 2026 08:15

yinyuscloor approved these changes Jan 21, 2026

View reviewed changes

yinyuscloor requested a review from Copilot January 22, 2026 05:44

Copilot started reviewing on behalf of yinyuscloor January 22, 2026 05:45 View session

Copilot AI reviewed Jan 22, 2026

View reviewed changes

da-liii requested a review from yinyuscloor January 26, 2026 03:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[209_9] 修复代码模式的中文换行显示问题 #2635

[209_9] 修复代码模式的中文换行显示问题 #2635

notfoundzzz commented Jan 21, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Jan 22, 2026

Uh oh!

Copilot AI Jan 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

-  int last= 0;
+  // Initialize last to 'after' so we guarantee forward progress even if no
+  // valid atom boundary is found at or before 'after'.
+  int last= after;

	<code\|<code*\|z中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中>>
	<code*\|z中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中中>

[209_9] 修复代码模式的中文换行显示问题 #2635

Are you sure you want to change the base?

[209_9] 修复代码模式的中文换行显示问题 #2635

Conversation

notfoundzzz commented Jan 21, 2026

如何测试

2026/1/21

What

Why

How

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Jan 22, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 22, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants