Commit 9c232c0

quanru and ottomao authored
docs(site): add v1.5 changelog (#2059)
* docs(site): add v1.5 changelog for both zh and en
* docs(core): update changelog
* docs(core): update changelog

---------

Co-authored-by: ottomao <ottomao@gmail.com>
1 parent b080edc commit 9c232c0

2 files changed: +94 -0 lines changed

apps/site/docs/en/changelog.mdx

Lines changed: 47 additions & 0 deletions
@@ -1,5 +1,52 @@
# Changelog

## v1.5 - HarmonyOS Support

v1.5 adds HarmonyOS automation and support for the Qwen3.5 and doubao-seed 2.0 models, along with improvements to desktop automation, the report system, the Chrome extension, and more.

### HarmonyOS Automation Support

The new `@midscene/harmony` package adds official support for HarmonyOS platform automation. Midscene's automation capabilities now extend beyond Web, Android, iOS, and Desktop to the HarmonyOS ecosystem.

### Qwen3.5 & doubao-seed 2.0 Model Support

Added support for the Qwen3.5 and doubao-seed 2.0 models, so developers can use newer models for better visual understanding.

### Generic Model Reasoning Configuration

The new `MIDSCENE_MODEL_REASONING_EFFORT` environment variable provides a generic reasoning-effort setting, so developers can control reasoning behavior uniformly across different models.
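As a rough illustration of checking this setting from a script, the sketch below reads the variable from an environment-like object. Only the variable name comes from the changelog. The helper `readReasoningEffort`, the `Env` type, and the placeholder value are assumptions made for this example and are not part of Midscene's API. The changelog does not list the accepted values, so the sketch validates nothing.

```typescript
// Hypothetical helper, not part of Midscene: it takes an environment-like
// object so the sketch does not depend on Node.js typings.
type Env = Record<string, string | undefined>;

const REASONING_EFFORT_KEY = "MIDSCENE_MODEL_REASONING_EFFORT";

// Returns the trimmed value, or undefined when the variable is unset or blank.
// Accepted values are not stated in the changelog, so none are checked here.
function readReasoningEffort(env: Env): string | undefined {
  const raw = env[REASONING_EFFORT_KEY];
  if (raw === undefined) return undefined;
  const trimmed = raw.trim();
  return trimmed === "" ? undefined : trimmed;
}

// "example-value" is a placeholder, not a documented setting.
const sampleEnv: Env = { MIDSCENE_MODEL_REASONING_EFFORT: " example-value " };
const effort = readReasoningEffort(sampleEnv);
```

In a Node.js process, passing `process.env` instead of `sampleEnv` would inspect the real environment. How Midscene itself consumes the variable is not described here.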

### Desktop Automation Improvements

- **Xvfb virtual display support**: Supports Xvfb virtual displays in headless Linux environments, enabling desktop automation on CI/CD servers without a GUI
- **Connection health check**: Added a health check when connecting for desktop automation, improving reliability
- **macOS input optimization**: All text input on macOS now goes through the clipboard to avoid IME issues
- **Mouse control failure detection**: Automatically detects mouse control failure and warns about admin privilege requirements
- **Stop execution optimization**: Checks the destroyed state so that screenshot operations abort promptly when execution is stopped

### Screenshot & Display Optimization

- **Custom screenshot shrink**: Supports a custom screenshot shrink ratio to improve performance while maintaining recognition accuracy
- **Android scalingRatio decoupling**: Decoupled scalingRatio from the size() method for improved flexibility

### Report System Improvements

- **More detailed timing**: Finer-grained timing information in reports helps developers analyze performance bottlenecks more precisely
- **Directory mode support for mergeReports**: `mergeReports` now supports directory-mode report files

### Chrome Extension Improvements

- **Always decline option**: The Chrome extension adds an "always decline" option and fixes a race condition in the confirm dialog
- **Close Bridge server after CLI**: The Bridge server now closes automatically after a CLI command completes, preventing lingering processes

### Bug Fixes

- Fixed `z.preprocess` handling in the input mode schema so that forms render correctly
- Fixed Android swipe parameter passing
- Fixed web size calculation
- Fixed the `BASE_URL_FIX_SCRIPT` closing tag not being recognized by the HTML parser
- Fixed the guard against an undefined page in the PlaywrightAgent/PuppeteerAgent constructors

## v1.4 - Skills: Let AI Assistants Control Your Devices

v1.4 introduces Midscene Skills, a set of installable skill packs for AI assistants like Claude Code and OpenClaw that lets them directly control browsers, desktops, Android, and iOS devices. This release also includes a standalone desktop MCP service, independent CLI entry points for each platform package, enhanced AI planning, and more.

apps/site/docs/zh/changelog.mdx

Lines changed: 47 additions & 0 deletions
@@ -1,5 +1,52 @@
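# Changelog

## v1.5 - HarmonyOS Automation Support

v1.5 adds HarmonyOS automation support and support for the Qwen3.5 and doubao-seed 2.0 models, along with a number of improvements to desktop automation, the report system, the Chrome extension, and more.

### New: HarmonyOS Automation Support

The new `@midscene/harmony` package adds official support for HarmonyOS platform automation. Midscene's automation capabilities now extend from Web, Android, iOS, and desktop to the HarmonyOS ecosystem.

### New: Qwen3.5 and doubao-seed 2.0 Model Support

Midscene has been adapted to the Tongyi Qianwen Qwen3.5 and Doubao doubao-seed 2.0 models, so developers can use newer models for better visual understanding.

### New: Generic Model Reasoning Configuration

The new `MIDSCENE_MODEL_REASONING_EFFORT` environment variable serves as a generic reasoning-effort setting, making it easier for developers to control reasoning behavior uniformly across different models.

### Desktop Automation Improvements

- **Xvfb virtual display support**: Supports Xvfb virtual displays in headless Linux environments, for desktop automation on CI/CD servers and other environments without a GUI
- **Connection health check**: Added a health check when connecting for desktop automation, improving connection reliability
- **macOS input optimization**: All text input on macOS now goes through the clipboard, avoiding input errors caused by input methods (IME)
- **Mouse control failure detection**: Automatically detects mouse control failure and warns about admin privilege requirements
- **Stop execution optimization**: Checks the destroyed state when execution is stopped so that screenshot operations are interrupted promptly, avoiding pointless waiting

### Screenshot & Display Optimization

- **Custom screenshot shrink**: Supports a custom screenshot shrink ratio to improve performance while preserving recognition accuracy
- **Android scaling ratio decoupling**: Decoupled scalingRatio from the size() method for improved flexibility

### Report System Improvements

- **More detailed timing**: Finer-grained timing information in reports helps developers analyze performance bottlenecks more precisely
- **Directory mode support for merged reports**: `mergeReports` now supports directory-mode report files

### Chrome Extension Improvements

- **New always decline option**: The Chrome extension adds an "always decline" option and fixes a race condition in the confirm dialog
- **Close Bridge server after CLI**: The Bridge server now closes automatically after a CLI command completes, avoiding lingering processes

### Bug Fixes

- Fixed `z.preprocess` handling in the input mode schema used for form rendering
- Fixed Android swipe parameter passing
- Fixed web size calculation
- Fixed the `BASE_URL_FIX_SCRIPT` closing tag not being recognized by the HTML parser
- Fixed the guard against an undefined page in the PlaywrightAgent/PuppeteerAgent constructors

## v1.4 - Skills: Let AI Assistants Control Your Devices Directly

v1.4 introduces Midscene Skills, a set of skill packs that can be installed into AI assistants such as Claude Code and OpenClaw, letting them directly control browsers, desktops, Android, and iOS devices. This release also includes a standalone desktop MCP service, independent CLI entry points for each platform, enhanced AI planning, and other improvements.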
