|
1 | 1 | # Changelog |
2 | 2 |
|
| 3 | +## v1.5 - HarmonyOS Support |
| 4 | + |
| 5 | +v1.5 adds HarmonyOS automation support, Qwen3.5 and doubao-seed 2.0 model support, along with multiple improvements to desktop automation, report system, Chrome extension, and more. |
| 6 | + |
| 7 | +### HarmonyOS Automation Support |
| 8 | + |
| 9 | +New `@midscene/harmony` package officially supports HarmonyOS platform automation. Midscene's automation capabilities now extend from Web, Android, iOS, and Desktop to the HarmonyOS ecosystem. |
| 10 | + |
| 11 | +### Qwen3.5 & doubao-seed 2.0 Model Support |
| 12 | + |
| 13 | +Added support for Qwen3.5 and doubao-seed 2.0 models, allowing developers to leverage newer models for better visual understanding. |
| 14 | + |
| 15 | +### Generic Model Reasoning Configuration |
| 16 | + |
| 17 | +New `MIDSCENE_MODEL_REASONING_EFFORT` environment variable provides a generic model reasoning effort configuration, enabling developers to uniformly control reasoning behavior across different models. |
| 18 | + |
| 19 | +### Desktop Automation Improvements |
| 20 | + |
| 21 | +- **Xvfb virtual display support**: Support Xvfb virtual display for headless Linux environments, enabling desktop automation on CI/CD servers without GUI |
| 22 | +- **Connection health check**: Added health check during desktop automation connection for improved reliability |
| 23 | +- **macOS input optimization**: All text input on macOS now uses clipboard to avoid IME issues |
| 24 | +- **Mouse control failure detection**: Automatically detects mouse control failure and warns about admin privilege requirements |
| 25 | +- **Stop execution optimization**: Checks destroyed state to abort screenshot operations promptly when stopping execution |
| 26 | + |
| 27 | +### Screenshot & Display Optimization |
| 28 | + |
| 29 | +- **Custom screenshot shrink**: Support custom screenshot shrink ratio to optimize performance while maintaining recognition accuracy |
| 30 | +- **Android scalingRatio decoupling**: Decoupled scalingRatio from size() method for improved flexibility |
| 31 | + |
| 32 | +### Report System Improvements |
| 33 | + |
| 34 | +- **More detailed timing**: Finer-grained timing information in reports helps developers analyze performance bottlenecks more precisely |
| 35 | +- **Directory mode support for mergeReports**: `mergeReports` now supports directory mode report files |
| 36 | + |
| 37 | +### Chrome Extension Improvements |
| 38 | + |
| 39 | +- **Always decline option**: Chrome extension adds "always decline" option with confirm race condition fix |
| 40 | +- **Close Bridge server after CLI**: Bridge server automatically closes after CLI command completes, preventing lingering processes |
| 41 | + |
| 42 | +### Bug Fixes |
| 43 | + |
| 44 | +- Fixed `z.preprocess` handling in input mode schema for correct form rendering |
| 45 | +- Fixed Android swipe parameter passing |
| 46 | +- Fixed web size calculation |
| 47 | +- Fixed `BASE_URL_FIX_SCRIPT` closing tag not recognized by HTML parser |
| 48 | +- Fixed undefined page guard in PlaywrightAgent/PuppeteerAgent constructors |
| 49 | + |
3 | 50 | ## v1.4 - Skills: Let AI Assistants Control Your Devices |
4 | 51 |
|
5 | 52 | v1.4 introduces Midscene Skills — a set of installable skill packs for AI assistants like Claude Code and OpenClaw, enabling them to directly control browsers, desktops, Android, and iOS devices. This release also includes a standalone desktop MCP service, independent CLI entry points for each platform package, enhanced AI planning, and more. |
|
0 commit comments