Android Use: Automate Android phones using AI agents

android-use is a tool for automating Android phones with natural language commands: you describe a task in plain language and an AI agent carries it out on the device.

✨ Features

  1. Precise Positioning & Interaction: Parses the screen's XML hierarchy to highlight interactive elements and supports clicking an element by its index, which makes it much easier for the Agent to locate its target.
  2. Broad Model Support: Supports models from Chinese providers such as DeepSeek, Kimi-k2, and GLM. Because it relies on XML parsing, it works effectively even with models that lack vision capabilities.
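
The index-based clicking described above can be illustrated with a minimal sketch. It assumes a UI hierarchy in the XML format written by `adb shell uiautomator dump`; the function name `list_clickable` and the sample XML are hypothetical and are not taken from android-use's own code.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample in the format written by `uiautomator dump`.
SAMPLE_XML = """<hierarchy rotation="0">
  <node text="" class="android.widget.FrameLayout" clickable="false" bounds="[0,0][1080,1920]">
    <node text="Search" class="android.widget.Button" clickable="true" bounds="[100,200][300,300]" />
    <node text="Title" class="android.widget.TextView" clickable="false" bounds="[0,0][1080,100]" />
    <node text="Settings" class="android.widget.Button" clickable="true" bounds="[400,200][600,300]" />
  </node>
</hierarchy>"""

# Bounds are encoded as "[left,top][right,bottom]" in pixels.
BOUNDS_RE = re.compile(r"\[(\d+),(\d+)\]\[(\d+),(\d+)\]")


def list_clickable(xml_text):
    """Return clickable nodes as dicts with an index, a label, and a tap point."""
    elements = []
    for node in ET.fromstring(xml_text).iter("node"):
        if node.get("clickable") != "true":
            continue
        match = BOUNDS_RE.fullmatch(node.get("bounds", ""))
        if match is None:
            continue  # skip nodes whose bounds cannot be parsed
        x1, y1, x2, y2 = map(int, match.groups())
        elements.append({
            "index": len(elements),
            "text": node.get("text", ""),
            "center": ((x1 + x2) // 2, (y1 + y2) // 2),
        })
    return elements


elements = list_clickable(SAMPLE_XML)
for el in elements:
    print(el["index"], el["text"], el["center"])
```

A text-only model can then be asked to choose an index from this list, and the matching center point can be tapped, for example with `adb shell input tap <x> <y>`.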

🚀 Getting Started

1. Prerequisites

Before using this tool, ensure USB debugging is enabled on your Android device.

  • Device Requirements: Android 7.0+.
  • Enable Developer Mode:
    • Go to Settings -> About Phone.
    • Find Build Number and tap it repeatedly (about 10 times).
    • Wait for the "You are now a developer!" toast message.
    • Note: Steps may vary slightly by phone manufacturer. Search online if you can't find it.
  • Enable USB Debugging:
    • Go to Settings -> Developer Options.
    • Find and enable USB Debugging.
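
Once USB debugging is enabled, it can help to confirm that the computer actually sees the phone. The sketch below assumes the Android platform tools are installed and `adb` is on your PATH, which this README does not state; the helper names `parse_adb_devices` and `connected_devices` are hypothetical.

```python
import subprocess


def parse_adb_devices(output):
    """Map each serial in `adb devices` output to its state."""
    devices = {}
    for line in output.splitlines():
        # Skip the header and any "* daemon ..." status lines.
        if line.startswith("*") or line.startswith("List of devices"):
            continue
        parts = line.split()
        if len(parts) >= 2:
            devices[parts[0]] = parts[1]
    return devices


def connected_devices():
    """Run `adb devices`; return {} if adb is missing or does not respond."""
    try:
        result = subprocess.run(
            ["adb", "devices"], capture_output=True, text=True, timeout=10
        )
    except (OSError, subprocess.TimeoutExpired):
        return {}
    return parse_adb_devices(result.stdout)


if __name__ == "__main__":
    for serial, state in connected_devices().items():
        print(serial, state)
```

A state of `device` means the phone is ready. A state of `unauthorized` usually means the authorization prompt on the phone has not been accepted yet.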

2. Install uv

Install the uv package manager from the official website.

MacOS/Linux

curl -LsSf https://astral.sh/uv/install.sh | sh

Windows

powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
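
After installing, you may want to verify that `uv` is reachable from your shell. This is a minimal sketch, not part of android-use; the helper name `tool_version` is hypothetical, and it relies only on `uv --version`.

```python
import shutil
import subprocess


def tool_version(command):
    """Return the output of `<command> --version`, or None if it is unavailable."""
    if shutil.which(command) is None:
        return None
    try:
        result = subprocess.run(
            [command, "--version"], capture_output=True, text=True, timeout=10
        )
    except (OSError, subprocess.TimeoutExpired):
        return None
    return result.stdout.strip() or None


if __name__ == "__main__":
    version = tool_version("uv")
    print(version or "uv not found on PATH; open a new terminal or re-run the installer")
```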

3. Setup Environment

Running the Application

Option 1: WebUI (Recommended)

uvx android-use webui

[Demo video: 1224.mp4]

Option 2: CLI (Interactive)

uvx android-use

Local Development

If you want to contribute or develop locally:

git clone https://github.com/languse-ai/android-use
cd android-use
uv sync
# Start CLI
python -m android_use.cli
# Start WebUI
python -m android_use.app

⚠️ Disclaimer

This project is for research and study purposes only. Using it for unlawful data collection, interference with systems, or any other illegal activity is strictly prohibited. Please review the Terms of Use carefully.

📺 Demos

Here are three demos showcasing the capabilities:

1. WeChat Video Account Interaction

Task: Open WeChat, search for "Doubao Phone" in Video Accounts, get all comments from the most liked video on the current page, share this video with a friend, and send a summary of the comments.

[Demo video: 1222-1.mp4]

2. Douyin E-commerce Data Scraping

Task: Open Douyin, search for "Anta Basketball Shoes" in the shop, and retrieve information for the top 10 products.

[Demo video: A0F23DBA-AF73-4B2F-88B9-6FA893B17107.mov]

3. Xiaohongshu Interaction

Task: Open Xiaohongshu, search for "browser-use", select the most liked post on the current page, like it, leave a concise, human-like comment, and summarize the core content of the post.

[Demo video: 81B6C936-1E4B-4586-BDE9-AAD8A191B932.mov]