Watch The Detailed Video To Set-up This Model: https://www.youtube.com/watch?v=-YjbWjv1tJg
A real-time voice AI that can hear, see, understand, and control your Windows computer. Local execution. Zero subscriptions(Unless you want to increase request by buying requests from Google AI Studio). Built for intelligent automation.
MARK XXX is an advanced voice-driven AI assistant designed to turn your computer into an interactive intelligent system.
Speak naturally — it listens, understands context, responds with a human-like voice, and executes tasks across your system automatically.
Designed for speed, autonomy, and real-world usability.
- Real-time voice interaction — Natural conversation with instant response
- System control — Launch apps, manage files, execute commands
- Autonomous task execution — Plans and completes multi-step workflows
- Visual awareness — Screen analysis and webcam understanding
- Persistent memory — Learns preferences and remembers context
- Integrated tools — Web search, weather, reminders, messaging, code help, image generation
git clone https://github.com/FatihMakes
cd mark-xxx
python setup.py
python main.pyEnter your free Gemini API key on first launch. System ready in minutes.
If you got some problems or questions to ask or just want to support;
YouTube Account: text Instagram Account: text
- Windows 10 / 11
- Python 3.10 or newer
- Microphone
- Gemini API key
Personal and non-commercial use only. Licensed under Creative Commons BY-NC 4.0.
Engineered by a 17-year-old building a real JARVIS-style assistant. ⭐ Star the repository to support the project.