DEV.OS is an intelligent AI-powered Operating System assistant that revolutionizes how developers interact with their computer. It bridges the gap between natural human intent and system execution through:
- Conversational AI Interface - Talk to your computer naturally
- Intelligent Automation - Multi-step task orchestration
- System Integration - Deep OS-level control
- Developer Tools Integration - GitHub, code editors, terminals
- Real-time Monitoring - Live system metrics and activity feed
- 🔄 Constant context switching between IDE, browser, terminals, file explorer
- ⏱️ Manual repetitive tasks (opening apps, managing files, system settings)
- 🗣️ No natural language interface for complex OS operations
- 📊 Fragmented view of system state and activity
- 🔗 Disconnected developer workflows across tools
✅ Single unified interface for all tasks
✅ Automated multi-step workflows
✅ Voice and natural language commands
✅ Real-time monitoring dashboard
✅ Seamless GitHub and IDE integration
- Project overview and features
- Architecture diagrams
- Tech stack details
- Quick installation guide
- API structure overview
- Problem statement
- Complete environment setup
- Backend configuration
- Frontend configuration
- OS automation setup
- Database configuration (MongoDB)
- OAuth integration setup
- Environment variables reference
- Troubleshooting guide
- Complete API endpoint documentation
- Request/response examples
- All 10+ endpoint categories
- Authentication flows
- WebSocket real-time events
- Error handling
- Rate limiting information
- Docker containerization
- Cloud deployment options (Railway, Vercel, AWS)
- CI/CD pipeline setup
- Monitoring and logging
- Security hardening
- Rollback procedures
- Health check endpoints
- 5-minute quick start
- Prerequisites
- Step-by-step setup
- Feature overview
- Common commands
- Detailed architecture diagrams
- Data flow explanations
- Component interactions
- Real-time communication
- Security considerations
┌─────────────────────────────────────────────────────────────┐
│ USER BROWSER │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────────┐ │
│ │ Command Chat │ │ Activity Feed│ │ System Dashboard │ │
│ └──────┬───────┘ └──────┬───────┘ └────────┬─────────┘ │
└─────────┼──────────────────┼──────────────────┼─────────────┘
│ │ │
└──────────────────┼──────────────────┘
HTTP/WebSocket
│
┌───────────────────┴────────────────────┐
│ │
┌────────▼─────────────────────────┐ ┌────────▼──────────┐
│ EXPRESS.JS BACKEND (Port 3001) │ │ MONGODB ATLAS │
│ • Authentication (JWT) │ │ • User data │
│ • Command Processing │ │ • Commands │
│ • WebSocket (Real-time) │ │ • Sessions │
│ • GitHub Integration │ │ • Audit logs │
│ • AI Orchestration │ └───────────────────┘
└────────┬──────────────────────────┘
│
├─────────────────────────────────────┐
│ │
┌────────▼────────────┐ ┌───────────▼─────────┐
│ GEMINI AI BRAIN │ │ OS AUTOMATION │
│ • NLP Understanding │ │ (Python FastAPI) │
│ • Action Routing │ │ • App Launch/Close │
│ • Intent Detection │ │ • File Operations │
│ • Response Gen │ │ • System Control │
└─────────────────────┘ │ • Permissions │
└─────────────────────┘
| Layer | Technology | Purpose |
|---|---|---|
| Frontend | Next.js 14, React, Tailwind, Framer Motion | Futuristic UI |
| Backend | Express.js, Node.js, Socket.io, Mongoose | API & Real-time |
| AI | Google Gemini Flash API | Natural language |
| Database | MongoDB Atlas | Data persistence |
| OS Layer | Python, FastAPI, PyAutoGUI, pywin32 | System execution |
| Auth | JWT, Google OAuth, GitHub OAuth | Security |
User Input → NLP Understanding → Action Routing → Multi-Step Execution
- Natural language command processing with Google Gemini
- Context-aware action routing
- Multi-step workflow orchestration
- Error recovery and fallback handling
-
Launch/close applications
-
Control volume, brightness, power
-
File creation, deletion, search, copy
-
Directory navigation
-
Application monitoring
-
Repository management
- System metrics dashboard
- Speech-to-text processing
- Voice activity detection
- Natural language voice commands
- Text-to-speech responses
- Google OAuth 2.0 integration
- GitHub OAuth authentication
- JWT token-based sessions
- Permission-based access control
- Audit logging
GET /api/auth/google- Google OAuthGET /api/auth/google/callback- Google callbackGET /api/auth/github- GitHub OAuthPOST /api/auth/refresh- Refresh token
POST /api/command/execute- Execute commandGET /api/command/history- Get historyGET /api/command/{id}- Get details
GET /api/system/status- System metricsGET /api/system/apps- Running appsPOST /api/system/apps/launch- Launch appPOST /api/system/apps/close- Close appPOST /api/system/audio/volume- Set volumePOST /api/system/display/brightness- Set brightness
POST /api/files/create- Create fileGET /api/files/read- Read filePUT /api/files/update- Update fileDELETE /api/files/delete- Delete filePOST /api/files/copy- Copy fileGET /api/files/search- Search filesGET /api/files/list- List directory
GET /api/github/repos- Get repositoriesPOST /api/github/repos/create- Create repoPOST /api/github/repos/push- Push codeGET /api/github/repos/{owner}/{repo}/content/{path}- Get content
POST /api/ai/process- Process NL commandGET /api/ai/suggestions- Get suggestions
GET /api/user/profile- Get profilePUT /api/user/profile- Update profile
{
"framework": "Next.js 14",
"ui": "React 18",
"styling": "Tailwind CSS",
"animation": "Framer Motion",
"realtime": "Socket.io Client",
"state": "React Hooks",
"testing": "Jest, React Testing Library"
}{
"runtime": "Node.js 18+",
"framework": "Express.js 4",
"database": "MongoDB Atlas",
"orm": "Mongoose",
"auth": "JWT, Passport.js",
"realtime": "Socket.io 4",
"validation": "Joi",
"testing": "Jest",
"logging": "Winston"
}{
"language": "Python 3.10+",
"web": "FastAPI",
"system": "PyAutoGUI, pywin32",
"audio": "sounddevice, soundfile",
"async": "asyncio",
"testing": "pytest"
}# 1. Backend (5 min)
cd apps/dev-auth-backend
npm install
# Add .env file
npm run dev
# 2. Frontend (5 min) - New Terminal
cd apps/dev-frontend-ui
npm install
npm run dev
# 3. OS Automation (5 min) - New Terminal
cd apps/dev-os-automation
python -m venv venv
source venv/bin/activate # or venv\Scripts\activate on Windows
pip install -r requirements.txt
python src/server.py
# Access: http://localhost:3000Dev-AI-OS-assistant/
├── apps/
│ ├── dev-frontend-ui/ # Next.js Frontend
│ │ ├── src/
│ │ │ ├── components/ # React components
│ │ │ ├── pages/ # Next.js pages
│ │ │ ├── styles/ # Tailwind CSS
│ │ │ └── lib/ # Utilities
│ │ └── package.json
│ │
│ ├── dev-auth-backend/ # Express Backend
│ │ ├── src/
│ │ │ ├── models/ # MongoDB schemas
│ │ │ ├── modules/ # Auth, User
│ │ │ ├── services/ # GitHub, AI
│ │ │ ├── config/ # Database
│ │ │ └── main.ts # Entry point
│ │ └── package.json
│ │
│ ├── dev-os-automation/ # Python OS Control
│ │ ├── src/
│ │ │ ├── server.py # FastAPI server
│ │ │ ├── app_controller.py
│ │ │ ├── file_controller.py
│ │ │ ├── guard_agent.py
│ │ │ └── models.py
│ │ └── requirements.txt
│ │
│ ├── dev-voice-system/ # Voice Processing
│ ├── dev-assistant-core/ # Shared Logic
│ └── dev-llm/ # LLM Integration
│
├── env/ # Python Virtual Environment
├── docs/ # Documentation
├── README.md # Main documentation
├── CONFIGURATION.md # Setup guide
├── API.md # API reference
├── DEPLOYMENT.md # Production guide
├── ARCHITECTURE.md # System design
├── QUICK_START.md # Quick start
└── package.json # Monorepo config
- Google OAuth 2.0
- GitHub OAuth
- JWT token-based sessions
- Refresh token rotation
- Role-based access control
- Permission-based system operations
- Scope validation
- Encrypted sensitive data
- HTTPS/TLS in production
- Environment-based configuration
- Input validation & sanitization
- Command execution audit logs
- User activity tracking
- Error logging with Sentry
- System event logging
- API Response Time: < 200ms
- Frontend Load Time: < 3s
- WebSocket Latency: < 100ms
- Command Execution: 100-500ms
- Horizontal scaling via Docker
- Database indexing for speed
- Connection pooling
- Caching strategies
- Local machine with npm/pip
- Docker containers
- Docker Compose for orchestration
- Cloud platforms (Railway, Heroku, AWS)
- VPS (AWS EC2, DigitalOcean, Linode)
- Kubernetes for enterprise
- Vercel for frontend
- MongoDB Atlas for database
- Development: Free
- Small production: $5-20/month
- Enterprise: $50+/month
Each documentation file covers:
README.md
├── Problem statement
├── Features overview
├── Architecture diagram
├── Quick start
├── API structures
└── Links to other docs
CONFIGURATION.md
├── System requirements
├── Installation steps
├── Environment variables
├── Database setup
├── OAuth setup
├── Troubleshooting
└── Security best practices
API.md
├── Authentication flows
├── All endpoints with examples
├── Request/response formats
├── WebSocket events
├── Error handling
└── Rate limiting
DEPLOYMENT.md
├── Docker setup
├── Cloud deployment
├── CI/CD pipelines
├── Monitoring
├── Security hardening
└── Rollback procedures
ARCHITECTURE.md
├── System design
├── Data flows
├── Component interactions
├── Technology choices
└── Scalability considerations
QUICK_START.md
├── Prerequisites
├── 5-minute setup
├── Running all services
├── Accessing dashboard
└── Common commands
- Read QUICK_START.md - Get running in 5 minutes
- Review CONFIGURATION.md - Understand setup requirements
- Check API.md - Explore available endpoints
- Study ARCHITECTURE.md - Learn system design
- See DEPLOYMENT.md - Plan production release
We welcome contributions! Areas to contribute:
- Additional system integrations
- More AI prompt optimizations
- Mobile app development
- Cloud provider support
- Performance optimizations
- Documentation improvements
See CONTRIBUTING.md for guidelines.
- GitHub Issues: Report bugs and request features
- GitHub Discussions: Ask questions and discuss ideas
- Email: support@dev-os.dev
- Documentation: Check relevant .md files
MIT License - See LICENSE file for details
✅ Full-stack microservices architecture
✅ Real-time WebSocket integration
✅ Google Gemini AI integration
✅ OAuth authentication (Google & GitHub)
✅ Cross-platform OS control
✅ Voice command support
✅ MongoDB cloud database
✅ Professional documentation
✅ Docker containerization
✅ CI/CD pipeline ready
- Total Repositories: 6 main apps
- Languages: TypeScript, JavaScript, Python
- Frontend: Next.js with React
- Backend: Express.js with MongoDB
- Lines of Code: 10,000+
- Documentation Pages: 8+
- API Endpoints: 30+
- Contributors Ready: Yes
Version: 1.0.0
Last Updated: December 27, 2025
Status: Active Development
Maintained By: DEV.OS Team
For detailed information, refer to:
- 📖 README.md - Main documentation
- 🔧 CONFIGURATION.md - Setup guide
- 📡 API.md - API reference
- 🚀 DEPLOYMENT.md - Production guide
- 🏗️ ARCHITECTURE.md - System design
- ⚡ QUICK_START.md - Quick start guide


