CoCo - Collaborative Drawing Application

CoCo is a collaborative drawing application that lets multiple users draw together in real time. It features AI-powered image enhancement, using Google's Gemini model, as well as story generation.

🎨 Features

Real-time Collaborative Drawing: Draw together with others in real time
Hand Gesture Recognition: Control the canvas with hand gestures
AI-Powered Enhancement: Enhance your sketches with Gemini AI
Voice-Controlled AI Assistant: Talk to an AI assistant about your drawings
Storyboard Generation: Create storyboards from your drawings
Video Generation: Generate videos from your storyboards
Multimodal Interaction: Combine drawing, voice, and text for AI interaction

🤖 Multimodal AI Assistant

The new Multimodal AI Assistant allows you to have real-time conversations with Gemini about your drawings:

Voice Commands

"Make this drawing more detailed" - Ask for enhancements
"Change the colors to blue and green" - Modify existing images
"What do you think about this drawing?" - Get feedback
"Add a background to this scene" - Request modifications

Text Chat

Type messages to ask questions about your drawings
Get suggestions for improvements
Request specific modifications

Real-time Drawing Analysis

The AI can see your drawing as you create it
Get instant feedback and suggestions
Ask for help with drawing techniques

How to Use

Click the 💬 AI Assistant button in the top-right corner
Draw something on the canvas
Talk to the AI using voice or text
Get real-time assistance and modifications

Quick Start

Option 1: Complete Setup (Recommended)

# Start all servers including multimodal AI assistant
chmod +x start-multimodal.sh
./start-multimodal.sh

# In a new terminal, start the frontend
npm install
npm run dev

Option 2: Manual Setup

# Backend setup
cd backend
chmod +x setup-all.sh
./setup-all.sh

# Start multimodal server (in new terminal)
cd multimodal
source ../backend/venv/bin/activate
python main.py

# Start backend server (in new terminal)
cd backend
source venv/bin/activate
python app.py

# Frontend setup (in new terminal)
npm install
npm run dev

Project Architecture

CoCo uses a modular architecture with separate frontend and backend components:

Frontend (React + TypeScript)
- Real-time canvas with collaboration features
- UI for storyboard management
- Image enhancement interface

Backend (Node.js + Flask)
- WebSocket server for real-time collaboration
- Flask API for AI-powered features (image enhancement, video generation)
- Combined server that runs both services simultaneously

Detailed Setup

Prerequisites

Node.js 14+ and npm
Python 3.8+
Git
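
To confirm these prerequisites before running the setup scripts, a small check along the following lines may help. It is only a sketch and not part of the CoCo repository: the helper names `parse_version` and `tool_version` are hypothetical, and it assumes `node --version` prints the usual `v18.17.0` style string.

```python
import re
import shutil
import subprocess
import sys

MIN_PYTHON = (3, 8)  # "Python 3.8+" from the prerequisites list
MIN_NODE = (14, 0)   # "Node.js 14+" from the prerequisites list


def parse_version(text):
    """Extract (major, minor, patch) from text like 'v18.17.0'; None if absent."""
    match = re.search(r"(\d+)\.(\d+)\.(\d+)", text)
    return tuple(int(part) for part in match.groups()) if match else None


def tool_version(name):
    """Run `<name> --version` and parse its output; None if the tool is missing."""
    exe = shutil.which(name)
    if exe is None:
        return None
    result = subprocess.run([exe, "--version"], capture_output=True, text=True)
    return parse_version(result.stdout)


if __name__ == "__main__":
    node = tool_version("node")
    print("Python OK:", sys.version_info[:2] >= MIN_PYTHON)
    print("Node.js OK:", node is not None and node[:2] >= MIN_NODE)
    print("npm found:", shutil.which("npm") is not None)
    print("git found:", shutil.which("git") is not None)
```

The check only reports what it finds; installing or upgrading anything is left to you.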

Backend Setup

The backend consists of a WebSocket server for real-time collaboration and a Flask API for AI-powered features.

cd backend

# Run the setup script to install all dependencies 
./setup-all.sh

# Start both servers
npm run start-all

# For development mode with auto-restart:
npm run dev-all

# To run servers individually:
npm run dev            # WebSocket server only
npm run api            # Flask API only

Frontend Setup

cd frontend-main

# Install dependencies
npm install

# Start development server
npm run dev

Environment Configuration

Create .env files in both the backend and frontend-main directories:

backend/.env

GOOGLE_API_KEY=your_gemini_api_key
ELEVENLABS_API_KEY=your_elevenlabs_api_key
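
Before starting the servers, it can be useful to verify that both keys are actually present. The sketch below assumes the file uses plain `KEY=value` lines as shown above; `parse_env` and `missing_keys` are hypothetical helper names, and the check only tests for presence, not whether the keys are valid.

```python
from pathlib import Path
from typing import Dict, List

REQUIRED_KEYS = ("GOOGLE_API_KEY", "ELEVENLABS_API_KEY")


def parse_env(text: str) -> Dict[str, str]:
    """Parse simple KEY=value lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(text: str, required=REQUIRED_KEYS) -> List[str]:
    """Return the required keys that are absent or set to an empty value."""
    values = parse_env(text)
    return [key for key in required if not values.get(key)]


if __name__ == "__main__":
    env_path = Path("backend/.env")  # path taken from the section above
    text = env_path.read_text() if env_path.exists() else ""
    missing = missing_keys(text)
    print("Missing keys:", ", ".join(missing) if missing else "none")
```

Run it from the repository root so the relative path resolves.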

frontend-main/.env

VITE_WS_URL=ws://localhost:8080
VITE_API_URL=http://localhost:5001

Accessing the Application

Once all services are running, access the application at:

Frontend UI: http://localhost:5173
Backend API: http://localhost:5001
WebSocket: ws://localhost:8080

Network Collaboration

To collaborate with users on the same network:

Find your machine's local IP address (e.g., with ifconfig on macOS/Linux or ipconfig on Windows)

Update frontend-main/.env:

VITE_WS_URL=ws://YOUR_IP_ADDRESS:8080
VITE_API_URL=http://YOUR_IP_ADDRESS:5001

Share your IP address with collaborators, who can connect via:
```
http://YOUR_IP_ADDRESS:5173
```
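
The network setup steps above can also be scripted. The following is a minimal sketch, assuming the default ports 8080 and 5001 used elsewhere in this README; `lan_ip` and `frontend_env` are hypothetical helper names. It finds the outbound interface address with a UDP socket (connecting a UDP socket selects a route without sending any packet) and prints the two lines to paste into frontend-main/.env.

```python
import socket


def lan_ip() -> str:
    """Best-effort local IP address; falls back to 127.0.0.1 if no route exists."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        # Connecting a UDP socket only picks the outgoing interface.
        sock.connect(("8.8.8.8", 80))
        return sock.getsockname()[0]
    except OSError:
        return "127.0.0.1"
    finally:
        sock.close()


def frontend_env(ip: str, ws_port: int = 8080, api_port: int = 5001) -> str:
    """Render the frontend .env contents for the given host address."""
    return (
        f"VITE_WS_URL=ws://{ip}:{ws_port}\n"
        f"VITE_API_URL=http://{ip}:{api_port}\n"
    )


if __name__ == "__main__":
    print(frontend_env(lan_ip()), end="")
```

On machines with several network interfaces, the detected address may not be the one your collaborators can reach, so compare it against the output of ifconfig or ipconfig.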

AI Features

Image Enhancement

Draw a sketch on the canvas
Click "Enhance with Gemini"
Enter a prompt to guide the enhancement
The enhanced image appears on your canvas as an interactive object

Storyboard Creation

Create multiple enhanced drawings
Add them to the storyboard using the "Add to Storyboard" button
Arrange your scenes in the storyboard panel

Video Generation

Add at least 2 images to your storyboard
Click "Generate Video"
The AI will create a narrated video connecting your scenes

Development

Project Structure

CoCo/
├── frontend-main/     # React frontend
│   ├── src/           # Source code
│   ├── public/        # Static assets
│   └── package.json   # Dependencies
│
├── backend/           # Backend services
│   ├── websocket-server.js  # Real-time collaboration server
│   ├── app.py         # Flask API for AI features
│   ├── server.js      # Combined server manager
│   └── package.json   # Node.js dependencies
│
├── start.sh           # Script to start all services
└── stop.sh            # Script to stop all services

Adding New Features

Frontend changes should be made in the frontend-main/src directory
Backend changes:
- WebSocket functionality: backend/websocket-server.js
- AI processing: backend/app.py
- Server management: backend/server.js

Troubleshooting

Connection Issues

Ensure all services are running (check with ps aux | grep node and ps aux | grep python)
Verify correct URLs in frontend-main/.env
Check that ports 5001, 5173, and 8080 are not blocked by firewalls
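
As a quick way to test the three ports listed above, a TCP connection attempt shows whether anything is listening. This is a sketch under the assumption that the services run on the default ports from this README; `port_open` is a hypothetical helper, and a successful connection only shows that some process accepts connections on that port, not that it is the right service.

```python
import socket

# Default ports from the "Accessing the Application" section.
PORTS = {"Flask API": 5001, "Frontend": 5173, "WebSocket": 8080}


def port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    for name, port in PORTS.items():
        state = "open" if port_open("localhost", port) else "not reachable"
        print(f"{name} (port {port}): {state}")
```

To check for firewall problems, run it from a collaborator's machine with your IP address in place of localhost.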

AI Enhancement Issues

Verify your API keys are correctly set in backend/.env
Check the Flask server logs for API-related errors

Acknowledgements

Google Gemini API for image generation
ElevenLabs for text-to-speech generation
MediaPipe for hand gesture recognition
