A sophisticated multi-agent system for generating 3D CAD models using AI agents. This project implements an iterative workflow that combines image generation, metadata analysis, and evaluation to create high-quality 3D models.
-
🤖 Multi-Agent Architecture: Uses specialized agents for generation and evaluation
-
🎨 Image Generation: Creates multi-view images using DALL-E
-
📊 Metadata Generation: Produces detailed metadata for 3D CAD reconstruction
-
🔄 Iterative Improvement: Continuously refines results based on evaluation feedback
-
⚡ Multi-API Support: Test different AI providers (OpenAI, Claude, DeepSeek, Qwen)
- Generation Agent: Creates multi-view images and metadata for 3D CAD reconstruction
- Evaluation Agent: Assesses quality and provides improvement suggestions
- Mesh Generation Agent: Converts results into 3D mesh data
- User provides a query describing the desired 3D object
- Generation agent creates multi-view images and metadata
- Evaluation agent assesses quality and provides feedback
- System iteratively improves results until quality threshold is met
- Final 3D mesh is generated
-
Clone the repository:
git clone <your-repo-url> cd openai-agents-python
-
Set up virtual environment:
python -m venv env source env/bin/activate # On Windows: env\Scripts\activate
-
Install dependencies:
pip install -r requirements.txt
-
Configure API keys: Create a
.env
file in the root directory and add your API keys:OPENAI_API_KEY=your_openai_key CLAUDE_API_KEY=your_claude_key DEEPSEEK_API_KEY=your_deepseek_key QWEN_API_KEY=your_qwen_key
Run the web application:
cd webapp
python app.py
The system will:
- Start a web server for the 3D generation interface
- Allow you to input your desired 3D object
- Generate and iteratively improve results
- Save outputs to organized directories
The system supports multiple AI providers for speed and quality testing:
- OpenAI GPT-4o: Fast and reliable
- Claude 3 Sonnet: High-quality reasoning
- DeepSeek: Cost-effective alternative
- Qwen: Alibaba's AI model
Switch between models by changing the model configuration in the web application or use the interactive selection.
project/
├── renders/ # Generated images
├── evaluation_reports_*/ # Evaluation reports per iteration
├── mesh_outputs/ # Final 3D mesh data
└── webapp/ # Web interface
The web application allows you to select different AI models through the interface. Available options include OpenAI GPT-4o, Claude 3 Sonnet, DeepSeek, and Qwen.
# In evaluation agent
if all scores > 6.5:
suggestions = "well done"
- Add API key to the configuration
- Create client with appropriate base URL
- Add to
MODEL_CONFIGS
dictionary - Test with
switch_model()
function
Edit the prompts in the web application:
- Generation agent prompts in
webapp/app.py
- Evaluation agent prompts in the application logic
- Image generation prompts in the DALL-E integration
- API Errors: Check your API keys and quotas
- Import Errors: Ensure all dependencies are installed
- Memory Issues: Reduce image resolution in generation
- Slow Performance: Try different AI providers
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests if applicable
- Submit a pull request
This project is licensed under the MIT License - see the LICENSE file for details.
- Built on OpenAI's agents framework
- Uses DALL-E for image generation
- Inspired by modern multi-agent architectures