3D Generation Multi-Agent System

A multi-agent system for generating 3D CAD models. It implements an iterative workflow that combines image generation, metadata analysis, and evaluation, refining the results until they meet a quality threshold.

Features

🤖 Multi-Agent Architecture: Uses specialized agents for generation and evaluation
🎨 Image Generation: Creates multi-view images using DALL-E
📊 Metadata Generation: Produces detailed metadata for 3D CAD reconstruction
🔄 Iterative Improvement: Continuously refines results based on evaluation feedback
⚡ Multi-API Support: Test different AI providers (OpenAI, Claude, DeepSeek, Qwen)

Architecture

Agents

Generation Agent: Creates multi-view images and metadata for 3D CAD reconstruction
Evaluation Agent: Assesses quality and provides improvement suggestions
Mesh Generation Agent: Converts results into 3D mesh data

Workflow

User provides a query describing the desired 3D object
Generation agent creates multi-view images and metadata
Evaluation agent assesses quality and provides feedback
System iteratively improves results until quality threshold is met
Final 3D mesh is generated
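
The loop above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: `generate`, `evaluate`, and `build_mesh` are hypothetical callables standing in for the three agents, and the iteration cap is an assumption. The 6.5 default mirrors the value shown under Quality Thresholds.

```python
from typing import Callable, Dict, Optional, Tuple

# Hypothetical shapes; the real agents' interfaces are not shown in this README.
Result = Dict[str, object]
Scores = Dict[str, float]


def run_workflow(
    query: str,
    generate: Callable[[str, Optional[str]], Result],
    evaluate: Callable[[Result], Tuple[Scores, str]],
    build_mesh: Callable[[Result], object],
    threshold: float = 6.5,   # mirrors the Quality Thresholds section
    max_iterations: int = 5,  # assumed safety cap so the loop always terminates
) -> Tuple[object, int]:
    """Generate, evaluate, and refine until every score exceeds the threshold."""
    if max_iterations < 1:
        raise ValueError("max_iterations must be at least 1")
    feedback: Optional[str] = None
    result: Result = {}
    iteration = 0
    for iteration in range(1, max_iterations + 1):
        # Each pass feeds the previous evaluation feedback back into generation.
        result = generate(query, feedback)
        scores, feedback = evaluate(result)
        if all(score > threshold for score in scores.values()):
            break
    # The mesh is built from the last result, whether or not the threshold was met.
    return build_mesh(result), iteration
```

Returning the iteration count alongside the mesh makes it easy to see whether the run stopped because the threshold was met or because the cap was reached.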

Installation

Clone the repository:

git clone <your-repo-url>
cd 3dprint  # or whichever directory git clone created

Set up virtual environment:

python -m venv env
source env/bin/activate  # On Windows: env\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```

Configure API keys: Create a .env file in the root directory and add your API keys:

OPENAI_API_KEY=your_openai_key
CLAUDE_API_KEY=your_claude_key
DEEPSEEK_API_KEY=your_deepseek_key
QWEN_API_KEY=your_qwen_key
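
A minimal sketch of reading these keys at startup, using only the standard library. The `load_env_file` and `missing_keys` helpers are hypothetical; the project may well load the file differently (for example with a package such as python-dotenv), and the parser below handles only simple `KEY=value` lines.

```python
import os
from pathlib import Path
from typing import Dict, Iterable, List

# Key names taken from the .env example above.
EXPECTED_KEYS = ["OPENAI_API_KEY", "CLAUDE_API_KEY", "DEEPSEEK_API_KEY", "QWEN_API_KEY"]


def load_env_file(path: str = ".env") -> Dict[str, str]:
    """Parse simple KEY=value lines, skipping blanks, comments, and malformed lines."""
    values: Dict[str, str] = {}
    env_path = Path(path)
    if not env_path.is_file():
        return values
    for raw in env_path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values: Dict[str, str], expected: Iterable[str] = EXPECTED_KEYS) -> List[str]:
    """Return expected keys that are absent or empty in both the file and the environment."""
    return [k for k in expected if not (values.get(k) or os.environ.get(k))]
```

Calling `missing_keys(load_env_file())` before starting the web application gives an early, readable error instead of a failed API call later.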

Usage

Command Line Interface

Run the web application:

cd webapp
python app.py

The system will:

Start a web server for the 3D generation interface
Allow you to input your desired 3D object
Generate and iteratively improve results
Save outputs to organized directories

API Testing

The system supports multiple AI providers for speed and quality testing:

OpenAI GPT-4o: Fast and reliable
Claude 3 Sonnet: High-quality reasoning
DeepSeek: Cost-effective alternative
Qwen: Alibaba's AI model

Switch between models by changing the model configuration in the web application or by using the interactive selection.
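
For a rough speed comparison, each provider can be wrapped in a callable and timed on the same prompt. The sketch below is an assumption about how such a test might be structured, not the project's test code: `time_providers` is a hypothetical helper, and the callables passed to it are whatever client wrappers you write yourself.

```python
import time
from typing import Callable, Dict


def time_providers(prompt: str, providers: Dict[str, Callable[[str], str]]) -> Dict[str, float]:
    """Return the wall-clock seconds each provider callable takes for one prompt."""
    timings: Dict[str, float] = {}
    for name, call in providers.items():
        start = time.perf_counter()
        call(prompt)  # the response is discarded; only latency is measured here
        timings[name] = time.perf_counter() - start
    return timings
```

A single call per provider is noisy, so repeating the measurement several times and comparing medians gives a fairer picture. Quality still has to be judged separately, for example from the evaluation reports.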

Output Structure

project/
├── renders/                    # Generated images
├── evaluation_reports_*/       # Evaluation reports per iteration
├── mesh_outputs/              # Final 3D mesh data
└── webapp/                    # Web interface
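
To inspect a finished run, the directories above can be gathered with a glob. This is a minimal sketch that assumes the layout shown in the tree; `collect_outputs` is a hypothetical helper, and the suffix after `evaluation_reports_` is whatever the application writes, since it is not specified here.

```python
from pathlib import Path
from typing import Dict, List


def collect_outputs(project_root: str) -> Dict[str, List[Path]]:
    """List generated files under the output directories shown in the tree above."""
    root = Path(project_root)
    # Every directory matching evaluation_reports_* is treated as one iteration's reports.
    report_dirs = sorted(p for p in root.glob("evaluation_reports_*") if p.is_dir())
    return {
        "renders": sorted(p for p in (root / "renders").glob("*") if p.is_file()),
        "reports": [f for d in report_dirs for f in sorted(d.glob("*")) if f.is_file()],
        "meshes": sorted(p for p in (root / "mesh_outputs").glob("*") if p.is_file()),
    }
```

Directories that do not exist yet simply produce empty lists, so the helper can be called before the first run completes.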

Configuration

Model Selection

The web application allows you to select different AI models through the interface. Available options include OpenAI GPT-4o, Claude 3 Sonnet, DeepSeek, and Qwen.

Quality Thresholds

# In the evaluation agent (assumes `scores` is an iterable of numeric scores)
if all(score > 6.5 for score in scores):
    suggestions = "well done"
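
As a runnable illustration of this check, the sketch below wraps it in a function. The score names, the dictionary shape, and the fallback message are assumptions; only the 6.5 threshold and the "well done" value come from the snippet above.

```python
from typing import Dict

QUALITY_THRESHOLD = 6.5  # value from the snippet above


def make_suggestions(scores: Dict[str, float], threshold: float = QUALITY_THRESHOLD) -> str:
    """Return "well done" when every score exceeds the threshold, else name the weak criteria."""
    # A score exactly equal to the threshold does not pass, matching the strict ">" above.
    weak = sorted(name for name, score in scores.items() if score <= threshold)
    if not weak:
        return "well done"
    return "improve: " + ", ".join(weak)
```

For example, `make_suggestions({"image_quality": 6.5, "metadata": 8.2})` flags `image_quality`, because the comparison is strict.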

Development

Adding New AI Providers

Add API key to the configuration
Create client with appropriate base URL
Add to MODEL_CONFIGS dictionary
Test with switch_model() function
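
The steps above might look roughly like the following. `MODEL_CONFIGS` and `switch_model()` are named in this README, but their actual structure and signature are not shown, so everything below (the field names, the `ModelConfig` class, and the module-level `_active` variable) is an assumption for illustration.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class ModelConfig:
    model: str        # provider-specific model identifier
    api_key_env: str  # name of the environment variable holding the API key
    base_url: str     # provider endpoint; check the provider's documentation


# Assumed shape of the registry: provider name -> configuration.
MODEL_CONFIGS: Dict[str, ModelConfig] = {
    "openai": ModelConfig("gpt-4o", "OPENAI_API_KEY", "https://api.openai.com/v1"),
}

_active = "openai"  # hypothetical record of the currently selected provider


def switch_model(name: str) -> ModelConfig:
    """Select a registered provider and return its configuration."""
    global _active
    if name not in MODEL_CONFIGS:
        raise KeyError(f"unknown provider {name!r}; known: {sorted(MODEL_CONFIGS)}")
    _active = name
    return MODEL_CONFIGS[name]


# Adding a provider: register its config, then switch to it.
# The name, model identifier, and URL here are placeholders, not a real service.
MODEL_CONFIGS["example"] = ModelConfig("example-model", "EXAMPLE_API_KEY", "https://example.invalid/v1")
```

Keeping the API key as an environment variable name, not the key itself, means the registry can be committed without exposing secrets.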

Customizing Prompts

Edit the prompts in the web application:

Generation agent prompts in webapp/app.py
Evaluation agent prompts in the application logic
Image generation prompts in the DALL-E integration

Troubleshooting

API Errors: Check your API keys and quotas
Import Errors: Ensure all dependencies are installed
Memory Issues: Reduce image resolution in generation
Slow Performance: Try different AI providers

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgments

Built on OpenAI's agents framework
Uses DALL-E for image generation
Inspired by modern multi-agent architectures
