🖼️ Doc2Image

Turn any document into a gallery of AI‑ready image ideas. Upload any file (PDF, DOCX, TXT, and more) and Doc2Image will read it, summarize the content, and generate a list of unique visual concepts you can take to the image generator of your choice. Perfect for blog posts, presentations, decks, social posts—or just sparking your imagination.

Why You’ll Love It

Intuitive Interface: A clean, guided experience from start to finish.
Flexible AI: Use OpenAI models or go local via Ollama—your call.
Idea History: Never lose a great idea with the built-in idea gallery.
Customizable: Fine-tune the AI's creativity to get the perfect results.
Budget-friendly: works great with small models (e.g., gpt-4.1-nano, deepseek-r1:1.5b), so it’s really cheap to run.

How It Works (3 Quick Steps)

Configure your models (one‑time): Add API keys and pick your providers in Settings.
Generate amazing image ideas: Upload a document and let the app craft tailored visual prompts.
Browse your Idea Gallery: Revisit past sessions and reuse your favorite prompts.

Disclaimer: Doc2Image does not generate images. It generates image ideas (prompts) you can paste into any AI image generator (e.g., Grok, ChatGPT, WhatsApp, etc.).

Demo

doc2image-demo.mov

Getting Started

You can run doc2image in two ways depending on your needs:

Basic setup — the simplest, uses only OpenAI models
Advanced setup — supports both OpenAI and local models via Ollama

💡 Pre-requisite: Make sure you have Docker installed on your system.

Basic Setup

Open your terminal or command line.
Pull the latest image from Docker Hub:

docker pull dylantinten/doc2image:latest

Run the application:

docker run --name doc2image -p 8000:8000 -v data:/app/data dylantinten/doc2image:latest

💡 This command will create a data folder in your current working directory to store output files (do not delete this folder). You can change data to any path you prefer, or cd into the directory where you want your data to live before running the command.

Open your browser and visit: http://localhost:8000. You're ready to go!

To stop the application:

docker stop doc2image

To start it again:

docker start doc2image

Advanced Setup

This setup runs both the doc2image app and an Ollama server locally using Docker Compose. You’ll be able to generate prompts using both OpenAI and open-source models like LLaMA or Gemma.

Download the docker-compose.yaml.
Open your terminal or command line and navigate where docker-compose.yaml is located.

💡 This command will create a data folder in your current working directory to store output files (do not delete this folder).

Build and launch the services:

docker compose up --build

Open your browser and go to: http://localhost:8000 to start using the app.

To stop the services:

docker compose down

To restart them later:

docker compose up

❤️ Contributing

We’d love your help to make Doc2Image even better!

Whether it’s reporting bugs, suggesting new features, or submitting a pull request — all contributions are welcome.

If you enjoy using this project, please consider giving it a star ⭐️ — it helps others discover it too!

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
assets		assets
doc2image		doc2image
.gitignore		.gitignore
Dockerfile		Dockerfile
Dockerfile.dev		Dockerfile.dev
LICENSE		LICENSE
README.md		README.md
docker-compose.dev.yml		docker-compose.dev.yml
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🖼️ Doc2Image

Why You’ll Love It

How It Works (3 Quick Steps)

Demo

Getting Started

Basic Setup

Advanced Setup

❤️ Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

dylannalex/doc2image

Folders and files

Latest commit

History

Repository files navigation

🖼️ Doc2Image

Why You’ll Love It

How It Works (3 Quick Steps)

Demo

Getting Started

Basic Setup

Advanced Setup

❤️ Contributing

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages