VidXiv - ArXiv Paper to Video Generator

🎥 Convert ArXiv research papers into engaging video presentations automatically!

Fig. VidXiv Streamlit UI

Features

📄 Fetches papers directly from ArXiv using paper ID
🤖 Uses AI (Gemini) to generate video scripts from paper content
🎬 Creates multi-scene videos with text overlays
🔊 Generates narration using text-to-speech
📱 Supports both landscape (YouTube) and portrait (Shorts/Reels) formats
🎵 Optional background music support
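
The two output formats differ only in frame orientation, so the frame size can be derived from a single flag. The sketch below is illustrative and not taken from main.py: the function name `frame_size`, the `portrait` flag, and the default 1920-pixel long side are all assumptions; only the 16:9 and 9:16 ratios come from this README.

```python
def frame_size(portrait: bool, long_side: int = 1920) -> tuple[int, int]:
    """Return (width, height) for a 16:9 landscape or 9:16 portrait frame.

    `long_side` is an assumed default; the real app may render at another size.
    """
    # The short side of a 16:9 frame is 9/16 of the long side.
    short_side = long_side * 9 // 16
    if portrait:
        # Shorts/Reels: taller than wide.
        return (short_side, long_side)
    # YouTube landscape: wider than tall.
    return (long_side, short_side)


if __name__ == "__main__":
    print("landscape:", frame_size(portrait=False))
    print("portrait: ", frame_size(portrait=True))
```

Keeping the orientation in one helper means every scene, text overlay, and export step can ask for the same dimensions instead of hard-coding them.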

Upcoming Features

🚀 Coming Soon:

✏️ Script Editing Interface - Review and modify AI-generated scripts before video creation
🖼️ Automatic Figure Integration - Smart extraction and placement of paper figures, charts, and diagrams
🎨 Manual Graphics Upload - Add custom images, logos, and visual elements to enhance presentations
🎭 Multiple AI Voices - Choose from different TTS voices and speaking styles
📊 Advanced Templates - Pre-designed video templates for different research fields
🔄 Batch Processing - Generate videos for multiple papers simultaneously
🌐 Multi-language Support - Generate videos in different languages

Installation

uv sync

Setup

Copy the environment template:

cp .env.template .env

Edit .env and add your API keys if needed (for Gemini or another LLM provider).
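
If video generation fails early, it can help to confirm that the keys from .env are actually visible to Python. The following is a minimal sketch, not part of the project: the helper `missing_keys` is hypothetical, and the variable name `GOOGLE_API_KEY` is an assumption, so check .env.template for the names the app really expects. It uses `load_dotenv()` from python-dotenv when that package is installed and falls back to the plain process environment otherwise.

```python
import os

try:
    # python-dotenv is listed under Dependencies; load_dotenv() reads .env
    # from the current directory into os.environ.
    from dotenv import load_dotenv
except ImportError:
    load_dotenv = None


def missing_keys(required, env=None):
    """Return the names in `required` that are unset or blank in `env`."""
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name, "").strip()]


if __name__ == "__main__":
    if load_dotenv is not None:
        load_dotenv()
    # GOOGLE_API_KEY is an assumed name, used here only for illustration.
    absent = missing_keys(["GOOGLE_API_KEY"])
    if absent:
        print("Missing keys:", ", ".join(absent))
    else:
        print("All expected keys are set.")
```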

Usage

Start the Streamlit app:

streamlit run main.py

Open your browser to the displayed URL (usually http://localhost:8501)
Enter an ArXiv paper ID (e.g., 2401.06015)
Choose video format:
- Uncheck for landscape YouTube format (16:9)
- Check for portrait Shorts/Reels format (9:16)
Optionally upload background music (MP3 format)
Click "Generate Video" and wait for processing
Download your generated video!
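
A paper ID such as 2401.06015 follows arXiv's current identifier scheme (four digits, a dot, four or five digits, and an optional version suffix like v2). The sketch below shows one way the ID field could be validated before any network request is made. The function names are hypothetical and this is not the app's actual validation; older IDs of the form archive/YYMMNNN are deliberately not handled.

```python
import re

# New-style arXiv identifiers: YYMM.NNNN or YYMM.NNNNN, optionally followed by vN.
_ARXIV_ID = re.compile(r"^\d{4}\.\d{4,5}(v\d+)?$")


def normalize_arxiv_id(raw: str) -> str:
    """Strip whitespace and an optional 'arXiv:' prefix, then validate."""
    candidate = raw.strip()
    if candidate.lower().startswith("arxiv:"):
        candidate = candidate[len("arxiv:"):]
    if not _ARXIV_ID.match(candidate):
        raise ValueError(f"not a new-style arXiv ID: {raw!r}")
    return candidate


def pdf_url(arxiv_id: str) -> str:
    """Build the PDF URL for a validated ID."""
    return f"https://arxiv.org/pdf/{normalize_arxiv_id(arxiv_id)}"


if __name__ == "__main__":
    print(pdf_url("2401.06015"))
```

Rejecting malformed input up front gives the user an immediate message instead of a failed download several seconds later.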

Requirements

Python 3.11+
Internet connection (for fetching papers and AI processing)
Sufficient disk space for temporary video files
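
The first and last requirements can be checked from Python before launching the app. This is a rough preflight sketch using only the standard library; the helper names are hypothetical, and the README does not state how much disk space counts as sufficient, so the script only reports the free space rather than enforcing a threshold.

```python
import shutil
import sys
import tempfile

MIN_PYTHON = (3, 11)  # from the Requirements list above


def python_ok(version=sys.version_info) -> bool:
    """True if the given (major, minor, ...) version is at least MIN_PYTHON."""
    return tuple(version[:2]) >= MIN_PYTHON


def free_gib(path=None) -> float:
    """Free space, in GiB, on the filesystem holding `path` (default: temp dir)."""
    usage = shutil.disk_usage(path or tempfile.gettempdir())
    return usage.free / 2**30


if __name__ == "__main__":
    print("Python version OK:", python_ok())
    print(f"Free space in temp dir: {free_gib():.1f} GiB")
```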

Dependencies

arxiv - Fetching papers from ArXiv
pymupdf - PDF processing and figure extraction
gtts - Text-to-speech for narration
moviepy - Video editing and composition
streamlit - Web interface
langchain - LLM integration
requests - HTTP requests
pillow - Image processing
python-dotenv - Environment variable management

Troubleshooting

Common Issues

Import errors: Make sure all dependencies are installed correctly
MoviePy errors: Try installing with pip install "moviepy[optional]" (the quotes keep shells such as zsh from interpreting the brackets)
Font errors: Install system fonts or use the fallback font options
Memory issues: Try with shorter papers or reduce video quality
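
For import errors in particular, a quick way to see which packages are missing is to probe each one without importing it. The sketch below is an illustration rather than a project script. It maps the package names from the Dependencies list to their import names (pymupdf is imported as `fitz`, pillow as `PIL`, python-dotenv as `dotenv`); the helper name `missing_packages` is hypothetical.

```python
from importlib.util import find_spec

# Package name (as installed) -> top-level module name (as imported).
IMPORT_NAMES = {
    "arxiv": "arxiv",
    "pymupdf": "fitz",
    "gtts": "gtts",
    "moviepy": "moviepy",
    "streamlit": "streamlit",
    "langchain": "langchain",
    "requests": "requests",
    "pillow": "PIL",
    "python-dotenv": "dotenv",
}


def missing_packages(import_names=IMPORT_NAMES):
    """Return the sorted package names whose module cannot be located."""
    return sorted(
        package
        for package, module in import_names.items()
        if find_spec(module) is None  # None means the module is not installed
    )


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
        print("Try running: uv sync")
    else:
        print("All dependencies were found.")
```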

Error Messages

"Could not add background music": The background music file may be corrupted or in an unsupported format
"Error generating video": Check that all dependencies are properly installed and try again

Contributing

Feel free to submit issues and enhancement requests!

License

MIT License - see LICENSE file for details
