Skip to content
#

multimodal-ai

Here are 221 public repositories matching this topic...

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s. Works with Claude Code.

  • Updated Feb 11, 2026
  • Python
Building-Business-Ready-Generative-AI-Systems

This GitHub repository contains the complete code for building Business-Ready Generative AI Systems (GenAISys) from scratch. It guides you through architecting and implementing advanced AI controllers, intelligent agents, and dynamic RAG frameworks. The projects demonstrate practical applications across various domains.

  • Updated Aug 9, 2025
  • Jupyter Notebook

AI-powered tool to turn long videos into short, viral-ready clips. Combines transcription, speaker diarization, scene detection & 9:16 resizing β€” perfect for creators & smart automation.

  • Updated Apr 2, 2025
  • Python

ICML 2025 Papers: Dive into cutting-edge research from the premier machine learning conference. Stay current with breakthroughs in deep learning, generative AI, optimization, reinforcement learning, and beyond. Code implementations included. ⭐ support the future of machine learning research!

  • Updated Oct 24, 2025

Open source, AI-enhanced CAT tool with multi-LLM support (OpenAI, Claude, Gemini, Ollama), innovative Superlookup concordance system offering access to multiple terminology sources (TMs, glossaries, web resources, etc.), and seamless CAT tool integration (memoQ, Trados, CafeTran, Phrase).

  • Updated Feb 11, 2026
  • Python

⚑ Production-ready .NET Standard 2.1 RAG library with πŸ€– multi-AI provider support, 🏒 enterprise vector storage, πŸ“„ intelligent document processing, and πŸ—„οΈ multi-database query coordination. 🌍 Cross-platform compatible.

  • Updated Feb 8, 2026
  • C#

This is a fully autonomous, self-operating computer automation system designed to automate tasks on Windows without any user interaction. It runs scheduled or trigger-based workflows using Python, system tools, and smart agents β€” ideal for repetitive tasks, bots, or self-executing pipelines.

  • Updated Aug 3, 2025
  • Python

Improve this page

Add a description, image, and links to the multimodal-ai topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multimodal-ai topic, visit your repo's landing page and select "manage topics."

Learn more