English | 中文
This repo collects awesome AI tools. Welcome everyone to recommend more awesome AI tools together! Please use the following template as a reference for your recommendations. issue
- All Categories
- ChatGPT and other AI chatbot
- AI Agent
- Open Source LLMs
- LLM Leaderboard
- GPT LLMs Applications
- AI Coding
- AI Image Creation
- Video Creation
- AI Cloud Platform
- LLM Prompts
- LLM training platform
- Writing
- Translation
- Speech Recognition
- Text To Speech
- Music Recognition
- Voice Processing
- AI generated music or sound effects
- Speech translation
- Video Content Summary
- Academic research
- OCR
- AI Detection
| Name | Description | Links | Fees |
|---|---|---|---|
| Gemini | Google's AI chatbot, including Gemini-3.1 pro. Excels in multimodality, high-fidelity image generation/analysis, and deep integration with the Google ecosystem. Best for: Image processing and web information integration, Deep Research feature performs exceptionally well, seamlessly integrates with Google Drive. ai.google.dev | URL |
Free/Paid |
| ChatGPT | OpenAI's AI chatbot, including GPT-5.2. Best for general purpose, coding, and creative writing. Great for most users. Memory function is currently the best - it remembers what you've said and picks up right where you left off in the next conversation, making it feel most like talking to a real person. | URL | Free/Paid |
| Claude | Anthropic's AI chatbot, including Claude Opus 4.6. Best for coding, long context, safety, and enterprise use. Cowork functionality transforms AI into a true "agent" rather than just a chatbot - can pull financial data, build Excel forecasting models, etc., with high efficiency. | URL | Free/Paid |
| DeepSeek | DeepSeek's AI chatbot. Cost-effective option. API | URL | Free/Paid |
| Grok | xAI's AI chatbot, including grok-4.1-thinking. Best for real-time internet access and Elon Musk's AI vision. Real-time data and news is its moat - can directly access posts on X as information sources, a differentiation that's hard to replicate. x.com/grok | URL | Free/Paid |
| qwen | Alibaba's AI chatbot. Includes Qwen3, Qwen3-Code and other Qwen LLMs. | URL | Free |
| Dola | Bytedance's AI chatbot. Intuitive interface and good general capabilities. | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| OpenClaw | Open-source self-hosted AI agent that runs locally and autonomously executes tasks. Connects to WhatsApp, Telegram, Slack, Discord and other messaging platforms, with browser control, system access, and persistent memory. Developed by Peter Steinberger, gained over 180K GitHub stars, one of the fastest-growing open-source projects | Github |
Free |
| Manus | Manus is the action engine that goes beyond answers to execute tasks, automate workflows, and extend your human reach | URL | Free Trial/Paid |
| AnyGen | AnyGen is the AI assistant that truly "gets work done" for you. From writing and analysis to planning and reporting, it transforms your ideas into ready-to-use professional deliverables in minutes. The AI Assistant Built for Work | URL | Free Trial/Paid |
| Gemini CLI | An open-source AI agent that brings the power of Gemini directly into your terminal. | Github |
Free |
| agentscope | Agent-Oriented Programming for Building LLM Applications, Open-sourced by Alibaba | Github |
Free |
| Auto-GPT | Open source, An experimental open-source attempt to make GPT-4 fully autonomous. | GitHub |
Free |
| microsoft/autogen | AutoGen is an open-source programming framework for building AI agents and facilitating cooperation among multiple agents to solve tasks. | Github |
Free |
| potpie-ai/potpie | Open Source AI Agents for your codebase in minutes. Use pre-built agents for Q&A, Testing, Debugging and System Design or create your own purpose-built agents. | URL , Github |
Free Trial |
| MastraAI | Mastra is an opinionated TypeScript framework that helps you build AI applications and features quickly. It gives you the set of primitives you need: workflows, agents, RAG, integrations and evals | Github |
Free |
| Taskade | AI-native workspace platform. Build apps from prompts, deploy autonomous AI agents with memory and knowledge bases, automate workflows with 100+ integrations, and collaborate in real-time. Supports MCP (Model Context Protocol). Cross-platform: Web, Desktop, Mobile, and Browser Extensions. | URL , Github |
Free/Paid |
| Name | Description | Links | Fees |
|---|---|---|---|
| DeepSeek-R1 | DeepSeek's first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrated remarkable performance on reasoning. | Github |
Free |
| DeepSeek-V3 | A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. | Github |
Free |
| Qwen3 | Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. | Github |
Free |
| Llama 3 | Llama3 is a large language model developed by Meta AI. It is the successor to Meta's Llama2 language model. Online test address: huggingface.co/Meta-Llama-3-70B-Instruct |
GitHub |
Free |
| Mixtral | Mixtral 8x7B, a high-quality sparse mixture of experts model (SMoE) with open weights. Mixtral outperforms Llama 2 70B on most benchmarks with 6x faster inference. It matches or outperforms GPT3.5 on most standard benchmarks. paper:https://arxiv.org/pdf/2401.04088.pdf news:https://mistral.ai/news/mixtral-of-experts/ |
mistral-inference mistral-finetune |
Free |
| grok-1 | A large language model open sourced by xAI | Github |
Free |
| Phi-3 | Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks. | Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| LMSYS Chatbot Arena Leaderboard | LMSYS Chatbot Arena is a crowdsourced open platform for LLM evals. Collected over 1,000,000 human pairwise comparisons to rank LLMs with the Bradley-Terry model and display the model ratings in Elo-scale. | URL | Free |
| Artificial Analysis | Artificial Analysis is a platform that provides AI model and service provider comparisons and benchmarks to help users make informed decisions when choosing AI models and service providers. The platform provides comparative data on a wide range of popular AI models, including OpenAI's GPT-4, Meta's Llama 3, and Anthropic's Claude series, covering performance metrics such as response time, latency, and cost. | URL | Free |
| LiveCodeBench | LiveCodeBench is a holistic and contamination-free evaluation benchmark of LLMs for code that continuously collects new problems over time. Particularly, LiveCodeBench also focuses on broader code-related capabilities, such as self-repair, code execution, and test output prediction, beyond mere code generation. | URL | Free |
| LLM Stats | LLM Stats, the most comprehensive LLM leaderboard, benchmarks and compares API models using daily‑updated, open‑source community data on capability, price, speed, and context length. | URL | Free |
| Price Per Token | Compare LLM API pricing across 200+ models from OpenAI, Anthropic, Google, and more. Includes token counters, cost calculators, and benchmark comparisons. | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| Google AI Studio | Google AI Studio is a free, web-based developer tool that enables you to quickly develop prompts and then get an API key to use in your app development. Available regions | URL | Free |
| NotebookLM | AI Research Assistant developed by Google. Upload PDFs, websites, YouTube videos, audio files, Google Docs, or Google Slides, and NotebookLM will summarize them and make interesting connections between topics. Audio Overview feature can turn your sources into engaging “Deep Dive” discussions with one click. | URL | Free |
| Poe | AI product built by Quora. Can use ChatGPT, Sage, Dragonfly, Claude bots for free. All you need is an email address to register. GPT-4 can be used once a day for free | URL | Free/Paid |
| Cherry Studio | Cherry Studio is a desktop client that supports for multiple LLM providers, available on Windows, Mac and Linux. Support major LLM Cloud Services: OpenAI, Gemini, Anthropic, and more AI Web Service Integration: Claude, Peplexity, Poe, and others Local Model Support with Ollama, LM Studio | Github |
Free |
| HuggingChat | Open source codebase powering the HuggingChat app. URL | Github |
Free |
| Learn about | AI learning Assistant developed by Google.Grasp new topics and deepen your understanding with a conversational learning companion that adapts to your unique curiosity and learning goals. | URL | Free |
| monica | AI assistant that provides help with a variety of tasks such as searching, reading, writing, translating, drawing, and more. Standalone apps and browser plug-ins available | URL chrome extension |
Free/Paid |
| ollama | Get up and running with Llama 2, Mistral, Gemma, and other large language models. | Github |
Free |
| openai/openai-python | The official Python library for the OpenAI API, It is generated from OpenAPI specification with Stainless | Github |
Free, need OpenAPI apikey |
| sashabaranov/go-openai | This library provides unofficial Go clients for OpenAI API. support: ChatGPT, GPT-3, GPT-4, DALL·E 2 | Github |
Free |
| langchain | LangChain is a framework for developing applications powered by language models. | Github |
Free |
| Helicone AI | Helicone is the open-source LLM observability platform for logging, monitoring, and debugging AI applications. | Github |
Free |
| WFGY ProblemMap | Open-source RAG failure-mode checklist and diagnostics toolkit for LLM pipelines (data, embeddings, retrievers, tools, evaluation). MIT-licensed and used by several labs and infra projects as a practical RAG debugging guide. | Github |
Free |
| ChatGPT-Next-Web | One-Click to get a well-designed cross-platform ChatGPT web UI, with GPT3, GPT4 & Gemini Pro support. | Github |
Free |
| screenshot-to-code | This simple app converts a screenshot to HTML/Tailwind CSS. It uses GPT-4 Vision to generate the code and DALL-E 3 to generate similar-looking images. You can now also enter a URL to clone a live website! | GitHub |
Free, need access to GPT-4 Vision |
| Chatbox | Desktop application that uses ChatGPT API (OpenAI API) to store all chat messages and prompts locally, thus reducing the risk of data loss. A bit more stable to use than the web version | GitHub |
Free, requires apikey with OpenAPI |
| together.ai chat | Similar to HuggingChat, with the option of different open source models, support for DeepSeek R1, LLaMA, QWen, Flux Schnell. 60 free messages per day. | URL | Free/Paid |
| gpt-crawler | Crawl a site to generate knowledge files to create your own custom GPT from a URL | Github |
Free |
| ChatGPT-Shortcut | Open source, ChatGPT shortcut commands that double productivity, partitioned by domain and function, can filter prompt words by tag, keyword search and one-click copy. | GitHub |
Free |
| ChatGPT Sidebar | ChatGPT Sidebar is an artificial intelligence assistant you can use while browsing any website. | URL | Free |
| WebChatGPT | Open source, expand the ability of networking to chatgpt | GitHub |
Free |
| AIPRM for ChatGPT | Browser plug-in, providing a series of selected ChatGPT instruction templates, and even creating your own, and adjusting AI tone and writing style | URL | Free |
| MindMac | Feature-rich & privacy-first native ChatGPT app for macOS to use OpenAI, Azure OpenAI, Anthropic Claude, OpenRouter all in one place, designed for maximum productivity. Currently available in 15 languages. | URL | Free/Paid |
| NadirClaw | Open-source LLM router that classifies prompts in ~10ms and routes to the optimal model tier (free/cheap/premium/reasoning). OpenAI-compatible proxy with agentic detection, session pinning, and 429 fallback. | GitHub |
Free |
| chathub | Use different chatbots in one app, currently supporting ChatGPT, new Bing Chat, Google Bard, Claude, and 10+ open-source models including Alpaca, Vicuna, ChatGLM etc. | GitHub |
Free/Paid |
| Harbor | Effortlessly run LLM backends, APIs, frontends, and services with one command. | GitHub |
Free |
| gemini-fullstack-langgraph-quickstart | Get started with building Fullstack Agents using Gemini 2.5 and LangGraph | Github |
Free |
| NoteGPT | NoteGPT is a smart note-taking tool that can record, transcribe, and summarize various content, such as meetings, lectures, podcasts, YouTube videos, news briefings, and articles. | URL | Free/Paid |
| OpenRouter | A unified API gateway for 400+ AI models (OpenAI, Anthropic, Google, Mistral, etc.). Zero markup pricing, 5% commission on inference traffic, supports smart routing/failover | URL | Free/Paid |
| OmniRoute | Self-hostable AI gateway with 4-tier automatic fallback routing across 36+ providers. OpenAI-compatible API with quota tracking and zero-cost fallback to free tiers. | GitHub |
Free |
| Morphik.ai | Open source AI-driven search engine for private documents | URL Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| Claude Code | Anthropic's AI coding assistant with strong long‑context understanding, complex code refactoring and agent capabilities. | Github |
Paid/Free Trial |
| Cursor | A collaborative code editor using GPT | URL | Paid/Free Trial |
| GitHub Copilot | A code writing assistant developed by GitHub and OpenAI | URL | Paid |
| dbForge AI Assistant | AI-powered tool that generates, optimizes, and troubleshoots SQL code; indispensable for developers, DBAs, and analysts | URL | Paid/Free Trial |
| Antigravity | Google AI coding assistant based on Windsurf technology, deeply integrated with Gemini and Google Cloud | URL | Free for Individual Use/Paid |
| Happy Coder | Mobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured | URL GitHub |
Free |
| Trae | ByteDance's AI coding IDE. Trae is your helpful coding partner. It offers features like AI Q&A, code auto-completion, and agent-based AI programming capabilities. | URL | Free |
| Amazon CodeWhisperer | A code writing assistant developed by Amazon | URL | Free for Individual Use |
| scalene | Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals | Github |
Free |
| Kodus | Open Source Code Review Agent | GitHub |
Free/Paid |
| Kagan | AI-powered Kanban TUI for autonomous development workflows. Integrates with Claude Code and OpenCode for ticket-driven AI coding with git worktree isolation and MCP server support. | GitHub |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| Nano Banana/Nano Banana Pro | Google's advanced AI model for image generation and editing. No. 1 in the LMArea Text to Image and Image Edit leadboard. Online website: 1. gemini 2.aistudio 3. lmarea.ai |
URL | Free/Paid |
| Z-Image | Z-Image is a high-performance image generation model recently open-sourced by Alibaba's Tongyi Lab. It strikes a balance between "extreme speed" and "high quality," making it highly suitable for scenarios requiring rapid image generation. Z-Image-Turbo Online Demo: https://huggingface.co/spaces/mrfakename/Z-Image-Turbo | Github |
Free |
| Midjourney | Enter text or pictures to create pictures | URL | Paid |
| ChatGPT Images | GPT Image 1.5 | URL | Free/Paid |
| Photoshop AI | Adobe Photoshop generative-fill | URL | Paid |
| Stable diffusion webui | Open source project, input text or pictures to create pictures, Stable diffusion webui is the GUI of Stable diffusion, and it is an image user interface that visualizes stable diffusion. It also integrates many other useful extension scripts. | GitHub |
Free |
| civitai | civitai.com is a website platform for sharing AI image creation model resources, with a large number of models, has become the main model exchange place in the SD open source community | URL | Free |
| clipdrop | clipdrop by stability.ai. Has many AI image processing tools, such as stable diffusion XL, uncrop, reimage XL, stable doodle. | URL | Free/Paid |
| firefly | Adobe's AI image processing web site | URL | Free/Paid |
| ideogram.ai | Enter text to create pictures. A product developed by a company founded by many ex-Googlers | URL | Free/Paid |
| Nero AI | AI picture upscale, AI repair scratches, AI picture coloring, AI picture noise removal, AI one-click to change the background, AI magical erasing pen, AI portrait. API doc:https://ai.nero.com/ai-api/docs/ | URL | Paid/Trial |
| Skybox AI | Generate 360-degree panoramic images using text prompts | URL | Free/Paid |
| remove.bg | Remove Image Background | URL | Free/Paid |
| ControlNet | ControlNet is a neural network structure to control diffusion models by adding extra conditions. | Github |
Free |
| PixelPanda | AI-powered platform that creates professional product photos, marketing images, UGC-style videos, and AI avatars — no camera or studio needed. | URL | Free/Paid |
| Name | Description | Links | Fees |
|---|---|---|---|
| Dreamina | AI image and video creation tool by ByteDance/CapCut. Powered by Seedance 2.0 model. Supports text-to-image, text-to-video, image-to-video with 2K ultra-clear output | URL | Free/Paid |
| Wan2.6 | AI Video Creation Tool by Alibaba | URL | Paid/Free trial |
| Sora | Sora is an AI model published by OpenAI that can create realistic and imaginative scenes from text instructions. | URL | Paid |
| KLING AI | AI Video Creation Tool by kuaishou. Support text to video, image to video, start-end frame and motion control | URL | Free/Paid |
| hailuoai | AI Video Creation Tool by Minimax | URL | Free/Paid |
| Dream Machine | By Luma AI. Dream Machine is an AI model that makes high quality, realistic videos fast from text and images.Official introductory video | URL | Free/Paid |
| capcut | Subtitle-generated speech, speech recognition, and very convenient and powerful video editing | URL | Free/Paid |
| Runway | Gen-2: Text/Image to video Gen-1: Video to video. Featured video: https://runwayml.com/staff-picks |
URL | Paid/Free trial |
| pixverse | Create Amazing AI Videos from Text & Photos | URL | Paid/Free trial |
| Pika | Text/Image to video | URL | Paid/Free trial |
| Fliki | A website that converts text into audio and video | URL | Free/Paid |
| d-id | Generate digital human dubbing video based on text | URL | Paid/Free trial |
| HeyGen | Generate digital human dubbing video based on text | URL | Paid/Free trial |
| AnimateDiff | AnimateDiff is a plug-and-play module turning most community models into animation generators, without the need of additional training. | Github |
Free |
| vivago.ai/video | Text to Video; Image to Video; 4K enhance | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| together.ai | The AI Acceleration Cloud. Train, fine-tune-and run inference on AI models blazing fast, at low cost, and at production scale. | URL | Free/Paid |
| Name | Description | Links | Fees |
|---|---|---|---|
| f/awesome-chatgpt-prompts | This repo includes ChatGPT prompt curation to use ChatGPT better. | Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| lm-sys/FastChat | An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. | Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| Notion AI | AI-assisted note-taking software | URL | with certain free AI trials, AI features $10/month |
| Deep L Write | English and German writing tools to fix writing errors and rewrite sentences promptly. | URL | Free version to use with text word limit / paid upgrade available |
| grammarly | Edit and correct your grammar, spelling, punctuation, and more with your personal writing assistant, grammar checker, and editor. | URL | Free/Paid |
| TextCraft | Add-in for Microsoft Word that seamlessly integrates essential AI tools, including text generation, proofreading, and more, directly into the user interface. | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| Google Translate | Support text, picture, document and URL | URL | Free |
| Deep L | Accurate and instant translation tool, currently supporting 31 languages | URL | Free/Paid |
| immersive-translate | Open source project. Immersive bilingual web translation extension | GitHub |
Free |
| openai-translator | Open source project. Crossword translation browser plugin and cross-platform desktop application based on ChatGPT API | GitHub |
Free, requires OpenAI API key |
| RTranslator | RTranslator is an open-source, free, and offline real-time translation app for Android. | Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| whisper | OpenAPI open source robust speech recognition model through large-scale weak supervision | GitHub |
Free |
| whisper.cpp | Port of OpenAI's Whisper model in C/C++ | Github |
Free |
| buzz | An open source desktop software based on OpenAI's Whisper to recognize speech and generate subtitles | GitHub |
Free |
| WhisperDesktop | Open source, OpenAI-based Whisper, a desktop application for Windows, uses the GPU for processing, which will be faster than on the CPU with good GPU performance. | GitHub |
Free |
| whisperX | WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) | whisperX |
Free |
| whisper-web | ML-powered speech recognition directly in your browser. Built with Transformers.js. Demo | GitHub |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| index-tts2 | Bilibili's Open-Source Industrial-Grade Controllable High-Efficiency Zero-Sample Text-to-Speech System. Online Demo: https://huggingface.co/spaces/IndexTeam/IndexTTS-2-Demo Paper: https://arxiv.org/abs/2506.21619 |
Github |
Free |
| Azure Text to speech | The best and most realistic voice tools currently available | URL | Paid / 500,000 characters per month free |
| Hailuo AI Text to Speech | Offer over 300 voices in 17 languages and multiple accents, covering a wide range of styles and age groups to provide the voice effects you need. | URL | Limited-time Free |
| coqui-ai/tts | A deep learning toolkit for Text-to-Speech, battle-tested in research and production Online Demo: https://huggingface.co/spaces/coqui/xtts |
Github |
Free |
| elevenlabs | Intelligent AI Text to Speech | URL | Free/Paid |
| netease-youdao/EmotiVoice | A Multi-Voice and Prompt-Controlled TTS Engine. EmotiVoice speaks both English and Chinese, and with over 2000 different voices. The most prominent feature is emotional synthesis, allowing you to create speech with a wide range of emotions, including happy, excited, sad, angry and others. | Github |
Free |
| tetos | A unified interface for multiple Text-to-Speech (TTS) providers. Supported TTS providers: Edge TTS, OpenAI TTS, Azure TTS, Google TTS, Volcengine TTS, Baidu TTS | Github |
Free |
| ChatTTS | ChatTTS is a text-to-speech model designed specifically for dialogue scenario such as LLM assistant. It supports both English and Chinese languages. Our model is trained with 100,000+ hours composed of chinese and english. Website:https://chattts.com/ | Github |
Free |
| Name | Description | Links | Fee |
|---|---|---|---|
| shazam | Download the shazaom app for music recognition, which is pretty fast | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| so-vits-svc | SoftVC VITS Singing Voice Conversion. | GitHub |
Free |
| vocalremover | Extract vocal and music | URL | Free |
| lala.ai | Extract vocal, accompaniment and various instruments from any audio and video | URL | Free/Paid |
| Name | Description | Link | Fees |
|---|---|---|---|
| suno.ai | The AI music creation tool Suno can generate custom songs based on text prompts in mere second | URL | |
| udio | Create music from simple text prompts by specifying topics, genres, and other descriptors which are then transformed into professional quality tracks. | URL | |
| mureka.ai | Text to music | URL | Free/Paid |
| elevenlabs/sound-effects | Imagine a sound and bring it to life, or explore a selection of the best sound effects generated by the community. | URL | Free |
| suno-ai/bark | Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. | Github |
Free |
| audiocraft | Open source library for audio/music generation by Meta, which mainly includes two models, MusicGen: text-to-music model, AudioGen: text-generated sound model. MusicGen Online Demo | GitHub |
Free |
| Stable Audio | AI music and sound effect generation application by stability.ai | URL | Free/Paid |
| OptimizerAI | Sound effect generation Official Introduction |
URL | Free/Paid |
| SFX Engine | AI Sound effect generation | URL | Free/Paid |
| Name | Description | Links | Fees |
|---|---|---|---|
| Seamless | Seamless is a family of AI models that enable more natural and authentic communication across languages.Online Demo | Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| ChatGPT for YouTube | Chrome plugin, quickly summarize Youtube video content, need to log in chatgpt account or apikey | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| AMiner | AI-powered academic research and tech intelligence platform providing paper search, patent search, literature tracking, and scholar profiling | URL | Free |
| alphaxiv | An open academic discussion community based on the arXiv platform that allows users to comment line-by-line, ask questions, and interact in real-time by replacing the paper's linking domain (arxiv.org for alphaxiv.org) directly on the paper's page. And provides AI features such as Ask AI and AI-generated article blogs | URL | Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| Umi-OCR | Comes with a highly efficient offline OCR engine. As long as the computer performance is sufficient, it can be faster than online OCR services. | Github |
Free |
| allenai/olmocr | A toolkit for training language models to work with PDF documents in the wild. Online demo: https://olmocr.allenai.org/ | Github |
Free |
| Name | Description | Links | Fees |
|---|---|---|---|
| AI Virtual Staging | Stage empty rooms instantly with realistic furniture using AI. MLS compliant, fast, and affordable virtual staging for real estate listings. With furniture removal, day to dusk, 2D to 3D floor plan features support. | URL |
| Name | Description | Links | Fees |
|---|---|---|---|
| AI Detect Lab | Professional AI image and deepfake detector specifically optimized for Midjourney v7 and Flux. | URL | Free |