Inferless
Popular repositories
- triton-co-pilot Public
Generate glue code in seconds to simplify your NVIDIA Triton Inference Server deployments.
- whisper-large-v3 Public template
State-of-the-art speech recognition model for English, delivering accurate transcription across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>
- qwq-32b-preview Public template
A 32B experimental reasoning model for advanced text generation and robust instruction following. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
- deepseek-r1-distill-qwen-32b Public template
A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
Repositories
- chatterbox Public template
Chatterbox is a TTS model by Resemble AI featuring emotion exaggeration control, zero-shot voice cloning, alignment-informed real-time synthesis, and built-in PerTh neural watermarking for responsible, high-quality speech generation. <metadata> gpu: A10 | collections: ["HF_Transformers"] </metadata>
- qwen3-30b-a3b-instruct-2507 Public template
A 30.5B MoE language model from the Qwen team, tuned for broad instruction following, reasoning, multilingual tasks, and agentic tool use. <metadata> gpu: A100 | collections: ["HF_Transformers"] </metadata>
- flux-1-krea-dev Public template
A 12B model distilled from Krea 1, designed to deliver highly photorealistic results. <metadata> gpu: A100 | collections: ["HF_Transformers"] </metadata>
- code-debugging-agent Public
- qwen-image Public
- pyannote-speaker-diarization-3.1 Public template
A state-of-the-art model that segments and labels audio recordings by accurately distinguishing different speakers. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>
- facebook-bart-cnn Public template
A variant of the BART model designed specifically for natural language summarization. It was pre-trained on a large corpus of English text and later fine-tuned on the CNN/Daily Mail dataset. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>