CohereLabs/cohere-transcribe-03-2026 Automatic Speech Recognition • 2B • Updated 23 days ago • 900k • 1.02k
Qwen3 Voice Embedding Collection • Standalone ECAPA-TDNN x-vector speaker encoders extracted from Qwen3-TTS, 1024-dim (0.6B) and 2048-dim (1.7B) • 4 items • Updated Feb 27 • 29
nvidia/canary-1b-flash Automatic Speech Recognition • 0.8B • Updated 4 days ago • 3.73k • 274
ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 93k • 1.21k
Qwen Image Edit ✒ • Running on Zero • Agents • Featured • 827 • Edit images using natural language instructions
Qwen Image 🖼 • Running on Zero • Agents • Featured • 913 • Generate high-quality images from text prompts
docling-project/SmolDocling-256M-preview Image-Text-to-Text • 0.3B • Updated Sep 17, 2025 • 25.6k • 1.61k
nomic-ai/nomic-embed-text-v2-moe Sentence Similarity • 0.5B • Updated Apr 1, 2025 • 839k • 485