view article Article Open Responses: What you need to know +2 evalstate, burtenshaw, merve, pcuenq โข Jan 15 โข 112
Arabic Speech Datasets Collection Best Datasets for Arabic Speech Tasks โข 20 items โข Updated 18 days ago โข 20
view post Post 5650 Thank you @clem (Co-Founder & CEO of Hugging Face) for sharing my dataset on X / Twitter! ronantakizawa/github-top-developers#github #dataset See translation 4 replies ยท ๐ 11 11 โค๏ธ 3 3 ๐ 2 2 ๐ 1 1 + Reply
allenai/llama-3.1-tulu-3-8b-preference-mixture Viewer โข Updated Feb 4, 2025 โข 273k โข 5.65k โข 26
Qwen/Qwen3-30B-A3B-Thinking-2507 Text Generation โข 31B โข Updated Aug 17, 2025 โข 146k โข โข 378
view post Post 1661 Multiple NEW notebooks and scripts added to the Hugging Face Gemma recipes repo!Thanks to the community ๐ซถ, we're adding more and more recipes using Gemma ๐Fine tuning for all modalities, function calling, RAG...Repo: https://github.com/huggingface/huggingface-gemma-recipesWe're also open to new ideas from the community ๐ค! See translation 1 reply ยท ๐ค 4 4 ๐ฅ 1 1 + Reply
view post Post 3536 ByteDance released Tar 1.5B and 7B: image-text in image-text out models, fully open-source ๐ ByteDance-Seed/tar-6864cf0d9fe59a3b91cc4260They have an image tokenizer unified with text, and they de-tokenize using either of two models (LLM and diffusion)The model is actually a full LLM (Qwen2), the tokenizer converts image tokens ๐คฏ See translation ๐ฅ 8 8 โค๏ธ 1 1 + Reply
Josiefied and Abliterated Qwen2.5 Collection The best uncensored models โข 20 items โข Updated Jun 27, 2025 โข 3