Post 2003
-UPDATED- 4-bit inference is working! The blog post is updated with a code snippet and requirements.txt: https://devquasar.com/uncategorized/all-about-amd-and-rocm/ -UPDATED-
I've played around with an MI100 and ROCm and collected my experience in a blog post: https://devquasar.com/uncategorized/all-about-amd-and-rocm/
Unfortunately, I could not make inference or training work with a model loaded in 8-bit or with BnB, but I did everything else and documented my findings.
Post 5802
RWKV-7 "Goose" preview rc2 => Peak RNN architecture? 😃 Will try to squeeze more performance for the final release. Preview code & model: https://github.com/BlinkDL/RWKV-LM/tree/main/RWKV-v7
Kotokin/sophosympatheia_New-Dawn-Llama-3.1-70B-v1.1-exl2-4.5bpw Text Generation • Updated Aug 15, 2024 • 3 • 2
Qwen2 Collection: Qwen2 language models, both pretrained and instruction-tuned, in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 18 days ago • 374