In a Training Loop 🔄

33 13 77

aquiffoo

https://aquiffoo.is-a.dev/

AI & ML interests

thanks for everything.

Recent Activity

liked a model about 13 hours ago

zai-org/GLM-4.7-Flash

replied to AdinaY's post 2 days ago

After a VLM, StepFun dropped a new audio model: Step-Audio-R1.1, enabling thinking while speaking 🔥 https://huggingface.co/stepfun-ai/Step-Audio-R1.1 ✨ Apache 2.0 ✨ Combines dual-brain architecture and acoustic-grounded reasoning to enable real-time dialogue with SOTA-level reasoning

reacted to AdinaY's post with 👍 2 days ago

View all activity

Organizations

liked a model about 13 hours ago

zai-org/GLM-4.7-Flash

Text Generation • 31B • Updated about 1 hour ago • • 481

replied to AdinaY's post 2 days ago

i think i'll keep an eye on stepfun this year, they're cooking

reacted to AdinaY's post with 👍 2 days ago

Post

952

After a VLM, StepFun dropped a new audio model: Step-Audio-R1.1, enabling thinking while speaking 🔥

stepfun-ai/Step-Audio-R1.1

✨ Apache 2.0
✨ Combines dual-brain architecture and acoustic-grounded reasoning to enable real-time dialogue with SOTA-level reasoning

2 replies

upvoted a paper 2 days ago

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 5 days ago • 173

liked a model 3 days ago

stepfun-ai/Step3-VL-10B

Image-Text-to-Text • 10B • Updated 41 minutes ago • 9.39k • 143

New activity in huggingface/InferenceSupport 5 days ago

meituan-longcat/LongCat-Flash-Thinking-2601

👍 4

#7453 opened 5 days ago by

aquiffoo

liked a model 5 days ago

meituan-longcat/LongCat-Flash-Thinking-2601

Text Generation • 562B • Updated 4 days ago • 180 • 69

liked a model 6 days ago

zai-org/GLM-Image

Text-to-Image • Updated 5 days ago • 7.59k • • 865

liked 2 datasets 7 days ago

MiniMaxAI/OctoCodingBench

Viewer • Updated 7 days ago • 72 • 7k • 216

ylecun/mnist

Viewer • Updated Aug 8, 2024 • 70k • 65.2k • 223

liked a model 7 days ago

anwgpt/anwgpt4.1-chat

30.7M • Updated 7 days ago • 17 • 2

liked a model 8 days ago

anwgpt/anwgpt4-chat

Text Generation • 27.2M • Updated 8 days ago • 45 • 1

New activity in aquiffoo/neo-3-1B-A90M-Base 9 days ago

Production deployment considerations

#1 opened 16 days ago by

Cagnicolas

liked a model 10 days ago

NousResearch/NousCoder-14B

Text Generation • 15B • Updated 14 days ago • 1.71k • 160

reacted to sergiopaniego's post with 🔥 10 days ago

Post

2197

New GRPO + TRL free Colab notebook out! 🔥

Fine-tune 7B+ models on T4 GPUs thanks to a ton of memory optimizations for GRPO

7B model uses only 9.2 GB VRAM (~7× reduction) 🤯

Try the notebook here 👉 https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_trl_lora_qlora.ipynb

reacted to Reality123b's post with 🤗 11 days ago

Post

2076

Happy birthday to me!!!

2 replies

upvoted a collection 11 days ago

Jamba2

Collection

Jamba2 is a highly-efficient open source family of language models built for maximum reliability and steerability in the enterprise. • 3 items • Updated 12 days ago • 5

liked a model 11 days ago

ai21labs/AI21-Jamba2-3B

Text Generation • 3B • Updated 11 days ago • 992 • 34

liked a model 12 days ago

ai21labs/AI21-Jamba2-Mini

Text Generation • 52B • Updated 11 days ago • 229 • 43

reacted to mlabonne's post with 🚀 12 days ago

Post

4354

New family of 1B models just dropped!

> LiquidAI/LFM2.5-1.2B-Base: 10T → 28T tokens
> LiquidAI/LFM2.5-1.2B-Instruct: new large-scale multi-stage RL
> LiquidAI/LFM2.5-1.2B-JP: our most polite model
> LiquidAI/LFM2.5-VL-1.6B: multi-image multilingual
> LiquidAI/LFM2.5-Audio-1.5B: 8x times faster, no quality loss

Super proud of this release 🤗

3 replies

aquiffoo

AI & ML interests

Recent Activity

Organizations

aquiffoo's activity

meituan-longcat/LongCat-Flash-Thinking-2601

Production deployment considerations