1 22 6

Ruobing Xie

Ruobing-Xie

https://ruobingxie.github.io/

AI & ML interests

Recommender System; Large Language Model; Natural Language Processing; Information Retrieval

Recent Activity

upvoted an article about 1 month ago

Why Did MiniMax M2 End Up as a Full Attention Model?

upvoted a paper 3 months ago

Why Language Models Hallucinate

upvoted a paper 4 months ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

View all activity

Organizations

None yet

upvoted an article about 1 month ago

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30

•

upvoted a paper 3 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

upvoted a paper 4 months ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2 • 238

liked a Space 5 months ago

Hunyuan Turbos

💬

hunyuan-turbos模型体验

liked a model 5 months ago

tencent/Hunyuan-A13B-Instruct

Text Generation • 80B • Updated Aug 21 • 9.39k • 785

authored a paper 6 months ago

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Paper • 2505.22653 • Published May 28 • 66

upvoted 3 papers 6 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Paper • 2505.22653 • Published May 28 • 66

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 131

upvoted 2 papers 9 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 171

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published Mar 10 • 61

liked a dataset 9 months ago

AIMClab-RUC/PhD

Viewer • Updated Apr 6 • 17.6k • 824 • 3

upvoted 2 papers 10 months ago

HMoE: Heterogeneous Mixture of Experts for Language Modeling

Paper • 2408.10681 • Published Aug 20, 2024 • 10

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 125

upvoted a paper 11 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 429

authored a paper 11 months ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 44

upvoted 3 papers 11 months ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 44

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Paper • 2501.12202 • Published Jan 21 • 48

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 63

authored a paper 11 months ago

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published Jan 5 • 26

Ruobing Xie

AI & ML interests

Recent Activity

Organizations

Ruobing-Xie's activity

Why Did MiniMax M2 End Up as a Full Attention Model?

Hunyuan Turbos