12

Guowen Zhang

Lostgreen

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 19 days ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9 • 129

upvoted a paper 24 days ago

VideoSSR: Video Self-Supervised Reinforcement Learning

Paper • 2511.06281 • Published Nov 9 • 24

upvoted 4 papers 3 months ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 129

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 130

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 160

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14 • 144

upvoted a paper 6 months ago

Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents

Paper • 2505.23450 • Published May 29 • 9

upvoted 4 papers 7 months ago

BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization

Paper • 2505.16640 • Published May 22 • 3

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

Paper • 2505.22334 • Published May 28 • 36

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Paper • 2505.22453 • Published May 28 • 46

MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks

Paper • 2505.16459 • Published May 22 • 45

updated a dataset 7 months ago

Lostgreen/BadVLA

Updated May 23 • 40

published a dataset 7 months ago

Lostgreen/BadVLA

Updated May 23 • 40

upvoted a paper 7 months ago

EfficientLLM: Efficiency in Large Language Models

Paper • 2505.13840 • Published May 20 • 24