1 1 1

Zhi Zheng

zz1358m

https://zz1358m.github.io/zhizheng.github.io/

AI & ML interests

LLM reasoning, Trustworthy LLM, LLM application, Neural combinatorial optimization.

Recent Activity

liked a model about 2 months ago

zz1358m/SofT-GRPO-master

updated a model about 2 months ago

zz1358m/SofT-GRPO-master

authored a paper about 2 months ago

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

View all activity

Organizations

liked a model about 2 months ago

zz1358m/SofT-GRPO-master

Updated Nov 13, 2025 • 7

updated a model about 2 months ago

zz1358m/SofT-GRPO-master

Updated Nov 13, 2025 • 7

authored a paper about 2 months ago

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

Paper • 2511.06411 • Published Nov 9, 2025 • 17

commented a paper about 2 months ago

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

Paper • 2511.06411 • Published Nov 9, 2025 • 17 •

upvoted a paper about 2 months ago

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

Paper • 2511.06411 • Published Nov 9, 2025 • 17

published a model about 2 months ago

zz1358m/SofT-GRPO-master

Updated Nov 13, 2025 • 7

updated a model 4 months ago

zz1358m/Reasoning-CV

Updated Sep 10, 2025

authored 3 papers 8 months ago

Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design

Paper • 2501.08603 • Published Jan 15, 2025

UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization Problems

Paper • 2407.00312 • Published Jun 29, 2024

Reasoning-CV: Fine-tuning Powerful Reasoning LLMs for Knowledge-Assisted Claim Verification

Paper • 2505.12348 • Published May 18, 2025

published a model 8 months ago

zz1358m/Reasoning-CV

Updated Sep 10, 2025

Zhi Zheng

AI & ML interests

Recent Activity

Organizations

zz1358m's activity