2 6 1

Wenyuan Zhang

WYRipple

https://scholar.google.com/citations?user=5weUrvgAAAAJ&hl=zh-CN

WYRipple

AI & ML interests

LLM Social Agent, Role-playing-LLM, Dialogue

Recent Activity

upvoted a paper about 1 month ago

Query-focused and Memory-aware Reranker for Long Context Processing

upvoted a paper 2 months ago

ExpSeek: Self-Triggered Experience Seeking for Web Agents

submitted a paper 2 months ago

ExpSeek: Self-Triggered Experience Seeking for Web Agents

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

Query-focused and Memory-aware Reranker for Long Context Processing

Paper • 2602.12192 • Published Feb 12 • 57

upvoted a paper 2 months ago

ExpSeek: Self-Triggered Experience Seeking for Web Agents

Paper • 2601.08605 • Published Jan 13 • 16

submitted a paper to Daily Papers 2 months ago

ExpSeek: Self-Triggered Experience Seeking for Web Agents

Paper • 2601.08605 • Published Jan 13 • 16

upvoted a paper 3 months ago

Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding

Paper • 2512.17220 • Published Dec 19, 2025 • 113

authored a paper 10 months ago

Revealing the Challenge of Detecting Character Knowledge Errors in LLM Role-Playing

Paper • 2409.11726 • Published Sep 18, 2024

updated 2 collections 10 months ago

Benchmark paper

Collection

2 items • Updated May 29, 2025

SOTOPIA-Ω Checkpoints

Collection

ACL 2025 (main) paper -- SOTOPIA-Ω: Dynamic Strategy Injection Learning and Social Instruction Following Evaluation for Social Agents. • 3 items • Updated May 27, 2025

updated a dataset 10 months ago

WYRipple/RoleKE-Bench

Viewer • Updated May 20, 2025 • 990 • 20

published a dataset 10 months ago

WYRipple/RoleKE-Bench

Viewer • Updated May 20, 2025 • 990 • 20

updated a dataset 11 months ago

WYRipple/S1-Bench

Viewer • Updated May 8, 2025 • 422 • 6 • 5

liked a dataset 11 months ago

WYRipple/S1-Bench

Viewer • Updated May 8, 2025 • 422 • 6 • 5

authored a paper 12 months ago

S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

Paper • 2504.10368 • Published Apr 14, 2025 • 22

upvoted a paper 12 months ago

Debiasing Multimodal Large Language Models via Noise-Aware Preference Optimization

Paper • 2503.17928 • Published Mar 23, 2025 • 2

commented a paper 12 months ago

S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

Paper • 2504.10368 • Published Apr 14, 2025 • 22 •

upvoted a paper 12 months ago

S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

Paper • 2504.10368 • Published Apr 14, 2025 • 22

published a dataset 12 months ago

WYRipple/S1-Bench

Viewer • Updated May 8, 2025 • 422 • 6 • 5

updated a dataset about 1 year ago

WYRipple/sotopia-omega

Viewer • Updated Feb 19, 2025 • 25.9k • 28

published a dataset about 1 year ago

WYRipple/sotopia-omega

Viewer • Updated Feb 19, 2025 • 25.9k • 28

Wenyuan Zhang

AI & ML interests

Recent Activity

Organizations

WYRipple's activity