University of Wisconsin - Madison

university

Verified

https://www.wisc.edu

Recent Activity

jungtaekkim authored a paper about 6 hours ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

jpark677 submitted a paper about 22 hours ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

gabeorlanski submitted a paper 21 days ago

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

uw-madison's Papers (11)

Submitted by

Jaden Park

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Submitted by

Gabe Orlanski

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

Submitted by

Tengyang Xie

Understanding Behavior Cloning with Action Quantization

2

Submitted by

Tengyang Xie

Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States

2

Submitted by

Jiayu (Mila) Wang

SkillOrchestra: Learning to Route Agents via Skill Transfer

Submitted by

Jongwon Jeong

TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents

Submitted by

Changdae Oh

Thinking Makes LLM Agents Introverted: How Mandatory Thinking Can Backfire in User-Engaged Agents

2

Submitted by

Changdae Oh

Towards Reducible Uncertainty Modeling for Reliable Large Language Model Agents

3

Submitted by

Mu Cai

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

UniTalk: Towards Universal Active Speaker Detection in Real World Scenarios

VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data
