Submitted by
Jielin Qiu
Papers
LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
MMPersuade: A Dataset and Evaluation Framework for Multimodal Persuasion