BadScientist: Can a Research Agent Write Convincing but Unsound Papers that Fool LLM Reviewers? Paper • 2510.18003 • Published Oct 20, 2025
Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? Paper • 2605.12684 • Published 11 days ago • 11
CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards Paper • 2510.08529 • Published Oct 9, 2025 • 19
TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents Paper • 2602.07274 • Published Feb 6 • 210
SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds Paper • 2512.01078 • Published Nov 30, 2025 • 34
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers Paper • 2403.10476 • Published Mar 15, 2024 • 1
Toward a Team of AI-made Scientists for Scientific Discovery from Gene Expression Data Paper • 2402.12391 • Published Feb 15, 2024
Vision Language Models See What You Want but not What You See Paper • 2410.00324 • Published Oct 1, 2024
Can Vision Language Models Infer Human Gaze Direction? A Controlled Study Paper • 2506.05412 • Published Jun 4, 2025 • 5
EgoPrivacy: What Your First-Person Camera Says About You? Paper • 2506.12258 • Published Jun 13, 2025 • 3
Unified Multimodal Understanding via Byte-Pair Visual Encoding Paper • 2506.23639 • Published Jun 30, 2025 • 4
GenoMAS: A Multi-Agent Framework for Scientific Discovery via Code-Driven Gene Expression Analysis Paper • 2507.21035 • Published Jul 28, 2025 • 3
aiXiv: A Next-Generation Open Access Ecosystem for Scientific Discovery Generated by AI Scientists Paper • 2508.15126 • Published Aug 20, 2025 • 20