Large Reward Models: Generalizable Online Robot Reward Generation with Vision-Language Models Paper • 2603.16065 • Published Mar 17
The Impact of Large Language Models in Academia: from Writing to Speaking Paper • 2409.13686 • Published Sep 20, 2024
MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query Paper • 2506.03144 • Published Jun 3, 2025 • 8
Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment Paper • 2411.17188 • Published Nov 26, 2024 • 20
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning Paper • 2504.18904 • Published Apr 26, 2025 • 9
An Empirical Study of GPT-4o Image Generation Capabilities Paper • 2504.05979 • Published Apr 8, 2025 • 64
Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens Paper • 2503.01710 • Published Mar 3, 2025 • 6
One Graph Model for Cross-domain Dynamic Link Prediction Paper • 2402.02168 • Published Feb 3, 2024 • 2
PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding Paper • 2501.16411 • Published Jan 27, 2025 • 19