Open to Collab

432 2535 2423

taesiri PRO

taesiri

https://taesiri.ai/

AI & ML interests

AGI ... one linear layer at a time

Recent Activity

submitted a paper about 9 hours ago

DeepCode: Open Agentic Coding

submitted a paper about 10 hours ago

EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce

submitted a paper about 10 hours ago

TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels

View all activity

Organizations

commented 6 papers 1 day ago

commented 5 papers 2 days ago

SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling

Paper • 2512.05343 • Published 5 days ago • 11 •

ProPhy: Progressive Physical Alignment for Dynamic World Simulation

Paper • 2512.05564 • Published 5 days ago • 3 •

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Paper • 2512.05965 • Published 5 days ago • 36 •

Self-Improving VLM Judges Without Human Annotations

Paper • 2512.05145 • Published 8 days ago • 15 •

World Models That Know When They Don't Know: Controllable Video Generation with Calibrated Uncertainty

Paper • 2512.05927 • Published 5 days ago • 10 •

commented 5 papers 5 days ago

SIMA 2: A Generalist Embodied Agent for Virtual Worlds

Paper • 2512.04797 • Published 6 days ago • 17 •

TV2TV: A Unified Framework for Interleaved Language and Video Generation

Paper • 2512.05103 • Published 6 days ago • 12 •

On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral

Paper • 2512.04220 • Published 7 days ago • 11 •

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published 7 days ago • 146 •

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published 6 days ago • 71 •

commented 2 papers 6 days ago

RELIC: Interactive Video World Model with Long-Horizon Memory

Paper • 2512.04040 • Published 7 days ago • 21 •

ViDiC: Video Difference Captioning

Paper • 2512.03405 • Published 7 days ago • 26 •

commented 2 papers 7 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 8 days ago • 193 •

GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning

Paper • 2512.02423 • Published 8 days ago • 3 •

taesiri PRO

AI & ML interests

Recent Activity

Organizations

taesiri's activity