XFACTORS: Disentangled Information Bottleneck via Contrastive Supervision Paper • 2601.21688 • Published Jan 29
Revealing Subtle Phenotypes in Small Microscopy Datasets Using Latent Diffusion Models Paper • 2502.09665 • Published Feb 12, 2025
CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video Paper • 2603.04291 • Published 24 days ago • 13
CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video Paper • 2603.04291 • Published 24 days ago • 13
BFTBrain: Adaptive BFT Consensus with Reinforcement Learning Paper • 2408.06432 • Published Aug 12, 2024
DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation Paper • 2502.11897 • Published Feb 17, 2025
SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer Paper • 2601.16515 • Published Jan 23 • 15
SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer Paper • 2601.16515 • Published Jan 23 • 15
DiffusionBrowser: Interactive Diffusion Previews via Multi-Branch Decoders Paper • 2512.13690 • Published Dec 15, 2025 • 3
Unified Speech-Text Pre-training for Speech Translation and Recognition Paper • 2204.05409 • Published Apr 11, 2022
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language Paper • 2202.03555 • Published Feb 7, 2022
StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis Paper • 2110.08985 • Published Oct 18, 2021