UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer Paper • 2606.16255 • Published 2 days ago • 8
Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO Paper • 2605.30789 • Published 15 days ago • 22
RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space Paper • 2606.14700 • Published 5 days ago • 11
VideoMDM: Towards 3D Human Motion Generation From 2D Supervision Paper • 2606.13364 • Published 6 days ago • 20
World Model Self-Distillation: Training World Models to Solve General Tasks Paper • 2606.12072 • Published 7 days ago • 13
SwiftVR: Real-Time One-Step Generative Video Restoration Paper • 2606.09516 • Published 9 days ago • 16
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models Paper • 2606.11025 • Published 8 days ago • 41
Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning Paper • 2606.11087 • Published 8 days ago • 3
AHA-WAM:Asynchronous Horizon-Adaptive World-Action Modeling with Observation-Guided Context Routing Paper • 2606.09811 • Published 9 days ago • 14
AAD-1: Asymmetric Adversarial Distillation for One-Step Autoregressive Video Generation Paper • 2606.03972 • Published 15 days ago • 14
Flash-WAM: Modality-Aware Distillation for World Action Models Paper • 2606.05254 • Published 14 days ago • 7
LongLive-RAG: A General Retrieval-Augmented Framework for Long Video Generation Paper • 2606.02553 • Published 16 days ago • 19