Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models Paper β’ 2603.25716 β’ Published 10 days ago β’ 151
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling Paper β’ 2603.25746 β’ Published 10 days ago β’ 153
MambaEye: A Size-Agnostic Visual Encoder with Causal Sequential Processing Paper β’ 2511.19963 β’ Published Nov 25, 2025 β’ 2
SegviGen: Repurposing 3D Generative Model for Part Segmentation Paper β’ 2603.16869 β’ Published 19 days ago β’ 18
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper β’ 2603.03143 β’ Published Mar 3 β’ 145
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper β’ 2602.18422 β’ Published Feb 20 β’ 30
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 β’ 493
PaperBanana: Automating Academic Illustration for AI Scientists Paper β’ 2601.23265 β’ Published Jan 30 β’ 222
Latent Diffusion Model without Variational Autoencoder Paper β’ 2510.15301 β’ Published Oct 17, 2025 β’ 50
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset Paper β’ 2510.15742 β’ Published Oct 17, 2025 β’ 51
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper β’ 2510.05684 β’ Published Oct 7, 2025 β’ 145
Lynx: Towards High-Fidelity Personalized Video Generation Paper β’ 2509.15496 β’ Published Sep 19, 2025 β’ 13
JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching Paper β’ 2506.23552 β’ Published Jun 30, 2025 β’ 10
Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper β’ 2506.08279 β’ Published Jun 9, 2025 β’ 27
Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features Paper β’ 2504.00557 β’ Published Apr 1, 2025 β’ 15
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation Paper β’ 2503.09641 β’ Published Mar 12, 2025 β’ 42
SANA-Sprint Collection πSANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation β’ 6 items β’ Updated 26 days ago β’ 44
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization Paper β’ 2412.17739 β’ Published Dec 23, 2024 β’ 41