Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published Dec 4, 2025 • 167
DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation Paper • 2511.23127 • Published Nov 28, 2025 • 43
LongCodeZip: Compress Long Context for Code Language Models Paper • 2510.00446 • Published Oct 1, 2025 • 106
StableToken: A Noise-Robust Semantic Speech Tokenizer for Resilient SpeechLLMs Paper • 2509.22220 • Published Sep 26, 2025 • 65
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published Feb 13, 2025 • 191
TextGenSHAP: Scalable Post-hoc Explanations in Text Generation with Long Documents Paper • 2312.01279 • Published Dec 3, 2023 • 6
Using Large Language Models to Accelerate Communication for Users with Severe Motor Impairments Paper • 2312.01532 • Published Dec 3, 2023 • 6
Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training Paper • 2312.01663 • Published Dec 4, 2023 • 6
Rejuvenating image-GPT as Strong Visual Representation Learners Paper • 2312.02147 • Published Dec 4, 2023 • 7
Axiomatic Preference Modeling for Longform Question Answering Paper • 2312.02206 • Published Dec 2, 2023 • 10
StableDreamer: Taming Noisy Score Distillation Sampling for Text-to-3D Paper • 2312.02189 • Published Dec 2, 2023 • 11
Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia Paper • 2312.03664 • Published Dec 6, 2023 • 11
Orthogonal Adaptation for Modular Customization of Diffusion Models Paper • 2312.02432 • Published Dec 5, 2023 • 14
OneLLM: One Framework to Align All Modalities with Language Paper • 2312.03700 • Published Dec 6, 2023 • 24
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model Paper • 2312.02238 • Published Dec 4, 2023 • 27