Rethinking Training Dynamics in Scale-wise Autoregressive Generation Paper • 2512.06421 • Published 9 days ago • 5
OmniPSD: Layered PSD Generation with Diffusion Transformer Paper • 2512.09247 • Published 6 days ago • 43
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published 7 days ago • 71
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs Paper • 2512.07525 • Published 7 days ago • 55
Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment Paper • 2511.22345 • Published 18 days ago • 12
Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach Paper • 2512.02834 • Published 13 days ago • 39
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 13 days ago • 211
SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment Paper • 2512.02807 • Published 13 days ago • 7
PixelDiT: Pixel Diffusion Transformers for Image Generation Paper • 2511.20645 • Published 20 days ago • 29
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published 14 days ago • 49
RefineBench: Evaluating Refinement Capability of Language Models via Checklists Paper • 2511.22173 • Published 19 days ago • 12
Architecture Decoupling Is Not All You Need For Unified Multimodal Model Paper • 2511.22663 • Published 18 days ago • 28
Monet: Reasoning in Latent Visual Space Beyond Images and Language Paper • 2511.21395 • Published 19 days ago • 15
GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms Paper • 2511.17592 • Published 28 days ago • 118
Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation Paper • 2511.20714 • Published 21 days ago • 45