Pre-training Distillation for Large Language Models: A Design Space Exploration Paper โข 2410.16215 โข Published Oct 21, 2024 โข 17
A Survey of Reinforcement Learning for Large Reasoning Models Paper โข 2509.08827 โข Published Sep 10 โข 190
DeepPrune Collection Parallel Scaling without Inter-trace Redundancy โข 3 items โข Updated Oct 10 โข 1
DeepPrune: Parallel Scaling without Inter-trace Redundancy Paper โข 2510.08483 โข Published Oct 9 โข 24