akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet 2B • Updated Nov 20 • 7
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet_14BDrafter 2B • Updated Jun 16 • 9
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet_32BDrafter Updated Jun 13
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SpeculativeReasoner_Mini Text Generation • 2B • Updated Jun 11 • 10
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SplitReasoner Text Generation • 2B • Updated Apr 22 • 13
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SpeculativeReasoner Text Generation • 2B • Updated Apr 19 • 21 • 1
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpeculativeReasoner Text Generation • 2B • Updated Apr 17 • 385
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT_GRPO_14k_v3 Text Generation • 2B • Updated Apr 15 • 7
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT_14k Text Generation • 2B • Updated Apr 14 • 7