·
AI & ML interests
RL, Planning
Organizations
models 27
Text Generation
• 8B • Updated • 2
movefast/Qwen2.5-1.5B-Open-R1-GRPO
2B • Updated • 36
movefast/qwen3_8b_orm_step_20
8B • Updated • 1
movefast/qwen3_8b_orm_step_35
8B • Updated • 1
movefast/OpenR1-Distill-7B
Text Generation
• 8B • Updated • 3
movefast/Qwen2.5-7B-mult-task-sft-v2-2.5e-6
8B • Updated • 1
movefast/Qwen2.5-7B-mult-task-sft-v2-5e-6
8B • Updated • 1
movefast/Qwen2.5-7B-mult-task-sft-v2-1e-5
Text Generation
• 8B • Updated • 2
movefast/Qwen2.5-7B-mult-task-sft-v1-1e-5
Text Generation
• 2B • Updated • 9
movefast/Qwen2.5-7B-mult-task-sft-v1
Text Generation
• 2B • Updated • 2