pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-planning 8B • Updated 1 day ago • 9
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-reasoning-strategies 8B • Updated 1 day ago • 18
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-self-correct 8B • Updated 1 day ago • 8
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-from-rl 8B • Updated 23 days ago • 13
pittawat/qwen2.5-14b-instruct-still-3-1k-grpo-with-length-0.1-cot-prompt-v6 15B • Updated Dec 3, 2025 • 3
pittawat/qwen2.5-7b-instruct-math-1k-grpo-cot-prompt-new-intermediate-ckpt-124 8B • Updated Dec 3, 2025 • 3
pittawat/qwen2.5-7b-instruct-math-1k-grpo-cot-prompt-new-intermediate-ckpt-93 8B • Updated Dec 3, 2025 • 6
pittawat/qwen2.5-7b-instruct-math-1k-grpo-cot-prompt-new-intermediate-ckpt-62 8B • Updated Dec 3, 2025 • 3
pittawat/qwen2.5-7b-instruct-math-1k-grpo-cot-prompt-new-intermediate-ckpt-31 8B • Updated Dec 3, 2025 • 4
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-intermediate-ckpt-124 8B • Updated Dec 2, 2025 • 4
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-intermediate-ckpt-93 8B • Updated Dec 2, 2025 • 3
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-intermediate-ckpt-62 8B • Updated Dec 2, 2025 • 4
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-intermediate-ckpt-31 8B • Updated Dec 2, 2025 • 3
pittawat/qwen2.5-14b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-v2 15B • Updated Dec 1, 2025 • 1
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-2x-rollout-n 8B • Updated Nov 30, 2025 • 3
pittawat/qwen2.5-7b-instruct-math-dolci-grpo-with-length-0.1-cot-prompt-v6 8B • Updated Nov 30, 2025 • 3
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v1 8B • Updated Nov 27, 2025 • 2
pittawat/qwen2.5-7b-instruct-math-new-10k-grpo-with-length-0.1-cot-prompt-v6 8B • Updated Nov 26, 2025 • 3