Shekswess/tiny-think-dpo-math-stem-apo_zero-beta1-lr3e-6-e1-bs8

Text Generation

Generated from Trainer

298 MB

1 contributor

History: 3 commits

Update README.md

91aab7e verified 3 months ago