Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:

Duplicated from  bigcode/bigcode-playground

trl-lib
/
trl-text-environment
Sleeping

App Files Files Community
1
Fetching metadata from the HF Docker repository...
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Built a stability adapter on top of TRL's SFTTrainer (CRMA + ZClip) — sharing ablation results

#1 opened 5 days ago by
Fourwheels2512
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs