Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

golaxy
/
gogpt-560m

Text Generation
Transformers
PyTorch
Chinese
bloom
text-generation-inference
Model card Files Files and versions
xet
Community
3
  • GoGPT
  • 测试效果
  • TODO
  • 感谢

GoGPT

基于中文指令数据微调BLOOM img.png

训练第一轮足够了,后续第二轮和第三轮提升不大

  • 🚀多样性指令数据
  • 🚀筛选高质量中文数据
模型名字 参数量 模型地址
gogpt-560m 5.6亿参数 🤗golaxy/gogpt-560m
gogpt-3b 30亿参数 🤗golaxy/gogpt-3b

测试效果

img.png img.png img.png img.png img.png img.png

TODO

  • 进行RLFH训练
  • 后续加入中英平行语料

感谢

  • @hz大佬-zero_nlp
  • stanford_alpaca
  • Belle数据
Downloads last month
881
Inference Providers NEW
Text Generation
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Datasets used to train golaxy/gogpt-560m

BelleGroup/train_0.5M_CN

Viewer • Updated Apr 3, 2023 • 519k • 1.12k • 118

BelleGroup/train_1M_CN

Viewer • Updated Apr 3, 2023 • 917k • 857 • 157

BelleGroup/train_3.5M_CN

Viewer • Updated Aug 16, 2023 • 3.61M • 621 • 150

Spaces using golaxy/gogpt-560m 30

🏆
Intel/low_bit_open_llm_leaderboard
🏆
BAAI/open_cn_llm_leaderboard
😻
GTBench/GTBench
🥇
BAAI/open_flageval_vlm_leaderboard
🎨
OPTML-Group/UnlearnCanvas-Benchmark
🏆
gsaivinay/open_llm_leaderboard
🏆
Vikhrmodels/small-shlepa-lb
🏆
kz-transformers/kaz-llm-lb
🐢
BAAI/FlagEval-Robo
🏆
felixz/open_llm_leaderboard
🏆
Vikhrmodels/Russian_Arena_General
🏆
Vikhrmodels/small-shlepa
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs