4 21 6

Bingxiang He

hbx

https://hbx-hbx.github.io/

AI & ML interests

NLP

Recent Activity

updated a model about 22 hours ago

hbx/JustRL-Nemotron-1.5B

updated a model about 22 hours ago

hbx/JustRL-DeepSeek-1.5B

new activity 6 days ago

hbx/JustRL-Nemotron-1.5B:Add Hugging Face paper link badge to model card

View all activity

Organizations

updated 2 models about 22 hours ago

hbx/JustRL-Nemotron-1.5B

Text Generation • 2B • Updated about 22 hours ago • 334 • 2

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated about 22 hours ago • 416 • 8

New activity in hbx/JustRL-Nemotron-1.5B 6 days ago

Add Hugging Face paper link badge to model card

#1 opened 6 days ago by

nielsr

New activity in hbx/JustRL-DeepSeek-1.5B 6 days ago

Improve model card: Update title, add paper link, correct license and citation

#1 opened 6 days ago by

nielsr

commented a paper 7 days ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published 8 days ago • 22 •

upvoted a paper 7 days ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published 8 days ago • 22

submitted a paper to Daily Papers 7 days ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published 8 days ago • 22

upvoted a paper about 1 month ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17 • 133

liked 2 models about 1 month ago

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated about 22 hours ago • 416 • 8

hbx/JustRL-Nemotron-1.5B

Text Generation • 2B • Updated about 22 hours ago • 334 • 2

upvoted a paper about 2 months ago

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents

Paper • 2511.02734 • Published Nov 4 • 20

updated 2 models about 2 months ago

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated about 22 hours ago • 416 • 8

hbx/JustRL-Nemotron-1.5B

Text Generation • 2B • Updated about 22 hours ago • 334 • 2

updated a collection about 2 months ago

JustRL

Collection

2 items • Updated Nov 1 • 1

published a model about 2 months ago

hbx/JustRL-Nemotron-1.5B

Text Generation • 2B • Updated about 22 hours ago • 334 • 2

updated a collection about 2 months ago

JustRL

Collection

2 items • Updated Nov 1 • 1

published a model about 2 months ago

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated about 22 hours ago • 416 • 8

authored 3 papers 3 months ago

Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents

Paper • 2402.09205 • Published Feb 14, 2024

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

Paper • 2504.03612 • Published Apr 4 • 2

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9 • 93

Bingxiang He

AI & ML interests

Recent Activity

Organizations

hbx's activity

Add Hugging Face paper link badge to model card

Improve model card: Update title, add paper link, correct license and citation