Baohao Liao

baohao

https://baohaoliao.github.io/

AI & ML interests

NLP

Recent Activity

upvoted a paper 2 days ago

Self-Hinting Language Models Enhance Reinforcement Learning

Paper • 2602.03143 • Published 4 days ago • 23

submitted a paper to Daily Papers 2 days ago

Self-Hinting Language Models Enhance Reinforcement Learning

Paper • 2602.03143 • Published 4 days ago • 23

upvoted a paper 2 months ago

3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability

Paper • 2409.00119 • Published Aug 28, 2024 • 1

updated a dataset 3 months ago

baohao/Fineweb-Edu-1BT-len2048

Preview • Updated Nov 16, 2025 • 137

published a dataset 3 months ago

baohao/Fineweb-Edu-1BT-len2048

Preview • Updated Nov 16, 2025 • 137

updated a collection 3 months ago

Reinforce-Ada

Collection

Training & test sets and finetuned models • 19 items • Updated Oct 26, 2025 • 3

updated 2 models 3 months ago

RLHFlow/Qwen2.5-Math-1.5B-DAPO-easy

2B • Updated Oct 26, 2025 • 9

RLHFlow/Qwen2.5-Math-1.5B-GRPO-n8-easy

2B • Updated Oct 26, 2025 • 6

published 2 models 3 months ago

RLHFlow/Qwen2.5-Math-1.5B-DAPO-easy

2B • Updated Oct 26, 2025 • 9

RLHFlow/Qwen2.5-Math-1.5B-GRPO-n8-easy

2B • Updated Oct 26, 2025 • 6

updated 2 datasets 4 months ago

RLHFlow/reinforce_ada_hard_prompt_1-5b

Viewer • Updated Oct 16, 2025 • 13.3k • 38

RLHFlow/reinforce_ada_simple_prompt_1-5b

Viewer • Updated Oct 16, 2025 • 25k • 31

updated a collection 4 months ago

Reinforce-Ada

Collection

Training & test sets and finetuned models • 19 items • Updated Oct 26, 2025 • 3

updated a model 4 months ago

RLHFlow/Qwen2.5-Math-7B-Reinforce-Ada-balance-easy

8B • Updated Oct 10, 2025 • 4

published a model 4 months ago

RLHFlow/Qwen2.5-Math-7B-Reinforce-Ada-balance-easy

8B • Updated Oct 10, 2025 • 4
