Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
4
liang
PRO
CharlesLi
Follow
AI & ML interests
Trustworthy Machine Learning
Recent Activity
new
activity
25 days ago
deepcs233/Visual-CoT:
Not compatible with HF Datasets
upvoted
a
paper
3 months ago
LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs
View all activity
Organizations
None yet
CharlesLi
's models
515
Sort: Recently updated
CharlesLi/grpo_5_epoch_graph_task_ins_bsz8_8192_entropy_0005_200
3B
•
Updated
May 3, 2025
•
5
CharlesLi/dapo_5_epoch_graph_task_3B_9216_200
3B
•
Updated
May 3, 2025
•
3
CharlesLi/grpo_5_epoch_graph_task_ins_bsz16_4096_entropy_0001_350
3B
•
Updated
May 3, 2025
•
6
CharlesLi/dapo_5_epoch_graph_task_3B_800
3B
•
Updated
May 2, 2025
•
8
CharlesLi/dapo_5_epoch_graph_task_3B_1150
3B
•
Updated
May 2, 2025
•
6
CharlesLi/grpo_5_epoch_graph_task_ins_bsz16_4096_entropy_0001_400
3B
•
Updated
May 2, 2025
•
6
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_1000
3B
•
Updated
May 2, 2025
•
5
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_1150
3B
•
Updated
May 2, 2025
•
6
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_700
3B
•
Updated
May 1, 2025
•
5
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_600
3B
•
Updated
May 1, 2025
•
5
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_500
3B
•
Updated
May 1, 2025
•
5
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_400
3B
•
Updated
May 1, 2025
•
6
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_300
3B
•
Updated
May 1, 2025
•
4
CharlesLi/grpo_5_epoch_graph_task_ins_large_400
3B
•
Updated
Apr 29, 2025
•
5
CharlesLi/grpo_5_epoch_graph_task_ins_400
3B
•
Updated
Apr 29, 2025
•
4
CharlesLi/grpo_5_epoch_graph_task_ins_large_300
3B
•
Updated
Apr 29, 2025
•
5
CharlesLi/grpo_5_epoch_graph_task_ins_300
3B
•
Updated
Apr 29, 2025
•
4
CharlesLi/grpo_5_epoch_graph_hard_ins_gpu_800
3B
•
Updated
Apr 28, 2025
•
5
CharlesLi/grpo_5_epoch_graph_hard_base_gpu_800
3B
•
Updated
Apr 28, 2025
•
6
CharlesLi/grpo_5_epoch_graph_hard_ins_start_400
3B
•
Updated
Apr 28, 2025
•
4
CharlesLi/grpo_5_epoch_graph_hard_base_start_400
3B
•
Updated
Apr 28, 2025
•
5
CharlesLi/grpo_5_epoch_graph_hard_ins_gpu_700
3B
•
Updated
Apr 28, 2025
•
6
CharlesLi/grpo_5_epoch_graph_hard_base_gpu_700
3B
•
Updated
Apr 28, 2025
•
4
CharlesLi/grpo_5_epoch_graph_hard_ins_gpu_400
3B
•
Updated
Apr 27, 2025
•
6
CharlesLi/grpo_5_epoch_graph_hard_base_gpu_400
3B
•
Updated
Apr 27, 2025
•
4
CharlesLi/grpo_5_epoch_graph_hard_ins_start_200
3B
•
Updated
Apr 27, 2025
•
5
CharlesLi/grpo_5_epoch_graph_hard_ins_gpu_200
3B
•
Updated
Apr 27, 2025
•
5
CharlesLi/grpo_5_epoch_graph_hard_base_start_200
3B
•
Updated
Apr 27, 2025
•
6
CharlesLi/grpo_5_epoch_graph_hard_base_gpu_200
3B
•
Updated
Apr 27, 2025
•
5
CharlesLi/graph_grpo_40
3B
•
Updated
Apr 24, 2025
•
4
Previous
1
2
3
4
...
18
Next