KABI
dongguanting
60 followers · 97 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
authored a paper, 1 day ago
ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration
upvoted a paper, 1 day ago
ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration
authored a paper, 2 days ago
EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis
Organizations
dongguanting's models (16)
dongguanting/Qwen3-8B-AEPO-DeepSearch • Text Generation • 8B • Updated 25 days ago • 22 • 2
dongguanting/QwQ-32B-AEPO-DeepSearch • Text Generation • 33B • Updated 25 days ago • 13 • 1
dongguanting/QwQ-32B-ARPO-DeepSearch • 33B • Updated 25 days ago • 9 • 1
dongguanting/aepo_light • 8B • Updated Nov 3, 2025 • 5
dongguanting/Qwen2.5-7B-AEPO • Text Generation • 8B • Updated Oct 27, 2025 • 14 • 1
dongguanting/Qwen3-14B-AEPO-DeepSearch • Text Generation • 15B • Updated Oct 21, 2025 • 6 • 1
dongguanting/Qwen2.5-7B-ARPO • Text Generation • 8B • Updated Aug 19, 2025 • 19 • 2
dongguanting/Llama3.1-8B-ARPO • Text Generation • 8B • Updated Aug 12, 2025 • 10 • 1
dongguanting/Qwen2.5-3B-ARPO • Text Generation • 3B • Updated Aug 12, 2025 • 37 • 3
dongguanting/Qwen3-14B-ARPO-DeepSearch • Text Generation • 15B • Updated Aug 12, 2025 • 8 • 5
dongguanting/Qwen3-8B-ARPO-DeepSearch • 8B • Updated Jul 29, 2025 • 14 • 2
dongguanting/Tool-Star-Qwen-7B • Text Generation • 8B • Updated Jun 30, 2025 • 6 • 2
dongguanting/RAG-Critic-3B • Text Generation • 3B • Updated Jun 28, 2025 • 17 • 4
dongguanting/Tool-Star-Qwen-0.5B • Text Generation • 0.6B • Updated Jun 6, 2025 • 2 • 1
dongguanting/Tool-Star-Qwen-1.5B • Text Generation • 2B • Updated Jun 6, 2025 • 2
dongguanting/Tool-Star-Qwen-3B • Text Generation • 3B • Updated May 25, 2025 • 6 • 5