arxiv:2304.10498
Xiaohang Tang
timxiaohangt
AI & ML interests
Reinforcement Learning, Game Theory
Recent Activity
upvoted a paper about 19 hours ago
LLM-WikiRace Benchmark: How Far Can LLMs Plan over Real-World Knowledge Graphs? published
a model 14 days ago
timxiaohangt/Qwen2.5-1.5B-Open-R1-GRPO updated
a model about 1 month ago
timxiaohangt/DeepSeek-R1-Distill-Qwen-1.5B-GRPO