Jingcheng Liang
leoleung04
AI & ML interests
None yet
Recent Activity
upvoted a paper about 15 hours ago
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients upvoted a paper about 2 months ago
Abstain-R1: Calibrated Abstention and Post-Refusal Clarification via Verifiable RL upvoted a paper 2 months ago
SPPO: Sequence-Level PPO for Long-Horizon Reasoning TasksOrganizations
None yet