Zhi Zheng

zz1358m

https://zz1358m.github.io/zhizheng.github.io/

AI & ML interests

LLM reasoning, Trustworthy LLM, LLM application, Neural combinatorial optimization.

Recent Activity

liked a model about 1 month ago

zz1358m/SofT-GRPO-master

updated a model about 2 months ago

zz1358m/SofT-GRPO-master

authored a paper about 2 months ago

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

zz1358m's models (2)

zz1358m/SofT-GRPO-master

Updated Nov 13 • 7

zz1358m/Reasoning-CV