Hugging Face
Zhi Zheng
zz1358m
0 followers · 1 following
https://zz1358m.github.io/zhizheng.github.io/
AI & ML interests
LLM reasoning, Trustworthy LLM, LLM application, Neural combinatorial optimization.
Recent Activity
liked a model about 1 month ago: zz1358m/SofT-GRPO-master
updated a model about 2 months ago: zz1358m/SofT-GRPO-master
authored a paper about 2 months ago: SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization
Organizations
zz1358m's models (2)
zz1358m/SofT-GRPO-master · Updated Nov 13 · 7
zz1358m/Reasoning-CV · Updated Sep 10