Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
8
wenlong deng
dwenlong
Follow
0 followers
·
3 following
AI & ML interests
None yet
Recent Activity
liked
a model
3 days ago
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-i1-GGUF
liked
a model
3 days ago
SEGAgentRL/LLDS-A-GSPO-Qwen2.5-3B-Ins
upvoted
a
paper
4 days ago
On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral
View all activity
Organizations
dwenlong
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
2 models
3 days ago
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-i1-GGUF
8B
•
Updated
21 days ago
•
7.12k
•
2
SEGAgentRL/LLDS-A-GSPO-Qwen2.5-3B-Ins
Reinforcement Learning
•
3B
•
Updated
21 days ago
•
33
•
1
liked
2 models
21 days ago
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Ins
Reinforcement Learning
•
8B
•
Updated
21 days ago
•
92
•
2
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Base
Reinforcement Learning
•
8B
•
Updated
21 days ago
•
70
•
2
liked
a model
10 months ago
UCSC-VLAA/MedReason-8B
Question Answering
•
8B
•
Updated
Jul 30, 2025
•
853
•
14
liked
a dataset
10 months ago
UCSC-VLAA/MedReason
Viewer
•
Updated
May 27, 2025
•
32.7k
•
538
•
81
liked
2 models
11 months ago
deepseek-ai/DeepSeek-V3-0324
Text Generation
•
685B
•
Updated
Mar 27, 2025
•
265k
•
•
3.09k
junnyu/DeepScaleR-1.5B-Preview-Reproduce
Text Generation
•
2B
•
Updated
Feb 26, 2025
•
9
•
4