wenlong deng

dwenlong

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-i1-GGUF

liked a model 3 days ago

SEGAgentRL/LLDS-A-GSPO-Qwen2.5-3B-Ins

upvoted a paper 4 days ago

On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral

liked 2 models 3 days ago

mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-i1-GGUF

8B • Updated 21 days ago • 7.12k • 2

SEGAgentRL/LLDS-A-GSPO-Qwen2.5-3B-Ins

Reinforcement Learning • 3B • Updated 21 days ago • 33 • 1

liked 2 models 21 days ago

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Ins

Reinforcement Learning • 8B • Updated 21 days ago • 92 • 2

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Base

Reinforcement Learning • 8B • Updated 21 days ago • 70 • 2

liked a model 10 months ago

UCSC-VLAA/MedReason-8B

Question Answering • 8B • Updated Jul 30, 2025 • 853 • 14

liked a dataset 10 months ago

UCSC-VLAA/MedReason

Viewer • Updated May 27, 2025 • 32.7k • 538 • 81

liked 2 models 11 months ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27, 2025 • 265k • 3.09k

junnyu/DeepScaleR-1.5B-Preview-Reproduce

Text Generation • 2B • Updated Feb 26, 2025 • 9 • 4