Haolin Liu's picture

22

Haolin Liu

lhl616

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

upvoted a paper about 2 months ago

G-Zero: Self-Play for Open-Ended Generation from Zero Data

upvoted a paper about 2 months ago

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

Paper • 2605.21468 • Published May 20 • 51

upvoted 2 papers about 2 months ago

G-Zero: Self-Play for Open-Ended Generation from Zero Data

Paper • 2605.09959 • Published May 11 • 17

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Paper • 2605.08083 • Published May 8 • 70

upvoted a paper 2 months ago

Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization

Paper • 2604.09574 • Published Feb 24 • 30

upvoted 2 papers 5 months ago

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published Feb 4 • 80

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Paper • 2602.03845 • Published Feb 3 • 27

upvoted 3 papers 6 months ago

RelayLLM: Efficient Reasoning via Collaborative Decoding

Paper • 2601.05167 • Published Jan 8 • 31

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Paper • 2601.03986 • Published Jan 7 • 34

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

Paper • 2512.15687 • Published Dec 17, 2025 • 22

upvoted a paper 7 months ago

MotionEdit: Benchmarking and Learning Motion-Centric Image Editing

Paper • 2512.10284 • Published Dec 11, 2025 • 26

updated a model 7 months ago

lhl616/Qwen3-8B-axon-error-aware-128-8-ratio

8B • Updated Nov 29, 2025 • 1

published a model 7 months ago

lhl616/Qwen3-8B-axon-error-aware-128-8-ratio

8B • Updated Nov 29, 2025 • 1

updated a model 7 months ago

lhl616/Qwen3-8B-axon-error-aware-128-8-mixed

8B • Updated Nov 29, 2025 • 1

published a model 7 months ago

lhl616/Qwen3-8B-axon-error-aware-128-8-mixed

8B • Updated Nov 29, 2025 • 1

updated a model 7 months ago

lhl616/Qwen3-8B-Base-axon-ppo

8B • Updated Nov 29, 2025 • 1

published a model 7 months ago

lhl616/Qwen3-8B-Base-axon-ppo

8B • Updated Nov 29, 2025 • 1

updated a model 7 months ago

lhl616/Qwen3-8B-Base-axon-grpo-step-128-8

8B • Updated Nov 29, 2025 • 6

published a model 7 months ago

lhl616/Qwen3-8B-Base-axon-grpo-step-128-8

8B • Updated Nov 29, 2025 • 6

updated a model 7 months ago

lhl616/Qwen3-8B-Base-axon-error-aware-128-8-ratio-new

8B • Updated Nov 29, 2025 • 1

published a model 7 months ago

lhl616/Qwen3-8B-Base-axon-error-aware-128-8-ratio-new

8B • Updated Nov 29, 2025 • 1