Yulianghua's picture

Yulianghua

lianghua

·

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

liked a Space 6 months ago

HuggingFaceTB/smol-training-playbook

liked a model 8 months ago

meituan-longcat/LongCat-Flash-Chat

View all activity

Organizations

upvoted an article about 1 month ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27

•

71

upvoted an article 9 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

769

upvoted an article 11 months ago

Article

DualPipe Explained: A Comprehensive Guide to DualPipe That Anyone Can Understand—Even Without a Distributed Training Background

Feb 28, 2025

•

17

upvoted an article about 1 year ago

Article

Open R1: Update #3

Mar 11, 2025

•

297

upvoted a collection almost 2 years ago

Zephyr ORPO

Models and datasets to align LLMs with Odds Ratio Preference Optimisation (ORPO). Recipes here: https://github.com/huggingface/alignment-handbook • 3 items • Updated Apr 12, 2024 • 18