5 14

Yulianghua

lianghua

AI & ML interests

None yet

Recent Activity

upvoted an article 26 days ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

liked a Space 5 months ago

HuggingFaceTB/smol-training-playbook

liked a model 7 months ago

meituan-longcat/LongCat-Flash-Chat

View all activity

Organizations

upvoted an article 26 days ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27

•

liked a Space 5 months ago

The Smol Training Playbook

📚

3.07k

The secrets to building world-class LLMs

liked a model 7 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24, 2025 • 42.7k • 527

upvoted an article 8 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025

•

767

liked 3 Spaces 9 months ago

LLM训练终极指南 | The Ultra-Scale Playbook

🔥

263

了解LLM训练的方方面面

FineWeb: decanting the web for the finest text data at scale

🍷

1.32k

Read a detailed overview of the FineWeb web‑scale text dataset

The Ultra-Scale Playbook

🌌

3.76k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 11 months ago

Article

DualPipe Explained: A Comprehensive Guide to DualPipe That Anyone Can Understand—Even Without a Distributed Training Background

Feb 28, 2025

•

upvoted an article about 1 year ago

Article

Open R1: Update #3

Mar 11, 2025

•

297

liked a model over 1 year ago

TencentBAC/Conan-embedding-v1

0.3B • Updated Nov 27, 2024 • 309k • 166

upvoted a collection almost 2 years ago

Zephyr ORPO

Collection

Models and datasets to align LLMs with Odds Ratio Preference Optimisation (ORPO). Recipes here: https://github.com/huggingface/alignment-handbook • 3 items • Updated Apr 12, 2024 • 18

liked a model almost 2 years ago

meta-llama/Meta-Llama-3-8B

Text Generation • 8B • Updated Sep 27, 2024 • 3.35M • • 6.5k

liked a dataset almost 2 years ago

Open-Orca/OpenOrca

Viewer • Updated Feb 19, 2025 • 2.94M • 17.6k • 1.51k

liked a Space about 2 years ago

LLaMA Board

🦙

216

Fine-tuning large language model with Gradio UI

liked 3 models about 2 years ago

BAAI/bge-m3

openai/whisper-large-v3

Automatic Speech Recognition • Updated Aug 12, 2024 • 4.78M • • 5.53k

xverse/XVERSE-13B-256K

Text Generation • Updated Jun 28, 2024 • 48 • 31

liked 2 models over 2 years ago

microsoft/phi-2

Text Generation • 3B • Updated Dec 8, 2025 • 1.72M • 3.44k

mistralai/Mixtral-8x7B-v0.1

47B • Updated Jul 24, 2025 • 134k • 1.8k

Yulianghua

AI & ML interests

Recent Activity

Organizations

lianghua's activity

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

The Smol Training Playbook

SmolLM3: smol, multilingual, long-context reasoner

LLM训练终极指南 | The Ultra-Scale Playbook

FineWeb: decanting the web for the finest text data at scale

The Ultra-Scale Playbook

DualPipe Explained: A Comprehensive Guide to DualPipe That Anyone Can Understand—Even Without a Distributed Training Background

Open R1: Update #3

LLaMA Board