Open-AgentRL Collection RLAnything & DemyAgent: Open-Source RL for LLMs and Agentic Scenarios β’ 12 items β’ Updated Feb 3 β’ 7
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method β’ 30 items β’ Updated Feb 25 β’ 139
view article Article PP-OCRv5 on Hugging Face: A Specialized Approach to OCR baidu β’ Sep 10, 2025 β’ 111
mmBERT: a modern multilingual encoder Collection mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance β’ 16 items β’ Updated Sep 9, 2025 β’ 53
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 smohammadi, siro1, winglian, marcsun13, djsaunde β’ Aug 8, 2025 β’ 98
Llama 4 Collection Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! β’ 15 items β’ Updated 25 days ago β’ 56
On Teacher Hacking in Language Model Distillation Paper β’ 2502.02671 β’ Published Feb 4, 2025 β’ 18
π§ Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community β’ 24 items β’ Updated May 19, 2025 β’ 190
view article Article Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2 +4 RQlee, ArthurZ, achikundu, lwtr, rganti, mayank-mishra β’ Aug 21, 2024 β’ 41
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. β’ 43 items β’ Updated Mar 2 β’ 720
The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) β’ 13 items β’ Updated Nov 18, 2024 β’ 265
Standard-format-preference-dataset Collection We collect the open-source datasets and process them into the standard format. β’ 12 items β’ Updated Mar 2 β’ 26
FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! β’ 42 items β’ Updated Mar 2 β’ 80
Korean Datasets I've released so far. Collection μ§κΈκΉμ§ μ λ‘λν νκ΅μ΄ λ°μ΄ν°μ μ½λ μ μ λλ€. β’ 8 items β’ Updated May 24, 2024 β’ 21