view article Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 10 days ago • 32
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 28 days ago • 113
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 • 63
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 271
Running on Zero Featured 584 Wan 2 2 First Last Frame 💻 584 Generate videos from start and end images with prompts
view article Article Building for an Open Future - our new partnership with Google Cloud Nov 13, 2025 • 46
view article Article High-Quality Datasets for Far-Field ASR (Treble Technologies x Hugging Face) Oct 13, 2025 • 16