Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand • 10 days ago • 60
The Smol Training Playbook 📚 The secrets to building world-class LLMs • Running on CPU Upgrade • Featured • 2.59k
Apertus LLM Collection Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1 • 310
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 390