view article Article Welcome GPT OSS, the new open-source model family from OpenAI! +10 reach-vb, pcuenq, lewtun, clem, Rocketknight1, clefourrier, celinah, Wauplin, marcsun13, pagezyhf, ahadnagy, joaogante • Aug 5, 2025 • 513
view article Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques jmamou • Mar 24, 2025 • 20
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model +6 danielkorat, orenpereg, mber, jmamou, joaogante, lewtun, Nadav-Timor, moshew • Oct 29, 2024 • 61
view article Article Introducing SynthID Text +4 sumedhghaisas, sdathath, RyanMullins, joaogante, marcsun13, RaushanTurganbay • Oct 23, 2024 • 59
view article Article Faster Assisted Generation with Dynamic Speculation +5 jmamou, orenpereg, joaogante, lewtun, danielkorat, Nadav-Timor, moshew • Oct 8, 2024 • 51
view article Article Google releases Gemma 2 2B, ShieldGemma and Gemma Scope +2 Xenova, pcuenq, reach-vb, joaogante • Jul 31, 2024 • 60
view article Article Code Llama: Llama 2 learns to code +6 philschmid, osanseviero, pcuenq, lewtun, lvwerra, loubnabnl, ArthurZ, joaogante • Aug 25, 2023 • 10
view article Article Assisted Generation: a new direction toward low-latency text generation joaogante • May 11, 2023 • 78