Olmo 3.1 - a allenai Collection

allenai 's Collections

Olmo 3

Olmo 3 Pre-training

Olmo 3 Post-training

MolmoAct Data Mixture

IFBench

OLMo 2

olmOCR

OLMoE (January 2025)

PixMo

Tulu 3 Datasets

Molmo

OLMoE (November 2024)

Tulu V2.5 Suite

Paloma

SciRIFF

AI2 Safety Toolkit

Zebra Logic Bench

OLMo 2 Preview Post-trained Models

ACE

Olmo 3.1

updated 1 day ago

The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets...

allenai/Olmo-3.1-32B-Think

Text Generation • 32B • Updated 1 day ago • 580 • 19

Note 📈 Scaling RL to make our latest model.
allenai/Olmo-3.1-32B-Instruct-SFT

32B • Updated 1 day ago • 445 • 3
allenai/Olmo-3.1-32B-Instruct-DPO

Text Generation • 32B • Updated 1 day ago • 723 • 2
allenai/Olmo-3.1-32B-Instruct

Text Generation • 32B • Updated about 24 hours ago • 155 • 8

Note 💨 Our best model yet for chat & sensitive tasks.
allenai/Olmo-3.1-7B-RL-Zero-Math

Text Generation • 528k • Updated 1 day ago • 7 • 3

Note 🧮 Improved RL Zero performance & more training steps!
allenai/Olmo-3.1-7B-RL-Zero-Code

Text Generation • 528k • Updated 1 day ago • 7 • 4

Note 💻 Improved RL Zero performance & more training steps!
allenai/Dolci-Think-RL-7B-Completions-SFT

Viewer • Updated 1 day ago • 636k • 3 • 3

Note Large datasets of completions used to filter prompts for our RL runs.
allenai/Dolci-Think-RL-7B-Completions-DPO

Viewer • Updated 1 day ago • 556k • 5 • 1
allenai/Dolci-DPO-Model-Response-Pool

Viewer • Updated 2 days ago • 71.2M • 44

Note A very large set of completions across many models for preference tuning and reward modeling research.