RLVR Linearity - a Miaow-Lab Collection

Miaow-Lab 's Collections

RLVR Linearity

updated 9 days ago

RL training and evaluation datasets, and checkpoints in 'Linear Dynamics in the RLVR Training of Large Language Models'

Not All Steps are Informative: On the Linearity of LLMs' RLVR Training

Paper • 2601.04537 • Published Jan 8
Miaow-Lab/RLVR-Linearity-Dataset

Viewer • Updated 9 days ago • 40.3k • 31
Miaow-Lab/RLVR-Linearity-Checkpoints

Text Generation • Updated 9 days ago