RLMs (Reasoning Language Models)
updated
LADDER: Self-Improving LLMs Through Recursive Problem Decomposition
Paper
• 2503.00735
• Published
• 23
START: Self-taught Reasoner with Tools
Paper
• 2503.04625
• Published
• 113
R1-Searcher: Incentivizing the Search Capability in LLMs via
Reinforcement Learning
Paper
• 2503.05592
• Published
• 27
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with
Reinforcing Learning
Paper
• 2503.05379
• Published
• 38
21B • Updated
• 466
• 389
Viewer
• Updated
• 269 • 1.06k
• 47
Text Generation
• Updated
• 55.8k
• • 2.89k
Text Generation
• 8B • Updated
• 178
• • 183
Text Generation
• 33B • Updated
• 26
• • 156
Reinforcement Learning for Reasoning in Small LLMs: What Works and What
Doesn't
Paper
• 2503.16219
• Published
• 52
predibase/Predibase-T2T-32B-RFT
33B • Updated
• 5
• 20
agentica-org/DeepCoder-1.5B-Preview
Text Generation
• 2B • Updated
• 276
• 74
agentica-org/DeepCoder-14B-Preview
Text Generation
• Updated
• 365
• • 680
Feature Extraction
• Updated
• 1.57k
• 53
deepseek-ai/DeepSeek-R1-0528
Text Generation
• 685B • Updated
• 1.09M
• • 2.41k
nvidia/Nemotron-Research-Reasoning-Qwen-1.5B
Text Generation
• Updated
• 3.65k
• 239
Video-Text-to-Text
• 9B • Updated
• 122
• 23
mistralai/Magistral-Small-2506
24B • Updated
• 65.5k
• 608
microsoft/Phi-4-mini-reasoning
Text Generation
• Updated
• 9.56k
• 217
microsoft/Phi-4-mini-flash-reasoning
Text Generation
• Updated
• 1.67k
• 269
microsoft/Phi-4-reasoning
Text Generation
• Updated
• 8.54k
• 216
osmosis-ai/Osmosis-Apply-1.7B
Text Generation
• 2B • Updated
• 29
• 95
33B • Updated
• 38
• 192
numind/NuMarkdown-8B-Thinking
Image-to-Text
• Updated
• 54.7k
• 446
moonshotai/Kimi-K2-Thinking
Text Generation
• Updated
• 87.1k
• • 1.68k
Text Generation
• Updated
• 1.81k
• 514
MaziyarPanahi/VibeThinker-1.5B-GGUF
Text Generation
• 2B • Updated
• 301
• 35
ServiceNow-AI/Apriel-1.5-15b-Thinker
Image-Text-to-Text
• Updated
• 317
• 464
Image-Text-to-Text
• 10B • Updated
• 34.9k
• • 592