jerad fields's picture

jerad fields

jeradf

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 18 hours ago

Self-Distillation Enables Continual Learning

upvoted a paper about 18 hours ago

Reinforcement Learning via Self-Distillation

upvoted a paper about 18 hours ago

HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation

View all activity

Organizations

upvoted 4 papers about 18 hours ago

Self-Distillation Enables Continual Learning

Paper • 2601.19897 • Published Jan 27 • 37

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published Jan 28 • 50

HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation

Paper • 2603.23871 • Published Mar 25 • 1

Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why

Paper • 2605.10889 • Published May 11 • 6

upvoted a paper about 19 hours ago

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published Apr 1 • 56

upvoted a paper 9 days ago

Controllable User Simulation

Paper • 2605.11519 • Published May 12 • 1

upvoted a paper 23 days ago

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Paper • 2604.24954 • Published Apr 27 • 25

upvoted 2 papers 29 days ago

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published May 12 • 61

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published about 1 month ago • 75

upvoted an article 29 days ago

Article

A New Framework for Evaluating Voice Agents (EVA)

ServiceNow-AI

•

Mar 24

• 95

upvoted 10 papers 2 months ago

Eureka-Audio: Triggering Audio Intelligence in Compact Language Models

Paper • 2602.13954 • Published Feb 15 • 4

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 147

EpochX: Building the Infrastructure for an Emergent Agent Civilization

Paper • 2603.27304 • Published Mar 28 • 47

On-Policy Self-Distillation for Reasoning Compression

Paper • 2603.05433 • Published Mar 5 • 9

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 290

Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Paper • 2603.09906 • Published Mar 10 • 76

Hyperagents

Paper • 2603.19461 • Published Mar 19 • 51

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 198

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published Mar 3 • 106

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 133