Jihwan Kim

navvh

Recent Activity

upvoted 2 papers 5 days ago

Self-Distilled RLVR

Paper • 2604.03128 • Published 23 days ago • 166

Where does output diversity collapse in post-training?

Paper • 2604.16027 • Published 9 days ago • 22

upvoted a paper 6 days ago

PersonaVLM: Long-Term Personalized Multimodal LLMs

Paper • 2604.13074 • Published Mar 20 • 46

upvoted 2 papers 8 days ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published 24 days ago • 489

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 12 days ago • 85

upvoted 6 papers 12 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 18 days ago • 321

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

Paper • 2603.27490 • Published 28 days ago • 17

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published 17 days ago • 240

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

Paper • 2604.09459 • Published 13 days ago • 13

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

Paper • 2604.10905 • Published 13 days ago • 28

EXAONE 4.5 Technical Report

Paper • 2604.08644 • Published 17 days ago • 66

upvoted a paper 27 days ago

Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models

Paper • 2603.25750 • Published Mar 20 • 36

upvoted 8 papers about 1 month ago

4DGS360: 360° Gaussian Reconstruction of Dynamic Objects from a Single Video

Paper • 2603.21618 • Published Mar 23 • 15

2Xplat: Two Experts Are Better Than One Generalist

Paper • 2603.21064 • Published Mar 22 • 25

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published Mar 23 • 124

Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models

Paper • 2603.22212 • Published Mar 23 • 126

PEARL: Personalized Streaming Video Understanding Model

Paper • 2603.20422 • Published Mar 20 • 40

SNAP: Speaker Nulling for Artifact Projection in Speech Deepfake Detection

Paper • 2603.20686 • Published Mar 21 • 4

3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model

Paper • 2603.18524 • Published Mar 19 • 58

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published Mar 16 • 153