arxiv:2605.28109
Hao Jiang
Lutalica
AI & ML interests
Multimodal LLMs, LLM Reasoning, Reinforcement Learning, Efficient Inference
Recent Activity
authored a paper 2 days ago
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use
authored a paper 2 days ago
Long Live The Balance: Information Bottleneck Driven Tree-based Policy Optimization
authored a paper 2 days ago
Pyramid Texture Filtering