-
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
Paper • 2504.11536 • Published • 63 -
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 277 -
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Paper • 2503.12605 • Published • 35 -
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Paper • 2506.13585 • Published • 273
Av
Avi66
·
AI & ML interests
ML Research , LLMs , Applications
MultiModality
Recent Activity
updated
a collection
13 days ago
TTS
updated
a collection
4 months ago
TTS
updated
a collection
4 months ago
Papers
Organizations
Vlm
-
XiaomiMiMo/MiMo-VL-7B-RL
Image-Text-to-Text • 8B • Updated • 1.23k • 166 -
mradermacher/Janus-Pro-7B-LM-GGUF
7B • Updated • 510 • 36 -
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text • 3B • Updated • 54.7k • 237 -
RedHatAI/Llama-3.2-11B-Vision-Instruct-FP8-dynamic
Text Generation • 11B • Updated • 15.1k • 24
Spaces
Papers
-
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
Paper • 2504.11536 • Published • 63 -
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 277 -
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Paper • 2503.12605 • Published • 35 -
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Paper • 2506.13585 • Published • 273
Tamil llm
Vlm
-
XiaomiMiMo/MiMo-VL-7B-RL
Image-Text-to-Text • 8B • Updated • 1.23k • 166 -
mradermacher/Janus-Pro-7B-LM-GGUF
7B • Updated • 510 • 36 -
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text • 3B • Updated • 54.7k • 237 -
RedHatAI/Llama-3.2-11B-Vision-Instruct-FP8-dynamic
Text Generation • 11B • Updated • 15.1k • 24
TTS
Spaces