-
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
Paper • 2403.16422 • Published • 1 -
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models
Paper • 2403.02246 • Published • 1 -
ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration
Paper • 2504.08591 • Published • 18 -
Minthy/ToriiGate-v0.4-7B
Image-Text-to-Text • 8B • Updated • 339 • 77
Sam Flin
sflindrs
AI & ML interests
None yet
Recent Activity
upvoted a collection about 2 hours ago
Gemma 4 liked a model 1 day ago
mradermacher/XORTRON-GGUF liked a model 1 day ago
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-DistilledOrganizations
None yet
Captioning
- Runtime error17
CogVLMv1 Captionner
⚙17Generate a detailed image description
-
sdasd112132/Vision-8B-MiniCPM-2_5-Uncensored-and-Detailed-4bit
Visual Question Answering • 9B • Updated • 276 • 32 - RunningFeatured561
Vision Arena (Testing VLMs side-by-side)
🖼561Explore Vision Arena’s computer‑vision tools online
-
dphn/dolphin-vision-72b
Text Generation • 73B • Updated • 192 • 133
Favorites
-
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
Paper • 2403.16422 • Published • 1 -
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models
Paper • 2403.02246 • Published • 1 -
ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration
Paper • 2504.08591 • Published • 18 -
Minthy/ToriiGate-v0.4-7B
Image-Text-to-Text • 8B • Updated • 339 • 77
flux
Captioning
- Runtime error17
CogVLMv1 Captionner
⚙17Generate a detailed image description
-
sdasd112132/Vision-8B-MiniCPM-2_5-Uncensored-and-Detailed-4bit
Visual Question Answering • 9B • Updated • 276 • 32 - RunningFeatured561
Vision Arena (Testing VLMs side-by-side)
🖼561Explore Vision Arena’s computer‑vision tools online
-
dphn/dolphin-vision-72b
Text Generation • 73B • Updated • 192 • 133