Sapiens Collection Foundation models for human tasks. Code: https://github.com/facebookresearch/sapiens • 72 items • Updated Sep 18, 2024 • 60
LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training Paper • 2509.23661 • Published Sep 28, 2025 • 47
Wan2.1 Fast • Generate a video from an image with a prompt • Running on Zero • MCP • Featured • 1.59k
Image Arena Leaderboard • Image Generation and Image Editing Arena & Leaderboard • Running • Featured • 565
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published Feb 14, 2025 • 55
laion/CLIP-ViT-bigG-14-laion2B-39B-b160k Zero-Shot Image Classification • Updated Jan 22, 2025 • 85.7k • 303
Alibaba-NLP/gte-Qwen2-7B-instruct Sentence Similarity • 8B • Updated Mar 24, 2025 • 88.3k • 476