
zhan

zzzzzzfdfd
AI & ML interests

None yet

Organizations

Alibaba-PAI

upvoted a paper 5 months ago

Ovis2.5 Technical Report

Paper • 2508.11737 • Published Aug 15, 2025 • 111
upvoted a paper 7 months ago

Ovis-U1 Technical Report

Paper • 2506.23044 • Published Jun 29, 2025 • 61
upvoted a paper 9 months ago

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5, 2025 • 80
upvoted a collection 12 months ago

Ovis2

Collection
Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated Mar 25, 2025 • 65
upvoted 5 papers over 1 year ago

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Paper • 2406.02657 • Published Jun 4, 2024 • 41

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion

Paper • 2406.03184 • Published Jun 5, 2024 • 21

Ovis: Structural Embedding Alignment for Multimodal Large Language Model

Paper • 2405.20797 • Published May 31, 2024 • 30

Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration

Paper • 2406.01014 • Published Jun 3, 2024 • 33

Parrot: Multilingual Visual Instruction Tuning

Paper • 2406.02539 • Published Jun 4, 2024 • 36