Granite Vision

updated 3 days ago

Multimodal models built for visual document analysis and image understanding.

Upvotes: 41

  • Multimodal RAG with Granite Vision

    Running on Zero • Agents • 42

    RAG example using Granite [vision, embedding, instruct]


  • ibm-granite/granite-vision-4.1-4b

    Image-Text-to-Text • 4B • Updated 8 days ago • 73.4k • 78

  • ibm-granite/granite-4.0-3b-vision

    Image-Text-to-Text • 4B • Updated 25 days ago • 105k • 109

  • ibm-granite/granite-vision-3.3-2b

    Image-to-Text • 3B • Updated Apr 2 • 134k • 83

  • ibm-granite/granite-vision-3.1-2b-preview

    Image-Text-to-Text • Updated Jun 12, 2025 • 965 • 113

  • ibm-granite/granite-vision-3.3-2b-embedding

    Feature Extraction • 3B • Updated Aug 16, 2025 • 66 • 28

  • ibm-granite/granite-vision-3.2-2b

    Image-Text-to-Text • 3B • Updated Apr 2 • 4.69k • 122