dian yang

youman

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
upvoted a paper 26 days ago
CaptionQA: Is Your Caption as Useful as the Image Itself?
upvoted a paper over 1 year ago
Multitask Vision-Language Prompt Tuning

Organizations

None yet

upvoted 2 papers 26 days ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 25

CaptionQA: Is Your Caption as Useful as the Image Itself?

Paper • 2511.21025 • Published Nov 26 • 27
upvoted 3 papers over 1 year ago

Multitask Vision-Language Prompt Tuning

Paper • 2211.11720 • Published Nov 21, 2022 • 2

HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption

Paper • 2310.01779 • Published Oct 3, 2023 • 4

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29, 2024 • 95