dian yang's picture

5

dian yang

youman

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

upvoted a paper 26 days ago

CaptionQA: Is Your Caption as Useful as the Image Itself?

upvoted a paper over 1 year ago

Multitask Vision-Language Prompt Tuning

View all activity

Organizations

None yet

upvoted 2 papers 26 days ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 25

CaptionQA: Is Your Caption as Useful as the Image Itself?

Paper • 2511.21025 • Published Nov 26 • 27

upvoted 3 papers over 1 year ago

Multitask Vision-Language Prompt Tuning

Paper • 2211.11720 • Published Nov 21, 2022 • 2

HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption

Paper • 2310.01779 • Published Oct 3, 2023 • 4

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29, 2024 • 95