zhanghang's picture

zhanghang

hangzhang-nlp

·

hangzhang-nlp

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 months ago

Qwen3-VL Technical Report

liked a model 6 months ago

Qwen/Qwen3-VL-2B-Thinking

liked a model 6 months ago

Qwen/Qwen3-VL-2B-Instruct

View all activity

Organizations

upvoted a paper 5 months ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 162

liked 13 models 6 months ago

Qwen/Qwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated Oct 20, 2025 • 69k • 111

Qwen/Qwen3-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Oct 23, 2025 • 98.8M • 372

Qwen/Qwen3-VL-4B-Instruct

Image-Text-to-Text • 4B • Updated Oct 15, 2025 • 2.14M • 376

Qwen/Qwen3-VL-4B-Thinking

Image-Text-to-Text • 4B • Updated Oct 15, 2025 • 445k • 108

Qwen/Qwen3-VL-8B-Instruct

Image-Text-to-Text • 9B • Updated Oct 15, 2025 • 3.94M • • 882

Qwen/Qwen3-VL-8B-Thinking

Image-Text-to-Text • 9B • Updated Nov 26, 2025 • 344k • 203

Qwen/Qwen3-VL-30B-A3B-Instruct-FP8

Image-Text-to-Text • Updated Nov 26, 2025 • 275k • 105

Qwen/Qwen3-VL-30B-A3B-Instruct

Image-Text-to-Text • Updated Nov 26, 2025 • 1.13M • • 565

Qwen/Qwen3-VL-30B-A3B-Thinking

Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 63.6k • • 197

Qwen/Qwen3-VL-235B-A22B-Instruct-FP8

Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 45k • 43

Qwen/Qwen3-VL-235B-A22B-Thinking-FP8

Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 3.95k • 28

Qwen/Qwen3-VL-235B-A22B-Instruct

Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 1.08M • • 383

Qwen/Qwen3-VL-235B-A22B-Thinking

Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 228k • • 389

liked a Space 10 months ago

VideoRefer VideoLLaMA3

VideoRefer x VideoLLaMA3

upvoted a paper 10 months ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8, 2025 • 114

upvoted 4 papers about 1 year ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 308

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Paper • 2406.07476 • Published Jun 11, 2024 • 36

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19, 2025 • 217

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Paper • 2502.13922 • Published Feb 19, 2025 • 27