Chao Tang

Tangc03

https://tangc03.github.io

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 25 days ago

Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation

Paper • 2512.02457 • Published 25 days ago • 13

upvoted a paper about 1 month ago

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

Paper • 2511.09611 • Published Nov 12 • 68

upvoted 2 papers 2 months ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23 • 55

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21 • 36

upvoted a paper 6 months ago

VMoBA: Mixture-of-Block Attention for Video Diffusion Models

Paper • 2506.23858 • Published Jun 30 • 31

updated a dataset 6 months ago

Tangc03/anyedit_top2000_thinking

Viewer • Updated Jun 25 • 2k • 34 • 1

published a dataset 6 months ago

Tangc03/anyedit_top2000_thinking

Viewer • Updated Jun 25 • 2k • 34 • 1

authored a paper 8 months ago

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8 • 64

upvoted a paper 8 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7 • 82

upvoted a paper 9 months ago

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8 • 64

upvoted a paper about 1 year ago

Generative World Explorer

Paper • 2411.11844 • Published Nov 18, 2024 • 77

authored a paper about 1 year ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 48

liked a dataset about 1 year ago

jianzongwu/MangaZero

Viewer • Updated Dec 11, 2024 • 32.7k • 167 • 33