Boqiang Zhang's picture

Boqiang Zhang

Cyril666

·

https://cyrilsterling.github.io/

CyrilSterling

AI & ML interests

Multi-modal Large Language Models Vision-Language-Action Models

Recent Activity

upvoted a paper 2 days ago

AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation

liked a dataset 15 days ago

tencent/Penguin-Recap-I

liked a model 17 days ago

tencent/Penguin-VL-8B

View all activity

Organizations

upvoted a paper 2 days ago

AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation

Paper • 2603.28068 • Published 6 days ago • 9

liked a dataset 15 days ago

tencent/Penguin-Recap-I

Viewer • Updated 17 days ago • 104M • 1.14k • 15

liked a model 17 days ago

tencent/Penguin-VL-8B

Text Generation • 9B • Updated 25 days ago • 8.49k • 74

updated a model 20 days ago

Cyril666/Penguin-Encoder-Init

Feature Extraction • 0.4B • Updated 20 days ago • 37

published a model 20 days ago

Cyril666/Penguin-Encoder-Init

Feature Extraction • 0.4B • Updated 20 days ago • 37

New activity in tencent/Penguin-VL-8B 25 days ago

Update README.md

#7 opened 25 days ago by

New activity in tencent/Penguin-VL-2B 25 days ago

Update README.md

#6 opened 25 days ago by

upvoted a collection 27 days ago

Penguin-VL

7 items • Updated 1 day ago • 13

New activity in tencent/Penguin-VL-2B 28 days ago

please upload to modelscope

#1 opened 29 days ago by

authored a paper 28 days ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published about 1 month ago • 118

New activity in tencent/Penguin-VL-8B 28 days ago

Update processing_penguinvl.py

#4 opened 28 days ago by

New activity in tencent/Penguin-VL-2B 28 days ago

Update processing_penguinvl.py

#4 opened 28 days ago by

Update processing_penguinvl.py

#3 opened 28 days ago by

New activity in tencent/Penguin-VL-8B 28 days ago

Update processing_penguinvl.py

#3 opened 28 days ago by

Update processing_penguinvl.py

#2 opened 28 days ago by

upvoted a paper 28 days ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published about 1 month ago • 118

liked 2 models 29 days ago

tencent/Penguin-Encoder

Feature Extraction • 0.4B • Updated 28 days ago • 8k • 21

tencent/Penguin-VL-2B

Text Generation • 2B • Updated 25 days ago • 3.4k • 35

authored 2 papers about 1 month ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22, 2025 • 91

What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness

Paper • 2502.14914 • Published Feb 19, 2025