Qingyan's picture

Qingyan PRO

QingyanBai

·

https://bqy.info/

AI & ML interests

Generative Models, UMMs, and Agents.

Recent Activity

liked a dataset 1 day ago

KlingTeam/Scene-Decoupled-Video-dataset

liked a dataset 6 days ago

liguang0115/EgoEdit

upvoted a paper 13 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

View all activity

Organizations

upvoted a paper 13 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published 21 days ago • 100

upvoted a paper 22 days ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published 25 days ago • 97

upvoted a paper 25 days ago

Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control

Paper • 2602.18422 • Published Feb 20 • 30

upvoted 2 papers about 2 months ago

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published Jan 30 • 220

Advancing Open-source World Models

Paper • 2601.20540 • Published Jan 28 • 133

upvoted 2 papers 3 months ago

Kling-Omni Technical Report

Paper • 2512.16776 • Published Dec 18, 2025 • 173

Exploring MLLM-Diffusion Information Transfer with MetaCanvas

Paper • 2512.11464 • Published Dec 12, 2025 • 15

upvoted 4 papers 4 months ago

RELIC: Interactive Video World Model with Long-Horizon Memory

Paper • 2512.04040 • Published Dec 3, 2025 • 24

MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues

Paper • 2512.03046 • Published Dec 2, 2025 • 12

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 263

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 214

upvoted 5 papers 5 months ago

World Simulation with Video Foundation Models for Physical AI

Paper • 2511.00062 • Published Oct 28, 2025 • 45

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22, 2025 • 30

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Paper • 2510.20822 • Published Oct 23, 2025 • 41

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17, 2025 • 51

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7, 2025 • 145

upvoted 2 papers 6 months ago

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18, 2025 • 111

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

upvoted 2 papers 9 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 162

Rethinking Verification for LLM Code Generation: From Generation to Testing

Paper • 2507.06920 • Published Jul 9, 2025 • 29