1 22 3

jinxu

co1dspring

AI & ML interests

None yet

Recent Activity

upvoted a paper 24 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper about 1 month ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

upvoted a collection 4 months ago

PixMo

View all activity

Organizations

None yet

upvoted a paper 24 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 244

upvoted a paper about 1 month ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 201

upvoted a collection 4 months ago

PixMo

Collection

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 11 days ago • 85

upvoted 6 papers 4 months ago

Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels

Paper • 2508.17437 • Published Aug 20, 2025 • 38

Ovis2.5 Technical Report

Paper • 2508.11737 • Published Aug 15, 2025 • 111

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 291

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

New activity in dddraxxx/sealvqa_spatial 4 months ago

Can not read images from parquet

#2 opened 4 months ago by

co1dspring

upvoted 5 papers 8 months ago

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Paper • 2504.16030 • Published Apr 22, 2025 • 36

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1, 2025 • 36

PrimitiveAnything: Human-Crafted 3D Primitive Assembly Generation with Auto-Regressive Transformer

Paper • 2505.04622 • Published May 7, 2025 • 27

Memorization-Compression Cycles Improve Generalization

Paper • 2505.08727 • Published May 13, 2025 • 5

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published May 12, 2025 • 134

upvoted 2 papers 9 months ago

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

Paper • 2504.07866 • Published Apr 10, 2025 • 12

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10, 2025 • 46

liked 2 datasets 9 months ago

VLM-Reasoning/VCR-Bench

Viewer • Updated May 11, 2025 • 1.03k • 209 • 6

OpenGVLab/MMPR-v1.1

Preview • Updated Apr 13, 2025 • 92 • 46

upvoted a paper 10 months ago

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Paper • 2503.12937 • Published Mar 17, 2025 • 30

jinxu

AI & ML interests

Recent Activity

Organizations

co1dspring's activity

Can not read images from parquet