3 18 27

Ren Tianhe

rentianhe

https://rentainhe.github.io/

rentainhe

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Web World Models

upvoted a paper 10 days ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

upvoted a paper 12 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

View all activity

Organizations

upvoted a paper 5 days ago

Web World Models

Paper • 2512.23676 • Published 6 days ago • 19

upvoted a paper 10 days ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published 12 days ago • 48

upvoted a paper 12 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 13 days ago • 61

liked a Space 16 days ago

Qwen Image Layered

🚀

366

Decompose an image into layers and export as PPTX or ZIP

upvoted a paper 25 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published 26 days ago • 128

upvoted a paper about 1 month ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25, 2025 • 182

upvoted a paper about 2 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 128

liked 2 Spaces 2 months ago

LBM Relighting

✨

414

Fast image relighting using Latent Bridge Matching

IC Light

📈

1.35k

Generate relit images with foreground condition

liked a model 2 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 232k • • 2.32k

upvoted 3 papers 3 months ago

Detect Anything via Next Point Prediction

Paper • 2510.12798 • Published Oct 14, 2025 • 46

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 176

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 184

upvoted 2 papers 6 months ago

Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks

Paper • 2401.14159 • Published Jan 25, 2024 • 6

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 159

upvoted a paper 8 months ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published May 20, 2025 • 133

upvoted a paper about 1 year ago

TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video

Paper • 2411.18671 • Published Nov 27, 2024 • 20

liked a Space about 1 year ago

TAPTR

🏆

Track Any Point Transformer

liked a Space over 1 year ago

Omost

😻

756

Generate images from text prompts using AI

upvoted a paper over 1 year ago

F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions

Paper • 2407.12435 • Published Jul 17, 2024 • 14