Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Kai's picture
2 8 1

Kai

KaiLv
SII-xrliu's profile picture Hongling1114's profile picture mengen's profile picture
·

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 6 months ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14, 2025 • 89

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7, 2025 • 39
upvoted 4 papers 10 months ago

DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation

Paper • 2503.06053 • Published Mar 8, 2025 • 138

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13, 2025 • 55

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting

Paper • 2503.00784 • Published Mar 2, 2025 • 13

CritiQ: Mining Data Quality Criteria from Human Preferences

Paper • 2502.19279 • Published Feb 26, 2025 • 10
upvoted a paper 11 months ago

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published Feb 24, 2025 • 73
upvoted a paper over 2 years ago

Full Parameter Fine-tuning for Large Language Models with Limited Resources

Paper • 2306.09782 • Published Jun 16, 2023 • 30
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs