Zhen Yang

andyyang

AI & ML interests

None yet

Organizations

Tencent

upvoted a paper 9 months ago

TransMamba: Flexibly Switching between Transformer and Mamba

Paper • 2503.24067 • Published Mar 31 • 21
upvoted a paper 10 months ago

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Paper • 2502.15499 • Published Feb 21 • 15
upvoted a paper 12 months ago

Lossless KV Cache Compression to 2%

Paper • 2410.15252 • Published Oct 20, 2024 • 1