Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
stoksweet 's Collections
Papers

Papers

updated Jan 5
Upvote
-

  • The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

    Paper • 2402.17764 • Published Feb 27, 2024 • 627

  • Hierarchical Reasoning Model

    Paper • 2506.21734 • Published Jun 26, 2025 • 47

  • Less is More: Recursive Reasoning with Tiny Networks

    Paper • 2510.04871 • Published Oct 6, 2025 • 506

  • Training language models to follow instructions with human feedback

    Paper • 2203.02155 • Published Mar 4, 2022 • 24

  • Best-of-N Jailbreaking

    Paper • 2412.03556 • Published Dec 4, 2024

  • Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

    Paper • 2506.14245 • Published Jun 17, 2025 • 45
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs