Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
PKU-ML 's Collections
SSL4RL
G1

G1

updated Jun 2, 2025

Portfolio of models, datasets and demos presented in the paper G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning

Upvote
5

  • PKU-ML/G1-7B

    Text Generation • 8B • Updated Jun 17, 2025 • 14 • 2

  • PKU-ML/G1-3B

    Text Generation • 3B • Updated Jun 17, 2025 • 58 • 1

  • PKU-ML/G1-Direct-SFT-3B

    Text Generation • 3B • Updated Jun 17, 2025 • 8

  • PKU-ML/G1-Direct-SFT-7B

    Text Generation • 8B • Updated Jun 17, 2025 • 4

  • PKU-ML/G1-CoT-SFT-3B

    Text Generation • 3B • Updated Jun 17, 2025 • 10

  • PKU-ML/G1-CoT-SFT-7B

    Text Generation • 8B • Updated Jun 17, 2025 • 6

  • PKU-ML/G1-Zero-3B

    Text Generation • 3B • Updated Jun 17, 2025 • 12

  • PKU-ML/G1-Zero-7B

    Text Generation • 8B • Updated Jun 17, 2025 • 6

  • PKU-ML/Erdos

    Viewer • Updated Jun 1, 2025 • 104k • 170 • 2

  • PKU-ML/Erdos-CoT

    Viewer • Updated Jun 1, 2025 • 4.94k • 42
Upvote
5
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs