Binfeng Xu

billxbf

AI & ML interests

evolving back to apes

Recent Activity

upvoted a paper 14 days ago

Polar: Agentic RL on Any Harness at Scale

upvoted a paper about 1 month ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

updated a model about 1 month ago

billxbf/qwen3.5-4b-pi-polar

View all activity

Organizations

Collections 2

models 21

billxbf/qwen3.5-4b-pi-polar

4B • Updated May 18 • 2

billxbf/qwen3.5-4b-opencode-polar

4B • Updated May 16 • 2

billxbf/qwen3.5-4b-qwencode-polar

4B • Updated May 14 • 3

billxbf/qwen3.5-4b-claudecode-polar

4B • Updated May 13 • 29

billxbf/qwen3.5-4b-codex-polar-step72

Reinforcement Learning • 5B • Updated May 2 • 3

billxbf/zephyr-7b-dpo-iter1

Text Generation • 274k • Updated Nov 10, 2025 • 82

billxbf/zephyr-7b-dpo-iter3

Text Generation • 266k • Updated Nov 8, 2025 • 86

billxbf/zephyr-7b-dpo-iter2

Text Generation • 266k • Updated Nov 8, 2025 • 8

billxbf/Nano-Raccoon-Preview-1104

425k • Updated Nov 4, 2025 • 3

billxbf/zephyr-7b-sft-iter3

Text Generation • 266k • Updated Nov 4, 2025 • 43

datasets 20

billxbf/math_pile_v3

Viewer • Updated Dec 23, 2025 • 1.52M • 35

billxbf/ultrafeedback-dpo-iter3

Viewer • Updated Nov 12, 2025 • 20.4k • 10

billxbf/ultrafeedback-dpo-iter1

Viewer • Updated Nov 10, 2025 • 20.4k • 6

billxbf/ultrafeedback-dpo-iter2

Viewer • Updated Nov 10, 2025 • 20.4k • 5

billxbf/ultrafeedback-sft-iter3

Viewer • Updated Nov 4, 2025 • 20.4k • 11

billxbf/ultrafeedback-sft-iter2

Viewer • Updated Nov 4, 2025 • 20.4k • 7

billxbf/ultrafeedback-sft-iter1

Viewer • Updated Nov 3, 2025 • 20.4k • 5

billxbf/verified100-chitchat

Viewer • Updated Nov 3, 2025 • 100 • 7

billxbf/verified100-lite

Viewer • Updated Nov 1, 2025 • 100 • 13

billxbf/verified100

Viewer • Updated Oct 30, 2025 • 100 • 8

View 20 datasets