34 23

Xiang Fu

craigxiangfu

AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago

YaRN: Efficient Context Window Extension of Large Language Models

liked a model 28 days ago

mistralai/Mistral-Large-Instruct-2411

liked a dataset 3 months ago

Anthropic/hh-rlhf

View all activity

Organizations

upvoted a paper about 9 hours ago

YaRN: Efficient Context Window Extension of Large Language Models

Paper • 2309.00071 • Published Aug 31, 2023 • 81

liked a model 28 days ago

mistralai/Mistral-Large-Instruct-2411

Updated Jul 28, 2025 • 17.5k • 252

liked a dataset 3 months ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 17.5k • 1.67k

upvoted a paper 4 months ago

Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10, 2025 • 32

liked a model 4 months ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • Updated Nov 4, 2025 • 3.51M • 3.17k

liked a model 5 months ago

google/gemma-3-270m

Text Generation • Updated Aug 14, 2025 • 143k • 991

liked 2 models 6 months ago

allenai/OLMo-2-0425-1B-early-training

Text Generation • 1B • Updated Aug 18, 2025 • 482 • 6

CohereLabs/command-a-reasoning-08-2025

Text Generation • 111B • Updated Jan 13 • 877 • • 133

upvoted 4 collections 7 months ago

upvoted 2 papers 8 months ago

RExBench: Can coding agents autonomously implement AI research extensions?

Paper • 2506.22598 • Published Jun 27, 2025 • 11

In-Context Learning Strategies Emerge Rationally

Paper • 2506.17859 • Published Jun 21, 2025 • 10

liked a model 8 months ago

mistralai/Mistral-Nemo-Instruct-2407

Updated Jul 28, 2025 • 90.1k • 1.65k

upvoted 5 papers 9 months ago

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published Jun 7, 2025 • 71

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers

Paper • 2409.04109 • Published Sep 6, 2024 • 48

DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research

Paper • 2505.19253 • Published May 25, 2025 • 34

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 131

Xiang Fu

AI & ML interests

Recent Activity

Organizations

craigxiangfu's activity