gn00029914

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago

webml-community/Qwen3.5-0.8B-WebGPU

liked a model 2 days ago

Qwen/Qwen3.5-9B

upvoted a paper 3 days ago

SOM Directions are Better than One: Multi-Directional Refusal Suppression in Language Models

upvoted 2 papers 3 days ago

SOM Directions are Better than One: Multi-Directional Refusal Suppression in Language Models

Paper • 2511.08379 • Published Nov 11, 2025 • 4

EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test

Paper • 2503.01840 • Published Mar 3, 2025 • 6

upvoted a paper 6 days ago

HLE-Verified: A Systematic Verification and Structured Revision of Humanity's Last Exam

Paper • 2602.13964 • Published 18 days ago • 10

upvoted a collection 7 days ago

Qwen3.5

21 items • Updated 2 days ago • 938

upvoted a paper 7 days ago

Discovering Multiagent Learning Algorithms with Large Language Models

Paper • 2602.16928 • Published 14 days ago • 16

upvoted 2 papers 9 days ago

Understanding Silent Data Corruption in LLM Training

Paper • 2502.12340 • Published Feb 17, 2025 • 1

Silent Data Corruption by 10x Test Escapes Threatens Reliable Computing

Paper • 2508.01786 • Published Aug 3, 2025 • 1

upvoted a paper 11 days ago

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published Jan 9 • 56

upvoted 3 papers 14 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 318

Training-Free Long-Context Scaling of Large Language Models

Paper • 2402.17463 • Published Feb 27, 2024 • 24

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 121

upvoted a collection 14 days ago

Apriel-1.6-15B-Thinker

3 items • Updated Dec 16, 2025 • 7

upvoted an article 14 days ago

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Article • Dec 9, 2025 • 84

upvoted 3 papers 19 days ago

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26, 2025 • 93

Towards Autonomous Mathematics Research

Paper • 2602.10177 • Published 23 days ago • 36

SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement

Paper • 2410.20285 • Published Oct 26, 2024 • 1

upvoted 2 collections 20 days ago

Steering Reasoning VLAs

Steering Reasoning VLA in robotics manipulation https://www.arxiv.org/abs/2510.16281 • 2 items • Updated 1 day ago • 1

Nvidia reward models GGUF

4 items • Updated Nov 3, 2025 • 1

upvoted 2 papers 25 days ago

AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models

Paper • 2304.06364 • Published Apr 13, 2023 • 3

Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries

Paper • 2409.12640 • Published Sep 19, 2024 • 3