-
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
Paper • 2511.18538 • Published • 275 -
SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models
Paper • 2511.05459 • Published • 3 -
SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios
Paper • 2512.18470 • Published • 7
Jim White PRO
jimwhite
·
AI & ML interests
None yet
Recent Activity
updated
a collection
about 7 hours ago
Coding Benchmarks
updated
a collection
9 days ago
PUP
updated
a collection
9 days ago
PUP