2 4 2

Thanawat Lodkaew

skydddoogg

https://skydddoogg.github.io

Skydddoogg

AI & ML interests

None yet

Recent Activity

upvoted a paper about 11 hours ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

liked a dataset 4 days ago

ishidalab/capcode

upvoted a paper 5 days ago

Mitigating Reward Hacking in RLHF via Advantage Sign Robustness

View all activity

Organizations

upvoted a paper about 11 hours ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

Paper • 2606.16613 • Published 14 days ago • 8

liked a dataset 4 days ago

ishidalab/capcode

Viewer • Updated 7 days ago • 378 • 105 • 1

upvoted a paper 5 days ago

Mitigating Reward Hacking in RLHF via Advantage Sign Robustness

Paper • 2604.02986 • Published Apr 3 • 3

updated a dataset 7 days ago

ishidalab/capcode

Viewer • Updated 7 days ago • 378 • 105 • 1

authored a paper 18 days ago

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

Paper • 2606.07379 • Published 24 days ago • 5

New activity in ishidalab/capcode 18 days ago

Add task category and license metadata

#2 opened 18 days ago by

nielsr

upvoted 2 papers 18 days ago

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

Paper • 2505.18102 • Published May 23, 2025 • 2

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

Paper • 2606.07379 • Published 24 days ago • 5

submitted a paper to Daily Papers 18 days ago

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

Paper • 2606.07379 • Published 24 days ago • 5

published a dataset 21 days ago

ishidalab/capcode

Viewer • Updated 7 days ago • 378 • 105 • 1

authored a paper 4 months ago

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

Paper • 2505.18102 • Published May 23, 2025 • 2

liked a dataset 4 months ago

ishidalab/capbencher

Viewer • Updated 29 days ago • 15.5k • 48 • 2

updated a dataset 4 months ago

ishidalab/capbencher

Viewer • Updated 29 days ago • 15.5k • 48 • 2

Thanawat Lodkaew

AI & ML interests

Recent Activity

Organizations

skydddoogg's activity

Add task category and license metadata