AI & ML interests
None defined yet.
test-gen/mbpp_Qwen2.5-Coder-7B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 500 • 6
test-gen/mbpp_Qwen2.5-Coder-3B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 500 • 5
test-gen/mbpp_Qwen2.5-Coder-1.5B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 500 • 6
test-gen/mbpp_Qwen2.5-Coder-0.5B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 500 • 6
test-gen/humaneval_Qwen2.5-Coder-32B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 164 • 6
test-gen/humaneval_Qwen2.5-Coder-14B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 164 • 6
test-gen/humaneval_Qwen2.5-Coder-7B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 164 • 6
test-gen/humaneval_Qwen2.5-Coder-3B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 164 • 6
test-gen/humaneval_Qwen2.5-Coder-1.5B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 164 • 6
test-gen/humaneval_Qwen2.5-Coder-0.5B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 164 • 6
test-gen/livecodebench_Qwen2.5-Coder-32B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 182 • 6
test-gen/livecodebench_Qwen2.5-Coder-14B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 182 • 5
test-gen/livecodebench_Qwen2.5-Coder-7B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 182 • 5
test-gen/livecodebench_Qwen2.5-Coder-3B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 182 • 6
test-gen/livecodebench_Qwen2.5-Coder-1.5B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 182 • 6
test-gen/livecodebench_Qwen2.5-Coder-0.5B-Instruct_t0.0_n1_generated_tests_updated
Viewer
• Updated
• 182 • 6
test-gen/code_livecodebench_qwen2.5-32b_t1.0_n8_tests_livecodebench_qwen2.5-32b_t0.0_n1
Viewer
• Updated
• 182 • 6
test-gen/code_livecodebench_qwen2.5-14b_t1.0_n8_tests_livecodebench_qwen2.5-14b_t0.0_n1
Viewer
• Updated
• 182 • 6
test-gen/code_livecodebench_qwen2.5-3b_t1.0_n8_tests_livecodebench_qwen2.5-3b_t0.0_n1
Viewer
• Updated
• 182 • 6
test-gen/code_livecodebench_qwen2.5-1.5b_t1.0_n8_tests_livecodebench_qwen2.5-1.5b_t0.0_n1
Viewer
• Updated
• 182 • 6
test-gen/code_livecodebench_qwen2.5-0.5b_t1.0_n8_tests_livecodebench_qwen2.5-0.5b_t0.0_n1
Viewer
• Updated
• 182 • 5
test-gen/code_humaneval_qwen2.5-32b_t1.0_n8_tests_humaneval_qwen2.5-32b_t0.0_n1
Viewer
• Updated
• 164 • 5
test-gen/code_humaneval_qwen2.5-14b_t1.0_n8_tests_humaneval_qwen2.5-14b_t0.0_n1
Viewer
• Updated
• 164 • 5
test-gen/code_humaneval_qwen2.5-3b_t1.0_n8_tests_humaneval_qwen2.5-3b_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-1.5b_t1.0_n8_tests_humaneval_qwen2.5-1.5b_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_humaneval_qwen2.5-0.5b_t1.0_n8_tests_humaneval_qwen2.5-0.5b_t0.0_n1
Viewer
• Updated
• 164 • 6
test-gen/code_mbpp_qwen2.5-32b_t1.0_n8_tests_mbpp_qwen2.5-32b_t0.0_n1
Viewer
• Updated
• 500 • 5
test-gen/code_mbpp_qwen2.5-14b_t1.0_n8_tests_mbpp_qwen2.5-14b_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-3b_t1.0_n8_tests_mbpp_qwen2.5-3b_t0.0_n1
Viewer
• Updated
• 500 • 6
test-gen/code_mbpp_qwen2.5-1.5b_t1.0_n8_tests_mbpp_qwen2.5-1.5b_t0.0_n1
Viewer
• Updated
• 500 • 6