stephen-flood
's Collections
Benchmarks
updated
Viewer
•
Updated
•
2.09k
•
24
•
4
Viewer
•
Updated
•
5.82M
•
12.2k
•
43
Viewer
•
Updated
•
231k
•
298k
•
609
Benchmark
•
Updated
•
17.6k
•
412k
•
1.09k
Viewer
•
Updated
•
19.6k
•
23
lighteval/legal_summarization
Viewer
•
Updated
•
26.9k
•
106
•
25
Viewer
•
Updated
•
1.6k
•
164
•
1
lighteval/synthetic_reasoning
Viewer
•
Updated
•
33k
•
167
•
7
lighteval/synthetic_reasoning_natural
Viewer
•
Updated
•
22k
•
98
•
15
Viewer
•
Updated
•
90.3k
•
214
•
3
lighteval/GPT3_unscramble
Viewer
•
Updated
•
50k
•
25
•
1
lighteval/aimo_progress_prize_1
Viewer
•
Updated
•
10
•
7
Viewer
•
Updated
•
1.7k
•
37
Viewer
•
Updated
•
72.5k
•
2.25k
•
140
Viewer
•
Updated
•
860k
•
10.6k
•
521
Text Classification
•
73B
•
Updated
•
31.7k
•
81
Jofthomas/hermes-function-calling-thinking-V1
Viewer
•
Updated
•
3.57k
•
720
•
72
NousResearch/hermes-function-calling-v1
Viewer
•
Updated
•
11.6k
•
1.69k
•
365
Viewer
•
Updated
•
15.7k
•
64
•
5
Viewer
•
Updated
•
621M
•
36.3k
•
84
open-web-math/open-web-math
Viewer
•
Updated
•
6.32M
•
7.51k
•
324