AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons Paper • 2503.05731 • Published Feb 19, 2025 • 3
Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation Paper • 2509.08825 • Published Sep 10, 2025 • 2