Everyday Physics in Korean Contexts: A Culturally Grounded Physical Reasoning Benchmark
Paper
•
2509.17807
•
Published
•
1
AI Safety & AI Security
COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates