Yige Li's picture

Yige Li

Liyige

·

https://github.com/bboylyg

bboylyg

AI & ML interests

Trustworthy Machine Learning

Recent Activity

upvoted a paper about 16 hours ago

AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents

upvoted a paper 12 days ago

Internal Safety Collapse in Frontier Large Language Models

new activity about 1 year ago

BackdoorLLM/Backdoored_Dataset:[bot] Conversion to Parquet

View all activity

Organizations

Papers 3

arxiv:2410.19427

arxiv:2408.12798

arxiv:2401.15295

models 0

None public yet

datasets 0

None public yet