ttttonyhe's Collections
Red-Teaming Datasets
Updated 12 days ago
mlabonne/orca-agentinstruct-1M-v1-cleaned • Viewer • Updated Jan 25, 2025 • 1.05M • 242 • 67
walledai/AdvBench • Viewer • Updated Jul 4, 2024 • 520 • 11.3k • 100
jkazdan/HeX-PHI-usable • Viewer • Updated Dec 26, 2024 • 300 • 47
walledai/HarmBench • Viewer • Updated Jul 31, 2024 • 400 • 17.3k • 43
allenai/wildjailbreak • Viewer • Updated Aug 8, 2024 • 2.21k • 8.62k • 129
walledai/XSTest • Viewer • Updated Jul 4, 2024 • 450 • 11k • 23
walledai/StrongREJECT • Viewer • Updated Oct 18, 2024 • 313 • 4.13k • 22
LLM-Tuning-Safety/HEx-PHI • Preview • Updated Aug 19, 2024 • 627 • 64