Post
94
🚀 Released two Responsible AI lightweight instruction-tuned models focused on toxicity, bias, and safety analysis
Model 1: Responsible AI Safety Assistant (Qwen 2.5)
Kurapika993/qwen2.5-7b-responsible-ai-qlora
Base Model: Qwen2.5-7B-Instruct
Method: QLoRA
Training Data: BeaverTails + Wiki Toxic + custom Responsible AI instruction dataset
Model 2: Responsible AI Assistant (Llama)
Kurapika993/llama-3.1-8b-responsible-ai-safety-lora
Base Model: Llama-3.1-8b Instruct
Method: QLoRA
Training Data: BeaverTails + Wiki Toxic + custom curated examples
This model follows the same structured output format but explores the impact of a different base architecture on safety-analysis tasks.
Intended Use
These models are designed for:
✅ Responsible AI research
✅ Moderation decisions
✅ Safety and bias analysis
✅ Human-in-the-loop moderation workflows
✅ Dataset generation and annotation assistance
Model 1: Responsible AI Safety Assistant (Qwen 2.5)
Kurapika993/qwen2.5-7b-responsible-ai-qlora
Base Model: Qwen2.5-7B-Instruct
Method: QLoRA
Training Data: BeaverTails + Wiki Toxic + custom Responsible AI instruction dataset
Model 2: Responsible AI Assistant (Llama)
Kurapika993/llama-3.1-8b-responsible-ai-safety-lora
Base Model: Llama-3.1-8b Instruct
Method: QLoRA
Training Data: BeaverTails + Wiki Toxic + custom curated examples
This model follows the same structured output format but explores the impact of a different base architecture on safety-analysis tasks.
Intended Use
These models are designed for:
✅ Responsible AI research
✅ Moderation decisions
✅ Safety and bias analysis
✅ Human-in-the-loop moderation workflows
✅ Dataset generation and annotation assistance