IlyaGusev/saiga_preferences
Viewer • Updated • 30.6k • 101 • 7
How to use radm/Qwen2.5-32B-simpo-LoRA with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-32B-Instruct")
model = PeftModel.from_pretrained(base_model, "radm/Qwen2.5-32B-simpo-LoRA")This model is a fine-tuned version of ../models/Qwen2.5-32B-Instruct on the custom dataset.
Full model (FP8): radm/Qwen2.5-32B-simpo-FP8
The following hyperparameters were used during training: