LoRA: Low-Rank Adaptation of Large Language Models
Paper • 2106.09685 • Published • 61
How to use y0mur/voxtral-mini-3b-tr-lora with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="y0mur/voxtral-mini-3b-tr-lora") # Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("y0mur/voxtral-mini-3b-tr-lora", dtype="auto")How to use y0mur/voxtral-mini-3b-tr-lora with PEFT:
Task type is invalid.
This model is a Turkish Automatic Speech Recognition (ASR) LoRA adapter fine-tuned on top of Voxtral-Mini-3B-2507 using QLoRA.
The base Voxtral model is kept fully frozen, and only a small set of Low-Rank Adaptation (LoRA) parameters are trained.
This results in a parameter-efficient, lightweight, and easily deployable ASR model specialized for Turkish speech.
Users are advised to:
import torch
from transformers import AutoProcessor, VoxtralForConditionalGeneration
from peft import PeftModel
BASE_MODEL = "mistralai/Voxtral-Mini-3B-2507"
LORA_REPO = "y0mur/voxtral-mini-3b-tr-lora"
processor = AutoProcessor.from_pretrained(BASE_MODEL)
base_model = VoxtralForConditionalGeneration.from_pretrained(
BASE_MODEL,
device_map="auto",
torch_dtype=torch.float16,
)
model = PeftModel.from_pretrained(base_model, LORA_REPO)
model.eval()
Base model
mistralai/Voxtral-Mini-3B-2507