shakedzy/Sky-T1_data_steps
Viewer • Updated • 182 • 100
How to use shakedzy/Sky-T1-32B-Steps with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("NovaSky-AI/Sky-T1-32B-Preview")
model = PeftModel.from_pretrained(base_model, "shakedzy/Sky-T1-32B-Steps")This repo contains a LoRA adapter to NovaSky-AI/Sky-T1-32B-Preview for step-by-step thinking. It breaks down the thinking process to steps, allowing better tracking of the thought process.
The model was finetuned without any system prompt.
Each step is outputted in a specific format:
<|step|> TITLE
TEXT
Thus the model's response should adhere to the following format:
<|begin_of_thought|>
<|step|> STEP_1_TITLE
STEP_1_THINKING
<|step|> STEP_2_TITLE
STEP_2_THINKING
...
<|end_of_thought|>
<|begin_of_solution|>
...
<|end_of_solution|>
Base model
Qwen/Qwen2.5-32B