allura-org/Luna-27B-v0

Fizzarolli committed 4 days ago

Commit ed9bfec · verified · 1 Parent(s): a048b84

Update README.md

Files changed (1)

README.md +1 -0

@@ -53,6 +53,7 @@ Furthermore, we are also planning to look into proper RLVR to make her better at
 A more detailed writeup will be released whenever we get to a more final model, but currently this model has undergone
 - Abliteration
 - SFT on primarily roleplay logs and scrapes
+- A merge to heal some of the bad characteristics of the SFT phase with the original abliteration, epoch 1, and epoch 2 of the SFT
 - WPO\[2\] on general preference data, writing data, and Luna persona data
 ## Citations
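
Note on the added merge step: the README line does not say which merge method or weights were used. As a rough illustration only, the sketch below assumes a plain weighted average of parameters across the three checkpoints named in the diff (the abliterated base, SFT epoch 1, and SFT epoch 2). The function `merge_checkpoints`, the parameter name `layer.weight`, and the toy values are all hypothetical; real checkpoints would hold tensors rather than lists of floats, and the actual merge may have used a different technique entirely.

```python
from typing import Dict, List, Sequence

# Hypothetical stand-in for a real state dict: parameter name -> flat list of floats.
StateDict = Dict[str, List[float]]


def merge_checkpoints(checkpoints: Sequence[StateDict],
                      weights: Sequence[float]) -> StateDict:
    """Weighted average of matching parameters (an assumed merge method)."""
    if not checkpoints:
        raise ValueError("need at least one checkpoint")
    if len(checkpoints) != len(weights):
        raise ValueError("one weight per checkpoint is required")
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    # Normalize so the merged parameters stay on the same scale as the inputs.
    norm = [w / total for w in weights]

    keys = set(checkpoints[0])
    for ckpt in checkpoints[1:]:
        if set(ckpt) != keys:
            raise ValueError("checkpoints must share the same parameter names")

    merged: StateDict = {}
    for name in checkpoints[0]:
        params = [ckpt[name] for ckpt in checkpoints]
        length = len(params[0])
        if any(len(p) != length for p in params):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        merged[name] = [
            sum(w * p[i] for w, p in zip(norm, params))
            for i in range(length)
        ]
    return merged


# Toy values, purely illustrative.
abliterated = {"layer.weight": [0.0, 3.0]}
sft_epoch_1 = {"layer.weight": [3.0, 3.0]}
sft_epoch_2 = {"layer.weight": [6.0, 3.0]}

merged = merge_checkpoints(
    [abliterated, sft_epoch_1, sft_epoch_2],
    weights=[1.0, 1.0, 1.0],
)
```

Including the pre-SFT checkpoint in the average pulls the merged weights part of the way back toward the base model, which is one plausible reading of "heal some of the bad characteristics of the SFT phase".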