rankalign-v6-gemma-2-2b-d0.15-e2-ambigqa-all-tcs-vlo

Fine-tuned checkpoint from the rankalign project.

Training Details

| Field | Value |
|---|---|
| Base model | google/gemma-2-2b |
| Version | v6 |
| Task | ambigqa-all |
| Epoch | 2 |
| Delta | 0.15 |
| Typicality correction | self |
| Length normalization | False |
| Preference loss weight | 1 |
| NLL validator weight | 0 |
| NLL generator weight | 0 |
| Validator log-odds | True |
| Force same-x | False |
| Semi-supervised ratio | None |
| Labeled-only ratio | None |

Reproducibility

Original checkpoint name: v6-google--gemma-2-2b-delta0.15-epoch2--ambigqa-all--d2g--random--alpha1.0--tc-self--tcoracle--full-completion--vallogodds
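
The original checkpoint name appears to encode several of the training fields listed on this card (version, delta, epoch). The sketch below shows one way to recover them for a sanity check. The naming pattern is inferred from this single name, not from the rankalign code, and `parse_checkpoint_name` is a hypothetical helper, so treat it as an assumption.

```python
import re

# Checkpoint name copied verbatim from this card.
CHECKPOINT_NAME = (
    "v6-google--gemma-2-2b-delta0.15-epoch2--ambigqa-all--d2g--random"
    "--alpha1.0--tc-self--tcoracle--full-completion--vallogodds"
)

# Assumption: names look like "v<N>-...-delta<float>-epoch<int>--...".
# This pattern is inferred from the one name above, not from project code.
_NAME_PATTERN = re.compile(
    r"^(?P<version>v\d+)-.*?delta(?P<delta>\d+(?:\.\d+)?)-epoch(?P<epoch>\d+)"
)


def parse_checkpoint_name(name: str) -> dict:
    """Extract version, delta and epoch from a checkpoint name (hypothetical helper)."""
    match = _NAME_PATTERN.search(name)
    if match is None:
        raise ValueError(f"unrecognized checkpoint name: {name!r}")
    return {
        "version": match["version"],
        "delta": float(match["delta"]),
        "epoch": int(match["epoch"]),
    }


fields = parse_checkpoint_name(CHECKPOINT_NAME)
print(fields)
```

The parsed values can then be compared against the Version, Delta and Epoch fields under Training Details. The remaining name segments (for example `d2g`, `alpha1.0`, `tcoracle`) are left unparsed because this card does not define them.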

To evaluate:

python scripts/eval_by_claude.py \
    --model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-ambigqa-all-tcs-vlo \
    --task ambigqa-all \
    --split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
    --self-typicality
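
Outside the evaluation script, the checkpoint can probably be loaded like any other Gemma 2 causal language model. The following is a minimal sketch, assuming the repository holds full model weights loadable with the `transformers` library and that `torch` is installed. `load_checkpoint` and `generate_answer` are hypothetical helper names, and the prompt in the usage comment is only an illustration, not the prompt format used by rankalign.

```python
MODEL_ID = "TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-ambigqa-all-tcs-vlo"


def load_checkpoint(model_id: str = MODEL_ID):
    """Load tokenizer and model in BF16 (the tensor type listed for this checkpoint).

    Imports are local so that defining this helper does not require torch.
    Assumption: the repo contains full weights, not an adapter.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
    model.eval()
    return tokenizer, model


def generate_answer(tokenizer, model, prompt: str, max_new_tokens: int = 32) -> str:
    """Greedy-decode a continuation and return only the newly generated text."""
    import torch

    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=False
        )
    # Drop the prompt tokens so only the continuation is decoded.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Usage (downloads the weights on first call; the prompt is illustrative only):
# tokenizer, model = load_checkpoint()
# print(generate_answer(tokenizer, model, "Question: Who wrote Hamlet?\nAnswer:"))
```

Greedy decoding is used here only to keep the output deterministic. The evaluation command above remains the reference way to score the checkpoint.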

Safetensors · Model size: 3B params · Tensor type: BF16