# sft_iteration_0
This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) (trained from the local snapshot `/home/shutingw/.cache/huggingface/hub/models--meta-llama--Meta-Llama-3-8B-Instruct/snapshots/5f0b02c75b57c5855da9ae460ce51323ea669d8a`) on the `/data/user_data/shutingw/wentaos/Optima/my_datasets/arc_sft_dpo/sft/iteration_0` dataset. It achieves the following results on the evaluation set:
- Loss: nan
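
Since the base checkpoint is Meta-Llama-3-8B-Instruct, the fine-tuned weights should load through the standard `transformers` API. A minimal usage sketch; the repo ID below is a placeholder for wherever this checkpoint is actually published:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder repo ID; substitute the Hub ID or local path of this checkpoint.
model_id = "your-username/sft_iteration_0"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Llama-3-Instruct checkpoints expect the chat template for prompting.
messages = [{"role": "user", "content": "Explain why the sky is blue."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=256)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```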
## Model description
More information needed
## Intended uses & limitations
More information needed
## Training and evaluation data
More information needed
## Training procedure

### Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 1e-06
- train_batch_size: 1
- eval_batch_size: 1
- seed: 42
- distributed_type: multi-GPU
- num_devices: 8
- gradient_accumulation_steps: 2
- total_train_batch_size: 16
- total_eval_batch_size: 8
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: cosine
- lr_scheduler_warmup_ratio: 0.1
- num_epochs: 4
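
For reference, these settings map onto a `transformers` `TrainingArguments` object roughly as shown below. This is an illustrative reconstruction under the Trainer API, not the actual training script; the output directory name is assumed:

```python
from transformers import TrainingArguments

# Illustrative reconstruction of the hyperparameters listed above.
# Per-device train batch size 1 on 8 GPUs with 2 gradient-accumulation
# steps gives the total train batch size of 16 (eval: 1 x 8 = 8).
training_args = TrainingArguments(
    output_dir="sft_iteration_0",  # assumed name
    learning_rate=1e-6,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=1,
    gradient_accumulation_steps=2,
    seed=42,
    lr_scheduler_type="cosine",
    warmup_ratio=0.1,
    num_train_epochs=4,
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    eval_strategy="epoch",  # the results table reports one evaluation per epoch
)
```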
### Training results
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.386 | 0.9994 | 857 | nan |
| 0.2178 | 2.0 | 1715 | nan |
| 0.1194 | 2.9994 | 2572 | nan |
| 0.0704 | 3.9977 | 3428 | nan |
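
The validation loss is NaN at every evaluation point even though the training loss decreases steadily, which typically indicates a numerical issue in the evaluation pass (for example, overflow in reduced precision or a degenerate eval batch) rather than divergence. A minimal sketch of a `TrainerCallback` that flags NaN metrics as they appear, using the Hugging Face Trainer callback API:

```python
import math

from transformers import TrainerCallback


class NanMetricAlert(TrainerCallback):
    """Print a warning whenever an evaluation metric comes back as NaN."""

    def on_evaluate(self, args, state, control, metrics=None, **kwargs):
        for name, value in (metrics or {}).items():
            if isinstance(value, float) and math.isnan(value):
                print(f"[step {state.global_step}] metric '{name}' is NaN")
```

Passing `callbacks=[NanMetricAlert()]` to the `Trainer` surfaces the first evaluation at which the NaN appears.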
### Framework versions
- Transformers 4.43.3
- Pytorch 2.3.1
- Datasets 3.2.0
- Tokenizers 0.19.1