DPO Alpaca
updated
Setpember/Alpaca_GPT2M_DPSGD_epi_01
Updated
Setpember/Alpaca_GPT2M_DPSGD_epi_05
Updated
Setpember/Alpaca_GPT2M_DPSGD_epi_1
Updated
Setpember/Alpaca_GPT2M_DPSGD_epi_2
Updated
Setpember/Alpaca_GPT2M_DPO_props_epi_01
Text Generation
• 0.4B • Updated
• 1
Setpember/Alpaca_GPT2M_DPO_props_epi_05
Text Generation
• 0.4B • Updated
Setpember/Alpaca_GPT2M_DPO_props_epi_1
Text Generation
• 0.4B • Updated
Setpember/Alpaca_GPT2M_DPO_props_epi_2
Text Generation
• 0.4B • Updated
• 1
Setpember/Alpaca_GPT2M_DPO_RR_epi_01
Text Generation
• 0.4B • Updated
• 1
Setpember/Alpaca_GPT2M_DPO_RR_epi_05
Text Generation
• 0.4B • Updated
Setpember/Alpaca_GPT2M_DPO_RR_epi_1
Text Generation
• 0.4B • Updated
Setpember/Alpaca_GPT2M_DPO_RR_epi_2
Text Generation
• 0.4B • Updated
• 1
Setpember/Alpaca_GPT2L_DPO_props_epi_01
Text Generation
• 0.8B • Updated
Setpember/Alpaca_GPT2L_DPO_props_epi_05
Text Generation
• 0.8B • Updated
Setpember/Alpaca_GPT2L_DPO_props_epi_1
Text Generation
• 0.8B • Updated
Setpember/Alpaca_GPT2L_DPO_props_epi_2
Text Generation
• 0.8B • Updated
Setpember/Alpaca_GPT2L_DPO_RR_epi_01
Text Generation
• 0.8B • Updated
Setpember/Alpaca_GPT2L_DPO_RR_epi_05
Text Generation
• 0.8B • Updated
Setpember/Alpaca_GPT2L_DPO_RR_epi_1
Text Generation
• 0.8B • Updated
Setpember/Alpaca_GPT2L_DPO_RR_epi_2
Text Generation
• 0.8B • Updated