-
-
-
-
-
-
Inference Providers
Active filters:
RLHF
aaditya/Llama3-OpenBioLLM-8B
Text Generation
•
Updated
•
3.61k
•
•
232
aaditya/Llama3-OpenBioLLM-70B
Text Generation
•
Updated
•
3.01k
•
498
MoMonir/Llama3-OpenBioLLM-8B-GGUF
8B
•
Updated
•
69
•
5
NousResearch/Hermes-2-Theta-Llama-3-8B-GGUF
8B
•
Updated
•
499
•
90
psp-dada/Llama-3-8B-Base-SFT-Uni-DPO-v2-GPT-4
Text Generation
•
8B
•
Updated
•
10
•
1
psp-dada/Gemma2-9B-IT-Uni-DPO
Text Generation
•
9B
•
Updated
•
13
•
1
psp-dada/Llama-3-8B-Base-SFT-Uni-DPO-v2-Qwen
Text Generation
•
8B
•
Updated
•
28
•
1
psp-dada/Llama-3-8B-Base-SFT-Uni-DPO
Text Generation
•
8B
•
Updated
•
11
•
1
psp-dada/Llama-3-8B-Instruct-Uni-DPO-v2-ArmoRM
Text Generation
•
8B
•
Updated
•
26
•
1
psp-dada/Llama-3-8B-Instruct-Uni-DPO-v2-GPT-4o
Text Generation
•
8B
•
Updated
•
7
•
1
psp-dada/Qwen2.5-7B-Uni-DPO
Text Generation
•
8B
•
Updated
•
14
•
1
psp-dada/Llama-3-8B-Instruct-Uni-DPO
Text Generation
•
8B
•
Updated
•
10
•
1
psp-dada/Qwen2.5-Math-7B-Uni-DPO
Text Generation
•
8B
•
Updated
•
12
•
1
OpenAssistant/reward-model-deberta-v3-base
Text Classification
•
Updated
•
352
•
13
OpenAssistant/reward-model-electra-large-discriminator
Text Classification
•
Updated
•
11
•
5
OpenAssistant/reward-model-deberta-v3-large
Text Classification
•
Updated
•
259
•
26
OpenAssistant/reward-model-deberta-v3-large-v2
Text Classification
•
Updated
•
15.7k
•
•
244
Text Ranking
•
0.4B
•
Updated
•
4
•
3
nicholasKluge/RewardModelPT
Text Classification
•
0.1B
•
Updated
•
15
nicholasKluge/RewardModel
Text Classification
•
0.1B
•
Updated
•
53
•
1
fb700/chatglm-fitness-RLHF
Updated
•
268
fb700/Bofan-chatglm-Best-lora
Updated
•
5
•
11
kubernetes-bad/Ligma-L2-13b
Updated
•
3
•
3
Text Generation
•
Updated
•
147
•
205
berkeley-nest/Starling-LM-7B-alpha
Text Generation
•
Updated
•
2k
•
557
berkeley-nest/Starling-RM-7B-alpha
Updated
•
17
•
103
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2
Text Generation
•
Updated
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2
Text Generation
•
Updated
•
1
•
1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2
Text Generation
•
Updated
•
2
•
2