Inference Providers
Active filters: 4-bit
unsloth/DeepSeek-R1-Distill-Qwen-7B-unsloth-bnb-4bit
Text Generation
• 8B • Updated • 24.2k
• 25
Qwen/Qwen2.5-VL-3B-Instruct-AWQ
Image-Text-to-Text
• 4B • Updated • 40.9k
• 64
iFaz/llama32_3B_en_emo_2000_stp
Text Generation
• 3B • Updated • 10
• 1
mlx-community/gemma-3-text-27b-it-4bit
Text Generation
• Updated • 164
• 3
sleepdeprived3/Reformed-Christian-Bible-Expert-v2.1-12B_EXL2_4bpw_H8
Text Generation
• Updated • 13
• 1
unsloth/Qwen3-8B-unsloth-bnb-4bit
Updated • 126k
• 20
unsloth/Qwen3-8B-bnb-4bit
Updated • 38.7k
• 10
lmstudio-community/Phi-4-mini-reasoning-MLX-4bit
Text Generation
• 0.6B • Updated • 58.5k
• 4
mlx-community/Qwen3-Embedding-0.6B-4bit-DWQ
Text Generation
• Updated • 17.8k
• 9
unsloth/LFM2-700M-unsloth-bnb-4bit
Text Generation
• 0.8B • Updated • 32
• 1
steampunque/GLM-Z1-9B-0414-MP-GGUF
9B • Updated • 21
• 2
mlx-community/Qwen3-30B-A3B-Instruct-2507-4bit
Text Generation
• Updated • 1.15k
• 10
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ-BF16Mix
Text Generation
• 31B • Updated • 1.17k
• 5
lmstudio-community/Qwen3-4B-Thinking-2507-MLX-4bit
Text Generation
• 0.6B • Updated • 61.8k
• 13
mlx-community/Qwen3-Next-80B-A3B-Instruct-4bit
Text Generation
• Updated • 2.7k
• 24
LeDXIII/NuMarkdown-8B-Thinking-bnb4
8B • Updated • 21
• 1
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Text Generation
• 31B • Updated • 1.25M
• 43
SEOKDONG/gpt-oss-safeguard-20b-kor-enterprise-gptq-4bit
Text Generation
• 21B • Updated • 69
• 4
mlx-community/Trinity-Mini-4bit
Text Generation
• Updated • 47
• 2
mlx-community/Ministral-3-3B-Instruct-2512-4bit
Updated • 21.2k
• 5
MaziyarPanahi/GLM-4.6V-Flash-GGUF
Text Generation
• 9B • Updated • 80.2k
• 6
mlx-community/LFM2.5-VL-1.6B-4bit
Image-Text-to-Text
• 0.6B • Updated • 5.09k
• 3
unsloth/medgemma-1.5-4b-it-unsloth-bnb-4bit
Image-Text-to-Text
• 4B • Updated • 1.81k
• 3
steampunque/GLM-4.7-Flash-MP-GGUF
30B • Updated • 762
• 1
mlx-community/DeepSeek-OCR-2-4bit
Image-Text-to-Text
• 0.9B • Updated • 267
• 1
lmstudio-community/Qwen3-Coder-Next-MLX-4bit
80B • Updated • 224k
• 23
mlx-community/Qwen3.5-397B-A17B-nvfp4
Text Generation
• 396B • Updated • 362
• 5
saricles/Qwen3-Coder-Next-NVFP4-GB10
Text Generation
• Updated • 8.66k
• 28
mlx-community/Qwen3.5-27B-4bit
Image-Text-to-Text
• 5B • Updated • 96.9k
• 47
mlx-community/Qwen3.5-35B-A3B-4bit
Image-Text-to-Text
• 6B • Updated • 8.19k
• 37