Active filters: torchao
andysalerno/Qwen3-8B-ao-autoquant
Text Generation
• Updated • 12
andrewor14/Llama-3.1-8B-Instruct-float8dq
Text Generation
• Updated • 7
GingerBled/DPO-Quantized_8bit_mock1
Text Generation
• Updated • 7
GingerBled/DPO-Quantized_8bit_mock2
Text Generation
• Updated • 5
sajal09/MNLP_M2_quantized_model2
Text Generation
• Updated • 5
Erland/softpick-1.8B-4096-model-AO-W4A4
Text Generation
• Updated • 5
Erland/softpick-1.8B-4096-model-AO-W4
Text Generation
• Updated • 5
Erland/vanilla-1.8B-4096-model-AO-W4A4
Text Generation
• Updated • 5
Erland/vanilla-1.8B-4096-model-AO-W4
Text Generation
• Updated • 6
Cloudmaster/Llama-3.2-3B-torchao
Text Generation
• Updated • 6
Jiqing/cuda_torchao_llama_68m
Cloudmaster/Llama-3.2-3B-torchao-int4
Text Generation
• Updated • 6
Cloudmaster/Llama-3.2-3B-torchao-int4-t4
Text Generation
• Updated • 6
Text Generation
• Updated • 5
Cloudmaster/Llama-3.2-3B-torchao-autoquant
Text Generation
• Updated • 6
Cloudmaster/Llama-3.2-3B-torchao-I8WI8A-attn
Text Generation
• Updated • 6
Cloudmaster/Llama-3.2-3B-torchao-final
Text Generation
• Updated • 6
oskdabk/first_quantized_model
Text Generation
• Updated • 5
Text Generation
• Updated • 5
Cloudmaster/Llama-3.2-3B-torchao-final00
Text Generation
• Updated • 8
Cloudmaster/Llama-3.2-3B-torchao-final-woclass
Text Generation
• Updated • 7
Cloudmaster/Llama-3.2-3B-torchao-final-wattn
Text Generation
• Updated • 7
• 1
Cloudmaster/Llama-3.2-3B-torchao-final01
Text Generation
• Updated • 7
Cloudmaster/Llama-3.2-3B-torchao-final02
Text Generation
• Updated • 6
Text Generation
• Updated • 5
Cloudmaster/Llama-3.2-3B-torchao-final03
Text Generation
• Updated • 7
Text Generation
• Updated • 2.13k