XuebinWang
AI & ML interests
None yet
Recent Activity
new activity 19 days ago
amd/gpt-oss-20b-WFP8-AFP8-KVFP8: Amd R9700 - vLLM crashes on startup
published a model about 1 month ago
amd/gpt-oss-20b-MoE-Quant-W-MXFP4-A-FP8-KV-FP8
published a model about 1 month ago
amd/gpt-oss-20b-WFP8-AFP8-KVFP8
Organizations
Amd R9700 - vLLM crashes on startup
2
#6 opened 21 days ago by jmander11
update readme
#5 opened about 1 month ago by XuebinWang
Update models and readme with accuracy number and disclaimer
#7 opened about 1 month ago by XuebinWang
update readme with disclaimer
#4 opened about 1 month ago by XuebinWang
update readme
#3 opened about 2 months ago by XuebinWang
update README (results etc) and upload LICENSE and USAGE_POLICY
#2 opened about 2 months ago by XuebinWang
Update README and upload original files
#6 opened 3 months ago by XuebinWang
Use self_attn in config.json
#5 opened 3 months ago by XuebinWang
KV cache quantization in FP8
#1 opened 4 months ago by XuebinWang
Change to FP8 customized attention quantization, and update README
#4 opened 4 months ago by XuebinWang
update model with several fixings
#5 opened 4 months ago by XuebinWang
Update README (NOT ready to use)
#3 opened 4 months ago by XuebinWang
Update README
#2 opened 4 months ago by XuebinWang
Initial commit to be used in vllm PR#27334
#1 opened 4 months ago by XuebinWang
Initial commit to be used in vllm PR#27334
#1 opened 4 months ago by XuebinWang
update readme and upload LICENSE file
#2 opened 5 months ago by XuebinWang
copy needed files from original meta-llama
#4 opened 5 months ago by XuebinWang
upload readme file
#3 opened 5 months ago by XuebinWang
upload readme file
#3 opened 5 months ago by XuebinWang
upload readme file
#4 opened 5 months ago by XuebinWang