XuebinWang's picture

XuebinWang

XuebinWang

·

AI & ML interests

None yet

Recent Activity

new activity 19 days ago

amd/gpt-oss-20b-WFP8-AFP8-KVFP8:Amd R9700 - vLLM crashes on startup

published a model about 1 month ago

amd/gpt-oss-20b-MoE-Quant-W-MXFP4-A-FP8-KV-FP8

published a model about 1 month ago

amd/gpt-oss-20b-WFP8-AFP8-KVFP8

View all activity

Organizations

New activity in amd/gpt-oss-20b-WFP8-AFP8-KVFP8 19 days ago

Amd R9700 - vLLM crashes on startup

#6 opened 21 days ago by

New activity in amd/gpt-oss-20b-WFP8-AFP8-KVFP8 about 1 month ago

update readme

#5 opened about 1 month ago by

New activity in amd/gpt-oss-20b-MoE-Quant-W-MXFP4-A-FP8-KV-FP8 about 1 month ago

Update models and readme with accuracy number and disclaimer

#7 opened about 1 month ago by

New activity in amd/gpt-oss-20b-WFP8-AFP8-KVFP8 about 1 month ago

update readme with disclaimer

#4 opened about 1 month ago by

New activity in amd/gpt-oss-20b-WFP8-AFP8-KVFP8 about 2 months ago

update readme

#3 opened about 2 months ago by

update README (results etc) and upload LICENSE and USAGE_POLICY

#2 opened about 2 months ago by

New activity in amd/gpt-oss-20b-MoE-Quant-W-MXFP4-A-FP8-KV-FP8 3 months ago

Update README and upload original files

#6 opened 3 months ago by

Use self_attn in config.json

#5 opened 3 months ago by

New activity in amd/gpt-oss-20b-WFP8-AFP8-KVFP8 4 months ago

KV cache quantization in FP8

#1 opened 4 months ago by

New activity in amd/gpt-oss-20b-MoE-Quant-W-MXFP4-A-FP8-KV-FP8 4 months ago

Change to FP8 customized attention quantization, and update README

#4 opened 4 months ago by

New activity in amd/Mixtral-8x7B-Instruct-v0.1-WMXFP4FP8-AMXFP4FP8-AMP-KVFP8 4 months ago

update model with several fixings

#5 opened 4 months ago by

New activity in amd/gpt-oss-20b-MoE-Quant-W-MXFP4-A-FP8-KV-FP8 4 months ago

Update README (NOT ready to use)

#3 opened 4 months ago by

Update README

#2 opened 4 months ago by

Initial commit to be used in vllm PR#27334

#1 opened 4 months ago by

Initial commit to be used in vllm PR#27334

#1 opened 4 months ago by

New activity in amd/Qwen3-8B-WMXFP4FP8-AMXFP4FP8-AMP-KVFP8 5 months ago

update readme and upload LICENSE file

#2 opened 5 months ago by

New activity in amd/Llama-2-70b-chat-hf-WMXFP4FP8-AMXFP4FP8-AMP-KVFP8 5 months ago

copy needed files from original meta-llama

#4 opened 5 months ago by

upload readme file

#3 opened 5 months ago by

upload readme file

#3 opened 5 months ago by

New activity in amd/Mixtral-8x7B-Instruct-v0.1-WMXFP4FP8-AMXFP4FP8-AMP-KVFP8 5 months ago

upload readme file

#4 opened 5 months ago by