Peter
rtzurtz
AI & ML interests
None yet
Recent Activity
new activity about 21 hours ago
Qwen/Qwen3.6-27B:Dense 60-120B new activity about 21 hours ago
Qwen/Qwen3.6-27B:delete new activity 21 days ago
unsloth/Qwen3.6-27B-MTP-GGUF:Report: 56 t/s on RTX 4090D (48GB VRAM) with UD-Q6_K_XLOrganizations
None yet
Dense 60-120B
#44 opened about 21 hours ago
by
rtzurtz
Report: 56 t/s on RTX 4090D (48GB VRAM) with UD-Q6_K_XL
3
#25 opened about 1 month ago
by
SlavikF
40B to 80B dense LLM: A better/smarter model than 122B that can still fit
1
#16 opened 3 months ago
by
rtzurtz
Thank you team Qwen for a 120B LLM
❤️ 3
2
#3 opened 4 months ago
by
rtzurtz
27B GGUF quants benchmark?
👀 1
1
#25 opened 3 months ago
by
rtzurtz
when new updated version waiting
👀👍 3
1
#22 opened 3 months ago
by
gopi87
Smaller GGUF without the vision weights?
1
#19 opened 4 months ago
by
rtzurtz
How much Vram needed for the full context length?
6
#31 opened 9 months ago
by
Aly87
Suggesting an open-weight Gpt-Oss LLM between the 20B and 120B parameters
#156 opened 7 months ago
by
rtzurtz
230B vs 235B: Why no comparison against Qwen3-235B-A22B-Thinking-2507 ?
🤝👍 2
7
#20 opened 8 months ago
by
rtzurtz
Are the F16 weights upcasted MXFP4? -- Why no `gpt-oss-20b-MXFP4.gguf`?
3
#34 opened 8 months ago
by
rtzurtz
Q3_K_M (112 GB) is bigger than Q3_K_XL (104 GB)?
5
#8 opened 9 months ago
by
rtzurtz
Any quants between Q8_K_XL and BF16?
1
#16 opened 11 months ago
by
rtzurtz
Good idea to remove the hybrid thinking mode
👍 4
1
#16 opened 11 months ago
by
rtzurtz
A _Q8_K_XL quant?
#4 opened 11 months ago
by
rtzurtz
MoE version with the same performance as this 32B dense
#37 opened 11 months ago
by
rtzurtz
First evaluation suggest only 14B (dense) performance?
👀 5
#33 opened 12 months ago
by
rtzurtz
Qwen is loosing broad knowledge since Qwen2.
🔥👍 11
16
#16 opened about 1 year ago
by
phil111