mradermacher committed on
Commit 6121ec4 · verified · 1 Parent(s): f54e971

auto-patch README.md

  1. README.md +1 -2
README.md CHANGED
@@ -37,7 +37,7 @@ more details, including on how to concatenate multi-part files.
  | [GGUF](https://huggingface.co/mradermacher/wizardllama-7b-GGUF/resolve/main/wizardllama-7b.Q3_K_L.gguf) | Q3_K_L | 3.9 | |
  | [GGUF](https://huggingface.co/mradermacher/wizardllama-7b-GGUF/resolve/main/wizardllama-7b.IQ4_XS.gguf) | IQ4_XS | 3.9 | |
  | [GGUF](https://huggingface.co/mradermacher/wizardllama-7b-GGUF/resolve/main/wizardllama-7b.Q4_0.gguf) | Q4_0 | 4.1 | fast, low quality |
- | [GGUF](https://huggingface.co/mradermacher/wizardllama-7b-GGUF/resolve/main/wizardllama-7b.IQ4_NL.gguf) | IQ4_NL | 4.1 | slightly worse than Q4_K_S |
+ | [GGUF](https://huggingface.co/mradermacher/wizardllama-7b-GGUF/resolve/main/wizardllama-7b.IQ4_NL.gguf) | IQ4_NL | 4.1 | prefer IQ4_XS |
  | [GGUF](https://huggingface.co/mradermacher/wizardllama-7b-GGUF/resolve/main/wizardllama-7b.Q4_K_S.gguf) | Q4_K_S | 4.1 | fast, recommended |
  | [GGUF](https://huggingface.co/mradermacher/wizardllama-7b-GGUF/resolve/main/wizardllama-7b.Q4_K_M.gguf) | Q4_K_M | 4.3 | fast, recommended |
  | [GGUF](https://huggingface.co/mradermacher/wizardllama-7b-GGUF/resolve/main/wizardllama-7b.Q5_K_S.gguf) | Q5_K_S | 4.9 | |
@@ -45,7 +45,6 @@ more details, including on how to concatenate multi-part files.
  | [GGUF](https://huggingface.co/mradermacher/wizardllama-7b-GGUF/resolve/main/wizardllama-7b.Q6_K.gguf) | Q6_K | 5.8 | very good quality |
  | [GGUF](https://huggingface.co/mradermacher/wizardllama-7b-GGUF/resolve/main/wizardllama-7b.Q8_0.gguf) | Q8_0 | 7.4 | fast, best quality |
 
-
  Here is a handy graph by ikawrakow comparing some lower-quality quant
  types (lower is better):
 