willopcbeta
/

unsloth-whisper-small-ONNX

Automatic Speech Recognition

Transformers.js

hf-asr-leaderboard

Eval Results (legacy)

Model card Files Files and versions

willopcbeta commited on Apr 14

Commit

5283657

·

verified ·

1 Parent(s): fbff46a

Update README.md

Files changed (1) hide show

README.md +11 -0

README.md CHANGED Viewed

@@ -178,7 +178,18 @@ library_name: transformers.js
 # whisper-small (ONNX)
 This is an ONNX version of [unsloth/whisper-small](https://huggingface.co/unsloth/whisper-small). It was automatically converted and uploaded using [this Hugging Face Space](https://huggingface.co/spaces/onnx-community/convert-to-onnx).

 # whisper-small (ONNX)
+Based on unsloth-whisper-small, the following configuration can be used for conversion:
+## Do not use the v4 version to convert Whisper; it will not work at all in the Q4 series.
+```
+quantization: {
+    encoder_model: 'q4f16',
+    decoder_model_merged: 'q4',
+},
+```
+This combination helps reduce memory usage during speech recognition. It also addresses the issue where using q4f16 with onnx-community/whisper-small or Xenova/whisper-small may result in garbled output and malfunctioning systems.
 This is an ONNX version of [unsloth/whisper-small](https://huggingface.co/unsloth/whisper-small). It was automatically converted and uploaded using [this Hugging Face Space](https://huggingface.co/spaces/onnx-community/convert-to-onnx).