File size: 10,123 Bytes
327ab4e 03dc79a 327ab4e 5c1c651 327ab4e 5c1c651 d48e459 5c1c651 d48e459 327ab4e 05b98d5 327ab4e | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 | ---
library_name: pytorch
license: other
tags:
- real_time
- android
pipeline_tag: image-segmentation
---

# BiseNet: Optimized for Qualcomm Devices
BiSeNet (Bilateral Segmentation Network) is a novel architecture designed for real-time semantic segmentation. It addresses the challenge of balancing spatial resolution and receptive field by employing a Spatial Path to preserve high-resolution features and a context path to capture sufficient receptive field.
This is based on the implementation of BiseNet found [here](https://github.com/ooooverflow/BiSeNet).
This repository contains pre-exported model files optimized for Qualcomm® devices. You can use the [Qualcomm® AI Hub Models](https://github.com/quic/ai-hub-models/blob/main/qai_hub_models/models/bisenet) library to export with custom configurations. More details on model performance across various devices, can be found [here](#performance-summary).
Qualcomm AI Hub Models uses [Qualcomm AI Hub Workbench](https://workbench.aihub.qualcomm.com) to compile, profile, and evaluate this model. [Sign up](https://myaccount.qualcomm.com/signup) to run these models on a hosted Qualcomm® device.
## Getting Started
There are two ways to deploy this model on your device:
### Option 1: Download Pre-Exported Models
Below are pre-exported model assets ready for deployment.
| Runtime | Precision | Chipset | SDK Versions | Download |
|---|---|---|---|---|
| ONNX | float | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/bisenet/releases/v0.47.0/bisenet-onnx-float.zip)
| ONNX | w8a8 | Universal | QAIRT 2.42, ONNX Runtime 1.24.1 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/bisenet/releases/v0.47.0/bisenet-onnx-w8a8.zip)
| QNN_DLC | float | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/bisenet/releases/v0.47.0/bisenet-qnn_dlc-float.zip)
| QNN_DLC | w8a8 | Universal | QAIRT 2.43 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/bisenet/releases/v0.47.0/bisenet-qnn_dlc-w8a8.zip)
| TFLITE | float | Universal | QAIRT 2.43, TFLite 2.17.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/bisenet/releases/v0.47.0/bisenet-tflite-float.zip)
| TFLITE | w8a8 | Universal | QAIRT 2.43, TFLite 2.17.0 | [Download](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/models/bisenet/releases/v0.47.0/bisenet-tflite-w8a8.zip)
For more device-specific assets and performance metrics, visit **[BiseNet on Qualcomm® AI Hub](https://aihub.qualcomm.com/models/bisenet)**.
### Option 2: Export with Custom Configurations
Use the [Qualcomm® AI Hub Models](https://github.com/quic/ai-hub-models/blob/main/qai_hub_models/models/bisenet) Python library to compile and export the model with your own:
- Custom weights (e.g., fine-tuned checkpoints)
- Custom input shapes
- Target device and runtime configurations
This option is ideal if you need to customize the model beyond the default configuration provided here.
See our repository for [BiseNet on GitHub](https://github.com/quic/ai-hub-models/blob/main/qai_hub_models/models/bisenet) for usage instructions.
## Model Details
**Model Type:** Model_use_case.semantic_segmentation
**Model Stats:**
- Model checkpoint: best_dice_loss_miou_0.655.pth
- Inference latency: RealTime
- Input resolution: 720x960
- Number of parameters: 12.0M
- Model size (float): 45.7 MB
## Performance Summary
| Model | Runtime | Precision | Chipset | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit
|---|---|---|---|---|---|---
| BiseNet | ONNX | float | Snapdragon® X Elite | 30.848 ms | 66 - 66 MB | NPU
| BiseNet | ONNX | float | Snapdragon® 8 Gen 3 Mobile | 25.504 ms | 73 - 318 MB | NPU
| BiseNet | ONNX | float | Qualcomm® QCS8550 (Proxy) | 32.063 ms | 71 - 79 MB | NPU
| BiseNet | ONNX | float | Qualcomm® QCS9075 | 49.312 ms | 8 - 11 MB | NPU
| BiseNet | ONNX | float | Snapdragon® 8 Elite For Galaxy Mobile | 18.153 ms | 65 - 251 MB | NPU
| BiseNet | ONNX | float | Snapdragon® 8 Elite Gen 5 Mobile | 15.113 ms | 56 - 255 MB | NPU
| BiseNet | ONNX | float | Snapdragon® X2 Elite | 15.48 ms | 64 - 64 MB | NPU
| BiseNet | ONNX | w8a8 | Snapdragon® X Elite | 8.661 ms | 19 - 19 MB | NPU
| BiseNet | ONNX | w8a8 | Snapdragon® 8 Gen 3 Mobile | 5.979 ms | 18 - 257 MB | NPU
| BiseNet | ONNX | w8a8 | Qualcomm® QCS6490 | 239.497 ms | 224 - 236 MB | CPU
| BiseNet | ONNX | w8a8 | Qualcomm® QCS8550 (Proxy) | 8.315 ms | 16 - 22 MB | NPU
| BiseNet | ONNX | w8a8 | Qualcomm® QCS9075 | 10.171 ms | 18 - 21 MB | NPU
| BiseNet | ONNX | w8a8 | Qualcomm® QCM6690 | 229.436 ms | 230 - 238 MB | CPU
| BiseNet | ONNX | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 4.752 ms | 18 - 218 MB | NPU
| BiseNet | ONNX | w8a8 | Snapdragon® 7 Gen 4 Mobile | 216.926 ms | 250 - 258 MB | CPU
| BiseNet | ONNX | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 3.811 ms | 18 - 219 MB | NPU
| BiseNet | ONNX | w8a8 | Snapdragon® X2 Elite | 3.761 ms | 17 - 17 MB | NPU
| BiseNet | QNN_DLC | float | Snapdragon® X Elite | 28.586 ms | 8 - 8 MB | NPU
| BiseNet | QNN_DLC | float | Snapdragon® 8 Gen 3 Mobile | 20.622 ms | 8 - 285 MB | NPU
| BiseNet | QNN_DLC | float | Qualcomm® QCS8275 (Proxy) | 107.61 ms | 2 - 190 MB | NPU
| BiseNet | QNN_DLC | float | Qualcomm® QCS8550 (Proxy) | 28.399 ms | 8 - 10 MB | NPU
| BiseNet | QNN_DLC | float | Qualcomm® SA8775P | 38.899 ms | 1 - 189 MB | NPU
| BiseNet | QNN_DLC | float | Qualcomm® QCS9075 | 55.932 ms | 8 - 49 MB | NPU
| BiseNet | QNN_DLC | float | Qualcomm® QCS8450 (Proxy) | 59.788 ms | 8 - 277 MB | NPU
| BiseNet | QNN_DLC | float | Qualcomm® SA7255P | 107.61 ms | 2 - 190 MB | NPU
| BiseNet | QNN_DLC | float | Qualcomm® SA8295P | 44.235 ms | 0 - 212 MB | NPU
| BiseNet | QNN_DLC | float | Snapdragon® 8 Elite For Galaxy Mobile | 15.769 ms | 0 - 222 MB | NPU
| BiseNet | QNN_DLC | float | Snapdragon® 8 Elite Gen 5 Mobile | 12.716 ms | 8 - 286 MB | NPU
| BiseNet | QNN_DLC | float | Snapdragon® X2 Elite | 14.51 ms | 8 - 8 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Snapdragon® X Elite | 10.129 ms | 2 - 2 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Snapdragon® 8 Gen 3 Mobile | 6.686 ms | 2 - 231 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Qualcomm® QCS6490 | 40.354 ms | 1 - 13 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Qualcomm® QCS8275 (Proxy) | 20.012 ms | 2 - 182 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Qualcomm® QCS8550 (Proxy) | 9.516 ms | 2 - 4 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Qualcomm® SA8775P | 10.243 ms | 2 - 183 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Qualcomm® QCS9075 | 13.131 ms | 1 - 12 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Qualcomm® QCM6690 | 89.126 ms | 2 - 206 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Qualcomm® QCS8450 (Proxy) | 16.144 ms | 2 - 231 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Qualcomm® SA7255P | 20.012 ms | 2 - 182 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Qualcomm® SA8295P | 12.698 ms | 2 - 185 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 5.175 ms | 2 - 191 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Snapdragon® 7 Gen 4 Mobile | 13.349 ms | 2 - 201 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 4.297 ms | 2 - 193 MB | NPU
| BiseNet | QNN_DLC | w8a8 | Snapdragon® X2 Elite | 4.918 ms | 2 - 2 MB | NPU
| BiseNet | TFLITE | float | Snapdragon® 8 Gen 3 Mobile | 20.464 ms | 31 - 288 MB | NPU
| BiseNet | TFLITE | float | Qualcomm® QCS8275 (Proxy) | 105.803 ms | 32 - 247 MB | NPU
| BiseNet | TFLITE | float | Qualcomm® QCS8550 (Proxy) | 28.692 ms | 32 - 34 MB | NPU
| BiseNet | TFLITE | float | Qualcomm® SA8775P | 37.637 ms | 32 - 247 MB | NPU
| BiseNet | TFLITE | float | Qualcomm® QCS9075 | 55.884 ms | 0 - 66 MB | NPU
| BiseNet | TFLITE | float | Qualcomm® QCS8450 (Proxy) | 61.614 ms | 32 - 306 MB | NPU
| BiseNet | TFLITE | float | Qualcomm® SA7255P | 105.803 ms | 32 - 247 MB | NPU
| BiseNet | TFLITE | float | Qualcomm® SA8295P | 44.076 ms | 32 - 247 MB | NPU
| BiseNet | TFLITE | float | Snapdragon® 8 Elite For Galaxy Mobile | 15.878 ms | 30 - 253 MB | NPU
| BiseNet | TFLITE | float | Snapdragon® 8 Elite Gen 5 Mobile | 12.63 ms | 30 - 309 MB | NPU
| BiseNet | TFLITE | w8a8 | Snapdragon® 8 Gen 3 Mobile | 8.825 ms | 6 - 237 MB | NPU
| BiseNet | TFLITE | w8a8 | Qualcomm® QCS6490 | 47.441 ms | 7 - 31 MB | NPU
| BiseNet | TFLITE | w8a8 | Qualcomm® QCS8275 (Proxy) | 20.873 ms | 8 - 190 MB | NPU
| BiseNet | TFLITE | w8a8 | Qualcomm® QCS8550 (Proxy) | 11.978 ms | 8 - 205 MB | NPU
| BiseNet | TFLITE | w8a8 | Qualcomm® SA8775P | 12.861 ms | 8 - 191 MB | NPU
| BiseNet | TFLITE | w8a8 | Qualcomm® QCS9075 | 13.153 ms | 4 - 28 MB | NPU
| BiseNet | TFLITE | w8a8 | Qualcomm® QCM6690 | 92.741 ms | 7 - 211 MB | NPU
| BiseNet | TFLITE | w8a8 | Qualcomm® QCS8450 (Proxy) | 16.253 ms | 8 - 238 MB | NPU
| BiseNet | TFLITE | w8a8 | Qualcomm® SA7255P | 20.873 ms | 8 - 190 MB | NPU
| BiseNet | TFLITE | w8a8 | Qualcomm® SA8295P | 15.329 ms | 8 - 193 MB | NPU
| BiseNet | TFLITE | w8a8 | Snapdragon® 8 Elite For Galaxy Mobile | 6.742 ms | 6 - 196 MB | NPU
| BiseNet | TFLITE | w8a8 | Snapdragon® 7 Gen 4 Mobile | 15.908 ms | 0 - 199 MB | NPU
| BiseNet | TFLITE | w8a8 | Snapdragon® 8 Elite Gen 5 Mobile | 5.519 ms | 6 - 200 MB | NPU
## License
* The license for the original implementation of BiseNet can be found
[here](https://github.com/ooooverflow/BiSeNet/pull/45/files).
## References
* [BiSeNet Bilateral Segmentation Network for Real-time Semantic Segmentation](https://arxiv.org/abs/1808.00897)
* [Source Model Implementation](https://github.com/ooooverflow/BiSeNet)
## Community
* Join [our AI Hub Slack community](https://aihub.qualcomm.com/community/slack) to collaborate, post questions and learn more about on-device AI.
* For questions or feedback please [reach out to us](mailto:ai-hub-support@qti.qualcomm.com).
|