Export with int8 quantization gives 0 at the output

Hello,

I am trying to implement a YOLO model. I have customized the yolo26.yaml file to my needs: some of the operations are not supported by the TFLM library, so I changed some of the conv blocks used. However, whenever I export it to TFLite with int8 quantization using the export function, the detection output is always 0. When I test the model as .pt or as the float32 TFLite file, it performs as it should.

Could you please enlighten me on what could be the problem?

Thank you

end2end models don’t work with static quantization. You can only use the dynamically quantized file, which ends with _int8.tflite.

Thank you for your reply.

However, I already disabled end2end in the .yaml file. Wouldn’t this overcome the problem?

Is the output 0 with _int8.tflite file?

If so, there’s something incompatible with your custom layers, because the default non-end2end YOLO models work fine.

No, the output of the _int8.tflite file is normal. But I cannot use dynamic quantization, because my main goal is to deploy the model on an NPU-equipped MCU. Dynamic quantization keeps the input and output in float32, and my converter does not accept that.

Is there any way to obtain a working model with static quantization? For example, should I use YOLOv8 or YOLOv5 instead?

Static quantization works with YOLOv8 and YOLO11, and also with YOLO26 if you export with end2end=False.
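A minimal sketch of that export, assuming the Ultralytics Python API; "yolo26n.pt" and "coco8.yaml" are placeholder names for your own checkpoint and calibration dataset, and passing end2end through export() is an assumption based on this thread:

```python
# Hedged sketch: full static int8 TFLite export via the Ultralytics
# Python API. Checkpoint and dataset names below are placeholders.
export_args = {
    "format": "tflite",    # TFLite export target
    "int8": True,          # static int8 quantization (needs calibration data)
    "data": "coco8.yaml",  # dataset used to calibrate activation ranges
    "end2end": False,      # keep the classic detection head for YOLO26
}

def export_static_int8(weights="yolo26n.pt"):
    from ultralytics import YOLO  # lazy import; requires ultralytics installed
    model = YOLO(weights)
    # Alongside the dynamic _int8.tflite file, this should also write the
    # fully quantized *_full_integer_quant.tflite file for NPU deployment.
    return model.export(**export_args)
```

If your Ultralytics version rejects the end2end argument for non-YOLO26 models, drop it and use a YOLOv8 or YOLO11 checkpoint instead.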

Could you please tell me how to do it?

I tried with end2end=False and also with YOLOv8, but I still get 0 at the output of the model when I run the fully quantized file.

Thank you for your time.

Are you using the latest Ultralytics?

Yes, I believe so.

It works fine for me with the latest Ultralytics:

image 1/2 /ultralytics/ultralytics/assets/bus.jpg: 640x640 4 persons, 1 bus, 22.2ms
image 2/2 /ultralytics/ultralytics/assets/zidane.jpg: 640x640 3 persons, 21.0ms
Speed: 11.3ms preprocess, 21.6ms inference, 1.0ms postprocess per image at shape (1, 3, 640, 640)