Exporting to tflite with int8 quantization for edge deployment

I’m trying to export a YOLO11 model to tflite with int8 quantization to run on an i.MX board. However, the yolo11n_int8.tflite file produced is still in fp32, and while yolo11n_full_integer_quant.tflite is quantized, the output being in int8 means it doesn’t have the precision required to express bounding boxes and confidence values. I believe the backbone and neck of the model should be quantized to int8 and the detection head left in fp32, but I don’t see how to do this.

What is the intended way to extract predictions from quantized tflite models?

However the yolo11n_int8.tflite file produced is still in fp32

yolo11n_int8.tflite uses dynamic range quantization. It’s not in FP32. In dynamic range quantization, the weights are converted to INT8, but mainly for storage benefits. During inference, the weights are dequantized back to FP32 and inference runs at FP32 precision.
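If you want to verify this yourself, here’s a minimal sketch (the file path is an assumption based on where the export usually lands) that inspects the tensor types in the dynamic-range model:

```python
# Sketch: inspect a dynamic-range quantized TFLite file to see that the
# weight tensors are stored as INT8 while the input/output stay FP32.
# The model path below is an assumption; adjust it to your export folder.
import tensorflow as tf

interpreter = tf.lite.Interpreter(model_path="yolo11n_saved_model/yolo11n_int8.tflite")
interpreter.allocate_tensors()

# Input and output are still float32 in a dynamic-range model.
print("input dtype: ", interpreter.get_input_details()[0]["dtype"])
print("output dtype:", interpreter.get_output_details()[0]["dtype"])

# Count how many tensors are stored as int8 (the quantized weights).
dtypes = [t["dtype"].__name__ for t in interpreter.get_tensor_details()]
print({d: dtypes.count(d) for d in set(dtypes)})
```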

yolo11n_full_integer_quant.tflite is quantized, however the output being in int8 means it doesn’t have the precision required to express bounding boxes and confidence values

yolo11n_full_integer_quant.tflite requires you to scale the input and output manually, using the scale and zero-point stored in the model, so the INT8 output can be converted back into FP32 boxes and confidences.
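For example, a minimal sketch of the manual scaling with the plain TFLite interpreter (the model path, input size, and the random stand-in image are assumptions; the quantization parameters come from the model itself):

```python
# Sketch: run the full-integer model with manual input quantization and
# output dequantization, using the scale/zero-point stored in the model.
import numpy as np
import tensorflow as tf

interpreter = tf.lite.Interpreter(
    model_path="yolo11n_saved_model/yolo11n_full_integer_quant.tflite"
)
interpreter.allocate_tensors()
inp = interpreter.get_input_details()[0]
out = interpreter.get_output_details()[0]

# Preprocess as for the FP32 model: 0-1 range, NHWC layout. A random array
# stands in for a real preprocessed frame here (assumption).
x = np.random.rand(1, 640, 640, 3).astype(np.float32)

# Quantize the FP32 input into the INT8 domain the model expects.
scale, zero_point = inp["quantization"]
x_q = np.clip(np.round(x / scale + zero_point), -128, 127).astype(inp["dtype"])

interpreter.set_tensor(inp["index"], x_q)
interpreter.invoke()

# Dequantize the INT8 output back to FP32 boxes/scores.
y_q = interpreter.get_tensor(out["index"])
scale, zero_point = out["quantization"]
y = (y_q.astype(np.float32) - zero_point) * scale
print(y.shape, y.dtype)
```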

Both of these models can be loaded and run in Ultralytics, and they work fine. There’s nothing wrong with the exported models.
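For example, a quick sketch of running one of the exported files through Ultralytics, which handles the quantize/dequantize steps internally (the model path and test image are assumptions):

```python
# Sketch: let Ultralytics handle the input/output scaling for you.
# Point the path at your exported file and use your own test image.
from ultralytics import YOLO

model = YOLO("yolo11n_saved_model/yolo11n_full_integer_quant.tflite")
results = model("bus.jpg", imgsz=640)
print(results[0].boxes)
```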

If you want FP32 input and output, there’s another generated file, yolo11n_integer_quant.tflite, which has FP32 input and output.