I would like to quantize my custom-trained model

I want to quantize my custom-trained YOLO model. I have already trained it, and I am deploying it to an NVIDIA Jetson Nano; after testing the trained model I ran into significant latency and optimization problems, so now I want to quantize it. If anyone can guide me on how to quantize my already-trained model, that would be great. I have tried converting it to ONNX, but whenever I export to ONNX I lose my class names. I would also like to know how to run the model on a video after quantizing or pruning it. Thanks in advance.

Most importantly, I am using an OBB model.

Ultralytics supports TensorRT export with int8 quantization.
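Here is a minimal sketch of that export, assuming your trained weights are at `best.pt` and your dataset YAML is at `data.yaml` (both paths are placeholders — substitute your own). INT8 calibration reads images from the dataset described in the YAML, and the resulting engine is device-specific, so run the export on the Jetson itself:

```python
from pathlib import Path


def export_int8_engine(weights: str = "best.pt", data_yaml: str = "data.yaml") -> str:
    """Export trained Ultralytics weights to a TensorRT INT8 engine.

    `weights` and `data_yaml` are placeholder paths; INT8 calibration
    samples images from the dataset described in `data_yaml`.
    """
    # Deferred import so the sketch can be read without ultralytics installed.
    from ultralytics import YOLO

    model = YOLO(weights)
    # Produces e.g. "best.engine" next to the weights. TensorRT engines
    # are built for the specific GPU, so do this on the Jetson Nano.
    engine_path = model.export(format="engine", int8=True, data=data_yaml)
    return engine_path


if __name__ == "__main__" and Path("best.pt").exists():
    print(export_int8_engine())
```

Note that INT8 needs a calibration dataset, which is why `data` is passed; without it you can still export with `half=True` for FP16, which is often enough of a speedup on a Jetson.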

You can check the docs.
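For the other two parts of the question — running the engine on a video and not losing class names — here is a hedged sketch. The idea is to persist `model.names` (the class-index to name mapping on the PyTorch model) to a JSON file before exporting, then reload it at inference time; `best.engine`, `input.mp4`, and `names.json` are placeholder names, and the `.obb` result access assumes an OBB model:

```python
import json
from pathlib import Path


def save_names(names: dict, path: str = "names.json") -> None:
    """Persist the class-index -> name mapping (model.names) so it
    survives export formats that may drop it."""
    Path(path).write_text(json.dumps({int(k): v for k, v in names.items()}))


def load_names(path: str = "names.json") -> dict:
    """Reload the mapping with integer keys (JSON stores keys as strings)."""
    return {int(k): v for k, v in json.loads(Path(path).read_text()).items()}


def run_on_video(engine: str = "best.engine", video: str = "input.mp4",
                 names_path: str = "names.json") -> None:
    """Run the quantized engine frame by frame over a video (placeholder paths)."""
    from ultralytics import YOLO

    names = load_names(names_path)
    model = YOLO(engine, task="obb")  # tell Ultralytics this is an OBB model
    # stream=True yields one result per frame; save=True writes an annotated video.
    for result in model.predict(source=video, stream=True, save=True):
        if result.obb is not None:
            for cls_idx in result.obb.cls.int().tolist():
                print(names[cls_idx])


if __name__ == "__main__" and Path("best.engine").exists():
    run_on_video()
```

Before exporting, you would call `save_names(YOLO("best.pt").names)` once; afterwards the JSON file travels with the engine, so the class labels are always recoverable regardless of the export format.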
