Actual Behavior:
Only the first image is segmented. The other 3 images have 0 detections / segmentations. This also occurs in my own TensorRT inference routines.
If I export without nms, then the same predict CLI command will segment all 4 images.
Hm, that’s odd. Where you specify images, is that a list of images or a directory? Have you tested inference with a model exported with nms=True but without batching? Just curious, to better understand where exactly the problem is.
It’s because with nms=True, you also have to specify the maximum batch size during export. By default, that’s 1, so during inference it will only return predictions for the first image.
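For example (batch=16 here is just an illustrative value; set it to the largest batch you expect at inference time):
yolo export model=yolo11-boxseg.pt format=onnx dynamic=True batch=16 nms=True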
Thank you. That partially fixes the issue, but I believe there is still something odd going on with the CLI prediction side.
If I export using: yolo export model=yolo11-boxseg.pt format='onnx' dynamic=True batch=12 nms=True
then this command works as expected: yolo predict task=segment model=yolo11-boxseg.onnx source=images batch=4
but this command, with batch omitted, returns 0 detections on any of the images: yolo predict task=segment model=yolo11-boxseg.onnx source=images
When I first exported a dynamic model yesterday with the nms and batch parameters set, I was using the CLI to test predict on the ONNX model, and I could swear I had a case where there were detections only on the first image and not the others. In fact, I still have the run directory for that case, but I don’t know which export / predict calls generated it. I tried to recreate everything I did yesterday, but now I’m getting no detections on any of the images if batch is not set on the predict command, which I assume is not the expected behavior?
Hello! Thanks for the detailed follow-up and for providing such a clear description of the issue.
You’ve correctly identified a known requirement and what appears to be a bug. The initial problem you faced arises because dynamic models exported with nms=True require a maximum batch size to be set during export, for example batch=16. The default batch=1 is insufficient for the embedded NMS logic, which is why your second export command using batch=12 is the correct approach. This behavior is reflected in our exporter logic, as you can see in the Exporter class reference.
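If it helps, the equivalent export through the Python API looks roughly like this (using the model name and batch value from your example; adjust both as needed):

from ultralytics import YOLO

# Load the trained segmentation model
model = YOLO("yolo11-boxseg.pt")

# Export to ONNX with dynamic shapes, embedded NMS, and a max batch size of 12
model.export(format="onnx", dynamic=True, batch=12, nms=True)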
The new issue, where predict fails with an implicit batch=1 on the correctly exported model, is not expected behavior. A dynamic model should handle inference on any batch size up to the maximum it was exported with. This points to a potential edge case bug in the exported NMS operations when the inference batch size is exactly 1.
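One way to isolate whether this is an export-side or predict-side problem is to run the exported ONNX file directly through onnxruntime with batch sizes of 1 and 4 and compare the outputs. A minimal sketch, assuming onnxruntime is installed and the model was exported at the default 640x640 input size:

import numpy as np
import onnxruntime as ort

# Open the exported model on CPU; swap in CUDAExecutionProvider if available
session = ort.InferenceSession("yolo11-boxseg.onnx", providers=["CPUExecutionProvider"])
inp = session.get_inputs()[0]
print("input:", inp.name, inp.shape)  # dynamic axes show up as names like 'batch'

# Feed dummy batches of 1 and 4; the output shapes should scale with the batch dimension
for n in (1, 4):
    x = np.random.rand(n, 3, 640, 640).astype(np.float32)
    outputs = session.run(None, {inp.name: x})
    print(f"batch={n}:", [o.shape for o in outputs])

Random inputs only verify shapes; to check the detections themselves, feed the same preprocessed real images at both batch sizes. If batch 1 works here but fails through yolo predict, that points at the predict pipeline rather than the export.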
We appreciate you bringing this to our attention. The team will investigate this. In the meantime, the workaround is to explicitly set a batch size in your predict command, such as batch=2, even when processing a single image.
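For example:
yolo predict task=segment model=yolo11-boxseg.onnx source=images batch=2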