I have retrained a YOLOv11 model using my own image dataset, and then I tried to export the model in ONNX format:
retrained_model.export(format="onnx")
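For context, the full export step presumably looked something like the minimal sketch below (the retrained_model.pt filename is an assumption for illustration):

from ultralytics import YOLO

# Load the retrained weights (filename assumed for illustration)
retrained_model = YOLO("retrained_model.pt")

# Export to ONNX; the export image size should follow the training settings unless overridden
retrained_model.export(format="onnx")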
Once I had the model in ONNX format, I tried to use it in two different environments, C# with the .NET Framework and Python. Below is my code and the respective output:
#r "nuget: YoloDotNet"
#r "nuget: SkiaSharp"
using YoloDotNet;
using YoloDotNet.Enums;
using YoloDotNet.Models;
using YoloDotNet.Extensions;
using SkiaSharp;
// Instantiate a new Yolo object
var yolo = new Yolo(new YoloOptions
{
    OnnxModel = "retrained_model.onnx",     // Your YOLO model in ONNX format
    ModelType = ModelType.ObjectDetection,  // Set your model type
    Cuda = false,                           // Use CPU or CUDA for GPU-accelerated inference. Default = true
    GpuId = 0,                              // Select GPU by id. Default = 0
    PrimeGpu = false,                       // Pre-allocate GPU before first inference. Default = false
});
// Load image
var image = SKImage.FromEncodedData("test_images/test_1.jpeg");
// Run inference and get the results
var results = yolo.RunObjectDetection(image, confidence: 0.2, iou: 0.7);
// Draw results and save it
var resultImage = image.Draw(results);
resultImage.Save("result_images/result_1.jpg", SKEncodedImageFormat.Jpeg, 80);
// Print detection results to the console
Console.WriteLine("Detected objects:");
foreach (var detection in results)
{
    Console.WriteLine($" - Label: {detection.Label}, Confidence: {detection.Confidence:F2}, Bounding box: {detection.BoundingBox}");
}
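For comparison, the Python side (not shown above) was presumably a minimal Ultralytics call along these lines, using the same thresholds as the C# code (the exact script is an assumption):

from ultralytics import YOLO

# Ultralytics can load and run the exported ONNX weights directly
model = YOLO("retrained_model.onnx")

# Same confidence and IoU thresholds as the C# side
results = model.predict("test_images/test_1.jpeg", conf=0.2, iou=0.7)

for r in results:
    for box in r.boxes:
        print(box.cls, box.conf, box.xyxy)  # class id, confidence, box corners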
The ONNX model works fine in both environments. However, I noticed that in the Python environment only one object is detected (which is correct), whereas in the C# environment three additional objects are detected besides the correct one. Both environments use the same test image and the same ONNX model, with the same confidence and IoU thresholds, yet the C# environment seems to produce extra, incorrect detections. I saw some sources mention that the image pre-processing steps in Python and C# differ, which causes this issue. Is that true? What should I do to make the C# code produce the same output as the Python code?
The output from Ultralytics (in the Python environment) shows that one object was detected, which is correct. The bounding box and label are also accurate. However, in the C# version, the correct output is shown, but three additional objects are incorrectly detected, even though there is only one object in the image.
I suspect that the way YoloDotNet (used in C#) resizes the test image differs from how Ultralytics resizes it, but I'm not sure how each of them handles the resizing process to fit the image to the retrained model.
Are you running both environments on the same hardware? Specifically, if you're running on a Windows PC (as an example), are you using Python on native Windows alongside the C# environment, or are you running Python inside a WSL environment? Beyond different hardware/environments, it's entirely possible that the float handling in Python and C# is different enough that you end up with varying results.
Sharing the output of both, the tensor/metadata and/or annotated image results, would help in understanding the specific problem you're facing. There are many different causes that could produce additional detections, but without seeing how the extra detections manifest, it is harder to deduce which one applies.
As you can observe from the screenshots, the same model used in the two different environments gives different confidence scores. This is another problem I encountered, in addition to the challenge I mentioned in the question (multiple objects detected in the C# environment).
I understand that when a new test image is passed in, the method YoloDotNet uses to resize the image to 1088x1088 may differ from how Ultralytics resizes the image to the same size. However, I'm not sure how Ultralytics resizes the input image and adjusts it to fit the model before making predictions.
There are two pre-processing steps for image resizing at inference, and from L153 of the predictor source, the pre_transform method is called to apply them.
For ONNX-exported models, the image is resized to (640, 640) by default in Python. First, the longest side of the image is scaled to 640, and the short side is scaled by the same factor to maintain the aspect ratio. Then the short side is padded evenly with the difference so the image becomes a 640 square.
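In rough code, that letterbox step looks like the sketch below (a simplification of the Ultralytics LetterBox transform; the 114 padding value and the exact rounding are assumptions):

import cv2
import numpy as np

def letterbox(img: np.ndarray, new_size: int = 640, pad_value: int = 114) -> np.ndarray:
    # Scale the longest side to new_size, keeping the aspect ratio
    h, w = img.shape[:2]
    scale = new_size / max(h, w)
    resized = cv2.resize(img, (round(w * scale), round(h * scale)))

    # Pad the short side evenly on both sides to reach a new_size x new_size square
    rh, rw = resized.shape[:2]
    top = (new_size - rh) // 2
    left = (new_size - rw) // 2
    return cv2.copyMakeBorder(resized, top, new_size - rh - top, left, new_size - rw - left,
                              cv2.BORDER_CONSTANT, value=(pad_value, pad_value, pad_value))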
If the YoloDotNet library is resizing to (1088, 1088), that would certainly cause a difference in the confidence values. Differences in the method used to calculate IoU for non-max suppression could also contribute to very different results, both in confidence values and in the number of bounding boxes, despite all other settings being equal.
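As a reference point, one common IoU formulation is sketched below; small implementation differences (per-class vs. class-agnostic suppression, or how the intersection is clamped) can change which boxes survive NMS even with identical thresholds. This is illustrative, not the code of either library:

def iou(a, b):
    # a, b are [x1, y1, x2, y2] boxes
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter + 1e-9)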
When I retrained the YOLO model with my own dataset, I used the command model.train(imgsz=1088). Now, if I perform inference on a new test image that is already 1088x1088 in size, will the image still be resized, or will it remain unchanged since it’s already in the required size?
Since the metadata shows (1088, 1088), if your image is already at those dimensions then no resizing should take place. Perhaps try exporting with nms=True to see if that helps bring the two results into closer alignment. Beyond that, you may need to get support from the YoloDotNet author to understand the difference in the results.
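If you want to try the nms=True route, a minimal sketch of the re-export would be (retrained_model.pt is the assumed path to the trained weights):

from ultralytics import YOLO

model = YOLO("retrained_model.pt")  # assumed path to the retrained weights

# Bake NMS into the ONNX graph so post-processing depends less on the runtime library
model.export(format="onnx", imgsz=1088, nms=True)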