YOLO26 experiment: Raw model output to bounding boxes

fzngagan · January 20, 2026, 7:06pm

Did some experimentation with the yolo26 model. Basically, I want to replicate the inference process from model outputs to bounding box drawing. Here’s what I did:

collected model’s raw predictions
scaled them to the original image dimensions
plotted the boxes on the original image using matplotlib.patches

github.com/fzngagan/object-detection-experiments

yolo26.ipynb

main


Understood a bit of the internals of the ultralytics codebase in the process. There are fewer abstractions in the code than I would have imagined, but I’d like to build a very simple training/inference pipeline, mostly for learning and experimentation.

pderrenger · January 21, 2026, 1:38pm

Your approach is basically the right mental model: the “gotcha” is that the network predicts in the letterboxed (resized + padded) image space, and you need to undo that exact resize/pad before drawing on the original. In Ultralytics YOLO this undo step is handled for you, so if you just want the final pixel boxes to plot, you can pull them directly from the Results object (they’re already scaled to the original image):

from ultralytics import YOLO

model = YOLO("yolo26n.pt")
r = model("https://ultralytics.com/images/bus.jpg")[0]

xyxy = r.boxes.xyxy.cpu().numpy()   # x1,y1,x2,y2 in original-image pixels
conf = r.boxes.conf.cpu().numpy()
cls  = r.boxes.cls.cpu().numpy().astype(int)

print(xyxy[0], conf[0], cls[0])
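
For intuition about what the undo step does, here is a minimal pure-Python sketch of mapping one box from letterboxed space back to original-image pixels. It assumes the image was resized with a single gain and padded symmetrically (centered). The function name `unletterbox_box` and its arguments are hypothetical, and this is a simplified illustration rather than the library's `scale_boxes()` implementation, which also handles rounding and batched tensors.

```python
def unletterbox_box(box, net_shape, orig_shape):
    """Map one (x1, y1, x2, y2) box from letterboxed space to original pixels.

    net_shape  -- (height, width) of the tensor the network actually saw
    orig_shape -- (height, width) of the original image
    Assumes aspect-preserving resize plus symmetric padding (an assumption).
    """
    net_h, net_w = net_shape
    orig_h, orig_w = orig_shape

    # Single scale factor used when shrinking/growing the image to fit.
    gain = min(net_h / orig_h, net_w / orig_w)

    # Padding added on each side to fill the remaining space.
    pad_x = (net_w - orig_w * gain) / 2
    pad_y = (net_h - orig_h * gain) / 2

    x1, y1, x2, y2 = box
    # Remove the padding offset first, then undo the resize.
    x1 = (x1 - pad_x) / gain
    x2 = (x2 - pad_x) / gain
    y1 = (y1 - pad_y) / gain
    y2 = (y2 - pad_y) / gain

    # Clip to the original image bounds.
    x1 = min(max(x1, 0.0), float(orig_w))
    x2 = min(max(x2, 0.0), float(orig_w))
    y1 = min(max(y1, 0.0), float(orig_h))
    y2 = min(max(y2, 0.0), float(orig_h))
    return (x1, y1, x2, y2)


# Example: an 810x1080 (width x height) image letterboxed into 640x640.
print(unletterbox_box((80.0, 0.0, 560.0, 640.0), (640, 640), (1080, 810)))
```

Pass the shape of the tensor that was actually fed to the network as `net_shape`. If the input was padded to a non-square size, the same arithmetic applies with that shape.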

If you’re trying to replicate the full path from raw head outputs → final boxes, the key post-processing steps are “decode → NMS → scale back to original,” implemented in ultralytics/utils/ops.py (look for non_max_suppression() and scale_boxes()). The coordinate formats we expose on Results are summarized in the bounding box glossary, and the expected “absolute vs normalized” behavior is also covered in the common issues guide section on box coordinates.
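
To make the NMS step in that chain concrete, here is a minimal class-agnostic greedy NMS sketch in pure Python. The helper names `iou` and `greedy_nms` and the 0.5 threshold are illustrative assumptions. This is not the library's `non_max_suppression()`, which additionally handles confidence filtering, per-class behavior, and batched tensors. Whether you need this step at all depends on which output you captured.

```python
def iou(a, b):
    """Intersection-over-union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes, scores, iou_thres=0.5):
    """Return indices of boxes to keep, highest score first.

    A box is kept only if it overlaps every already-kept box
    by at most iou_thres (threshold value is an assumption).
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_thres for j in keep):
            keep.append(i)
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
scores = [0.9, 0.8, 0.7]
print(greedy_nms(boxes, scores))  # second box overlaps the first heavily
```

The quadratic loop is fine for experimentation on a handful of boxes. A tensor-based implementation is the usual choice once the candidate count grows.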

If you share what you’re calling “raw predictions” (tensor shape + where you tapped it: PyTorch model forward vs an exported model), I can point you to the exact decode step for that output format, since it differs depending on whether you captured pre- or post-decode outputs.
