Balance Classes During Training - Yolov11

speendo · January 16, 2025, 1:20pm

Hello and thank you for this help forum.

I try to train a yolov11 model with a certain subset of classes from the CityScapes dataset.

As my results so far are only moderate, I would like to try another approach and use a weighted data loader in order to have balanced classes during training.

I tried to follow these intructions to achieve this:

The approach seems to work on my local machine (which does not have GPU support) but when I use it on the data cluster of my research institution, it seems that the class YOLOWeightedDataset is not used although I monkey patched it over dataset.YOLODataset.

After a lot of debugging and banging my head against the keyboard I now think that somewhere in YOLO.train() a new process is spawned that does not know about the monkey patch and therefore would not use it.

Is there another way to achieve a weight balanced training, e.g. by invoking my custom class YOLOWeightedDataset in a different way with the YOLO.train() method?

Here is what I did so far:

from ultralytics import YOLO
import ultralytics.data as data
import ultralytics.data.dataset as dataset
import ultralytics.data.build as build

import numpy as np

class YOLOWeightedDataset(data.dataset.YOLODataset):
    def __init__(self, *args, data=None, task="train", **kwargs):
        """
        Initialize the WeightedDataset.

        Args:
            class_weights (list or numpy array): A list or array of weights corresponding to each class.
        """
    
        super(YOLOWeightedDataset, self).__init__(*args, data=data, task=task, **kwargs)

        self.train_mode = "train" in self.prefix

        # You can also specify weights manually instead
        self.count_instances()
        class_weights = np.sum(self.counts) / self.counts

        # Aggregation function
        self.agg_func = np.mean

        self.class_weights = np.array(class_weights)
        self.weights = self.calculate_weights()
        self.probabilities = self.calculate_probabilities()

    ... # rest of the code from https://y-t-g.github.io/tutorials/yolo-class-balancing/


dataset.YOLODataset = YOLOWeightedDataset

model = YOLO("yolo11n.pt")

model.train(
    data='data.yaml',
    device=[0,1],
    batch=128,
    imgsz=640,
    save_period=10,
    project="runs/detect/nano_balanced"
)

It also might be worth mentioning that I run this code in a Jupyter Notebook on my research institution’s cluster.

Thank you very much in advance for your help!

BurhanQ · January 16, 2025, 1:41pm

@Toxite might be able to answer this more definitely, but I’d guess that it doesn’t work for multiple GPUs. You could try setting device=0 instead to see if that helps.

speendo · January 16, 2025, 1:48pm

Thank you @BurhanQ!

I just found out that the author of the page I linked also provided some instructions for multiple GPUS (Google Colab).

This will definitely be my next approach.

Please let me know if you have any other insight!

Toxite · January 16, 2025, 2:45pm

The callbacks would still not work with multi-GPU. Neither monkey-patching, nor callbacks work with multi-GPU training. You will have to modify the source code to make the modifications stick. By adding the YOLOWeightedDataset to this file:

github.com/ultralytics/ultralytics

ultralytics/data/build.py

d9292fb7f

# Ultralytics 🚀 AGPL-3.0 License - https://ultralytics.com/license

import os
import random
from pathlib import Path

import numpy as np
import torch
from PIL import Image
from torch.utils.data import dataloader, distributed

from ultralytics.data.dataset import GroundingDataset, YOLODataset, YOLOMultiModalDataset
from ultralytics.data.loaders import (
    LOADERS,
    LoadImagesAndVideos,
    LoadPilAndNumpy,
    LoadScreenshots,
    LoadStreams,
    LoadTensor,
    SourceTypes,

This file has been truncated. show original

And then changing YOLODataset in this line to YOLOWeightedDataset:

github.com/ultralytics/ultralytics

ultralytics/data/build.py

d9292fb7f


      
          
          def seed_worker(worker_id):  # noqa
              """Set dataloader worker seed https://pytorch.org/docs/stable/notes/randomness.html#dataloader."""
              worker_seed = torch.initial_seed() % 2**32
              np.random.seed(worker_seed)
              random.seed(worker_seed)
          
          
          def build_yolo_dataset(cfg, img_path, batch, data, mode="train", rect=False, stride=32, multi_modal=False):
              """Build YOLO Dataset."""
              dataset = YOLOMultiModalDataset if multi_modal else YOLODataset
              return dataset(
                  img_path=img_path,
                  imgsz=cfg.imgsz,
                  batch_size=batch,
                  augment=mode == "train",  # augmentation
                  hyp=cfg,  # TODO: probably add a get_hyps_from_cfg function
                  rect=cfg.rect or rect,  # rectangular batches
                  cache=cfg.cache or None,
                  single_cls=cfg.single_cls or False,
                  stride=int(stride),

speendo · January 16, 2025, 8:44pm

Thanks for your insight, @Toxite !

So, do I understand you right that the code that the auther provides in their Colab will just not work? I find it strange that they would publish a code that was not tested before.

However, I also had no success testing the approach provided in the colab…

speendo · January 19, 2025, 1:33pm

Thanks! It seems that worked.

Topic		Replies	Views
I Need Help with YOLOv5 Training for Custom Object Detection YOLO yolov5 , support	1	332	July 24, 2024
Yolo11 training with custom dataset Support yolo , discussion , code	9	1178	February 17, 2025
Tools for handling class imbalance YOLO	1	15	June 17, 2025
About Yolo Configuration File (YAML) Discussion discussion	2	299	September 30, 2024
Add New Classes to (YOLOv8n or YOLO11n) Pretrained Model Without Losing COCO Classes Discussion yolo , question , support	2	178	April 20, 2025

Balance Classes During Training - Yolov11

Related topics