Train a detection YOLO with a fourth input "depth"

redon · December 1, 2024, 1:45am

Hey guys,
Before I fork the code and add this feature myself:

is there any existing option to feed a fourth input channel, a depth value per pixel (or per object), when training a YOLO detection model?

Has it ever been implemented before?

Example of the depth values of an image:

Thanks

Toxite · December 1, 2024, 12:48pm

Ultralytics only supports 3-channel input by default, so you would have to modify it to work with 4 channels. That will probably take significant changes, especially in the augmentations.
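To give a rough idea of the model-side part of that change, here is a minimal sketch in plain PyTorch. It is not Ultralytics code. It only illustrates one common heuristic: widen the first convolution from 3 to 4 input channels, keep the existing RGB filters, and initialise the new depth filters from their mean. The helper name `inflate_first_conv`, the layer sizes, and the random tensors are all hypothetical. The data loader and the augmentations would still need separate work.

```python
import torch
import torch.nn as nn


def inflate_first_conv(conv: nn.Conv2d, extra_channels: int = 1) -> nn.Conv2d:
    """Return a copy of `conv` that accepts `extra_channels` more input channels.

    Hypothetical helper for illustration; assumes an ordinary (non-grouped) conv.
    """
    assert conv.groups == 1, "sketch only handles groups == 1"
    new_conv = nn.Conv2d(
        conv.in_channels + extra_channels,
        conv.out_channels,
        kernel_size=conv.kernel_size,
        stride=conv.stride,
        padding=conv.padding,
        dilation=conv.dilation,
        bias=conv.bias is not None,
    )
    with torch.no_grad():
        # Keep the existing RGB filters unchanged.
        new_conv.weight[:, : conv.in_channels] = conv.weight
        # Initialise the depth filters with the mean of the RGB filters
        # (one heuristic among several; random init is another option).
        new_conv.weight[:, conv.in_channels :] = conv.weight.mean(dim=1, keepdim=True)
        if conv.bias is not None:
            new_conv.bias.copy_(conv.bias)
    return new_conv


# Stand-in for a detector's first layer (sizes are assumptions).
rgb_conv = nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1, bias=False)
rgbd_conv = inflate_first_conv(rgb_conv)

# Build a 4-channel input by stacking a depth map onto an RGB image.
rgb = torch.rand(1, 3, 64, 64)    # fake RGB image, values in [0, 1)
depth = torch.rand(1, 1, 64, 64)  # fake depth map, already normalised
rgbd = torch.cat([rgb, depth], dim=1)

out = rgbd_conv(rgbd)
print(out.shape)  # torch.Size([1, 16, 32, 32])
```

Colour-space augmentations such as HSV jitter only make sense on the first three channels, while geometric ones such as flips, scaling, and mosaic have to be applied to all four together so the depth map stays aligned with the image.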

redon · December 1, 2024, 1:27pm

Thank you.
I’m willing to give it a try, since I’m keen to learn the framework and the technology under the hood by working on it.

If you’re up for guiding me a bit on which parts of the pipeline need to be modified, I’d be glad to check in from time to time about the steps needed to make the changes complete.
In the meantime, I’ll mostly chat with GPT to get a handle on what’s going on in there.
