Modifying yolo architecture

sisidipisi · November 1, 2025, 9:18am

Hi everyone, I’m working on object and lane detection using YOLO, and I’ve been trying to choose a topic for my research project. We currently train two separate models for the two tasks, but I’m wondering if I could use a single model for both. Could I modify the YOLO architecture to have one shared backbone and two separate heads? In the end, would this approach be more efficient, or would I lose accuracy?

Toxite · November 1, 2025, 3:01pm

YOLO segmentation models use two heads. One for detection, another for segmentation.

sisidipisi · November 1, 2025, 4:05pm

but at the end can ı have one .pt file as an example? or would ı be able to have high accuracy scores with fast inference?. I use 25-27 classes for detecting traffic signs. If it’s better for me to modify a high-accuracy and efficient model myself for 27 classes and lane detection, then I’ll do it that way.

pderrenger · November 2, 2025, 1:28pm

Yes—you can keep a single .pt. Use a YOLO11 segmentation model; it already has a shared backbone with two heads (detection + masks), so one forward pass returns boxes for traffic signs and masks for lanes from the same checkpoint, as described in the Segment head reference.

Two important notes:

Training a segmentation model expects masks for all labeled instances. If you only have boxes for traffic signs, either generate masks for them or keep a second detection-only model.
One multi-head model is usually faster than running two models; accuracy is typically comparable if your data is balanced. If you need a bit more headroom, try the next model size up (e.g., yolo11m-seg).

Minimal setup:

# single model for lanes (mask) + signs (boxes)
yolo train task=segment model=yolo11s-seg.pt data=traffic.yaml epochs=100 imgsz=1280

from ultralytics import YOLO
m = YOLO('best.pt')  # result of training above
r = m.predict(source='video.mp4', conf=0.25)
# r[i].boxes -> traffic signs; r[i].masks -> lane masks

If you want to dig deeper into how the dual-head works, see the Segment head reference, and for setup details check the Train mode docs.

sisidipisi · November 5, 2025, 1:04pm

Thank you for your reply. But what if I have two different datasets for the two problems? I won’t be detecting objects and lanes in the same image.

Topic		Replies	Views
Extending YOLO26 for custom multi-task architecture Discussion question	1	77	February 12, 2026
YOLOv11 uses multiple heads(classification+segmentation) during training Discussion yolo , question , code	1	177	December 11, 2025
Modifying yolo11 architecture to have one backbone and 2 necks and heads Discussion yolo , question , support , discussion , code	6	2083	August 12, 2025
Segmentation Or Detection model choice advice? Discussion yolo , question , discussion	1	304	October 19, 2025
Adding a new head to the YOLO11n model to detect very small objects Discussion support , code	23	4242	August 6, 2025

Modifying yolo architecture

Related topics