Hey everyone, I have a question I hope you can help me with. I’m working on a project related to crop disease spot detection. Using the project dataset, which contains around 1,500 images, I trained my model. The results look okay but don’t meet the project requirements:
-
Precision (P): 0.887
-
Recall (R): 0.804
-
mAP@0.5: 0.864
-
mAP@0.5:0.95: 0.412
I want to improve the precision and mAP@0.5:0.95 to 0.9+ and 0.6+ respectively. I’ve tried many approaches. Initially, I trained a YOLOv5 version of the model, but the results were not satisfactory. Then I switched to YOLOv11, which is what I’m currently using. I also tried adjusting parameters to improve performance, but the improvement was minimal.
Then, I consulted others and was advised to try replacing the YOLOv11 backbone with something like DenseNet. I started trying to modify it, but clearly, I failed, which is why I’m asking here.
To be honest, I’m a newcomer to this field. I only understand some technical terms and roughly how models operate. I can’t understand the structure or code at all. However, the project task has been assigned to me, so I must complete it.
I tried using ChatGPT to help me figure out the relationships. It guided me to create my own yolov11.yaml file, replace the backbone with a DenseNet structure, create a related DenseNet .py file to register the backbone with the model, and then call it later. I also made a test.py file to check whether the input and output channels aligned, because I had failed to replace DenseNet in YOLOv5 before.
Despite all this, it still failed because the output channels couldn’t align. I feel very confused and lost. I couldn’t find any tutorials online that explain this. I understand that at a basic stage I should study more theory, but given the situation, I need to figure out a way to complete this task.
I hope someone can help me clarify what’s really going on, what preparations I should make, what to pay attention to when replacing a network or even modules, and what knowledge I should study to tackle this.