Normal then slow then crashing training

BurhanQ · July 9, 2025, 1:09pm

Glad that helped! The Autobatch is calculation is helpful but generally needs a bit of adjustments. FWIW, I usually do a short training session, 3-5 epochs, with autobatch=True to get a rough idea of how much can fit in the GPU memory, and make adjustments from there. Not always a bulletproof plan as you can get an out of memory error at epoch 97/300, but the nice thing is you can always resume training with a lower batch value!

Topic		Replies	Views
Optimize GPU utilization while training YOLO	5	2439	February 19, 2025
Yolo 12 x and l not finishing training Support question , support	1	457	April 8, 2025
GPU memory leak YOLO yolo , nvidia , pytorch	20	1429	September 9, 2025
I am seeing major improvements in my model and the only change has been the machine it is trained on YOLO troubleshooting	3	642	April 29, 2025
Yolov8 CUDA out of memory error Support yolo , support , troubleshooting	3	876	March 12, 2025

Normal then slow then crashing training

Related topics