Normal then slow then crashing training

Glad that helped! The Autobatch is calculation is helpful but generally needs a bit of adjustments. FWIW, I usually do a short training session, 3-5 epochs, with autobatch=True to get a rough idea of how much can fit in the GPU memory, and make adjustments from there. Not always a bulletproof plan as you can get an out of memory error at epoch 97/300, but the nice thing is you can always resume training with a lower batch value!