Windows Multi−GPU / YOLO11 from Python API won’t start multi-GPU — always runs on cuda:0
GPUs: 4× NVIDIA GeForce RTX 5090 (32 GB)
Driver: 577.00 (CUDA driver 12.9)
OS: Windows 11 Pro
Python: 3.11.11 (Conda)
PyTorch: 2.8.0+cu128
Ultralytics: 8.3.127 (CLI suggests updating to 8.3.206)
Problem: Training via the Python API with device=[0,1,2,3] always runs on a single GPU (cuda:0) — DDP never initializes.What I’m looking for:How to start multi-GPU (DDP) from the Python API on Windows in this version?Is this a known issue in 8.3.127, and does updating to 8.3.206 fix API-side DDP on Windows?