Breaking Reproducibility Due to In-place Data Patching

fahmad-wescef · August 18, 2025, 5:59am

I’m training Yolo11x on a few thousand images (JPG). During each training run, I see warnings about some files being corrupt and then being fixed. The “corrupt” files open just fine so I am not sure what is being “fixed”. However, this in-place fixing breaks experiment reproducibility.


val: /mnt/devel/project/data/yolo/internal/1/images/val/I63273_I542557_039m40r3164.jpg: corrupt JPEG restored and saved

Toxite · August 18, 2025, 7:21am

The “corrupt” files open just fine so I am not sure what is being “fixed”.

It would have replaced the original file with the fixed version, so it ought to open fine after that. The corruption it is fixing is truncated or partially downloaded JPEG files:

github.com/ultralytics/ultralytics

ultralytics/data/utils.py

c47172f3d


      
          if im.format.lower() in {"jpg", "jpeg"}:
              with open(im_file, "rb") as f:
                  f.seek(-2, 2)
                  if f.read() != b"\xff\xd9":  # corrupt JPEG
                      ImageOps.exif_transpose(Image.open(im_file)).save(im_file, "JPEG", subsampling=0, quality=100)
                      msg = f"{prefix}{im_file}: corrupt JPEG restored and saved"

However, this in-place fixing breaks experiment reproducibility.

Are you sure that it’s due to this? It shouldn’t affect reproducibility because the fix occurs before the image is passed for training. Is your dataset being redownloaded every time? Or is it on a remote mounted drive? It shouldn’t be getting different images corrupted every training run, unless you’re redownloading it or it’s on a network drive.

fahmad-wescef · August 22, 2025, 2:24am

So the dataset, being a dependency, is dvc versioned. dvc detects that the files are changed but can’t tell what has changed.

Toxite · August 22, 2025, 3:59am

Are the same images appearing as corrupted every time?

fahmad-wescef · August 22, 2025, 6:23am

I haven’t checked this specifically but I’d assume that this is the case since even after I commit the fixed images to dvc, I get error late on.

Toxite · August 22, 2025, 6:40am

What’s your training command?

Toxite · August 22, 2025, 7:06am

To me, it seems unlikely that it’s the same image being repaired each time. The repair would fix the image, and save it, so the same image isn’t going appear corrupted again. Unless dvc is restoring it to the old version and corrupting it again. Also you didn’t answer my questions.

Are you sure that it’s due to this? It shouldn’t affect reproducibility because the fix occurs before the image is passed for training. Is your dataset being redownloaded every time? Or is it on a remote mounted drive? It shouldn’t be getting different images corrupted every training run, unless you’re redownloading it or it’s on a network drive.

Topic		Replies	Views
[serious bug report] cant upload 9mb dataset with 30 classes and 5000 classes Discussion bug-fix	3	30	March 10, 2026
Different results over same trainingn and dataset Discussion question	4	812	October 25, 2024
Upload duplicate image in dataset Discussion feature	2	21	March 26, 2026
YOLOv11-cls.pt for image classification processing time per image fluctuates Discussion yolo , question , discussion	3	117	August 28, 2025
Deterministic Training and Dropout yielding identical results on multiple training runs Discussion discussion	2	745	August 20, 2024

Breaking Reproducibility Due to In-place Data Patching

Related topics