Ultralytics v8.4.0 is live β introducing Ultralytics YOLO26 
Quick overview: v8.4.0 delivers the Ultralytics YOLO26 release β a re-engineered, edge-first family with native end-to-end (NMS-free) inference, major CPU speedups, and upgrades across detect, segment, classify, pose, and OBB. This release also includes an ARM64 Docker fix to improve deployment experience on modern hardware. ![]()
New Features (Highlights)
Ultralytics YOLO26: native end-to-end, NMS-free
Ultralytics YOLO26 arrives in v8.4.0 with a streamlined architecture designed for edge and low-power environmentsβbuilt for simpler exports and faster inference.
Key highlights:
NMS-free end-to-end inference (native design, reduced latency, simpler deployment)
Up to 43% faster CPU inference (edge-first optimization)
MuSGD optimizer (hybrid optimizer inspired by modern LLM training stability ideas)
No DFL (streamlines export + boosts compatibility with low-power hardware)
Task-specific improvements
- Segmentation: semantic loss + multi-scale protos
- Pose: RLE for higher-precision keypoints
- OBB: angle loss to improve boundary behavior
YOLOE-26 open-vocabulary: zero-shot detection using text or visual prompts
Improvements
Edge-first philosophy: removing bottlenecks like DFL and NMS to maximize performance on constrained devices
Unified & versatile: end-to-end support across detection, segmentation, classification, pose, and OBB
Next-gen training stability: MuSGD helps improve convergence behavior and training stability
Bug Fixes
ARM64 Dockerfile fix for smoother builds and deploys on ARM devices
Whatβs Changed (PRs)
- Fix ARM64 Dockerfile by @glenn-jocher in PR #23175
ultralytics 8.4.0YOLO26 Models Release by @glenn-jocher in PR #23176
How to Try v8.4.0
Update in seconds:
pip install -U ultralytics
Quick sanity check (Python):
from ultralytics import YOLO
model = YOLO("yolo26n.pt")
model.predict(source="https://ultralytics.com/images/bus.jpg")
You can also browse the full release details directly in the official v8.4.0 release page, and review the full diff in the v8.4.0 full changelog.
Feedback Welcome
Please try v8.4.0 on your workloads (especially CPU/edge deployments and exports), and share:
- performance numbers (CPU/GPU, batch size,
imgsz, export target) - export experiences (ONNX, TensorRT, CoreML, etc.)
- any regressions or odd behaviors you spot
Your feedback helps the whole YOLO community push the ecosystem forward.
