Ultralytics v8.3.44 Release Notes
Summary
We are excited to announce the release of v8.3.44. This update focuses on enhancing Triton Inference Server integration with improved metadata support, optimizing model export functionality, and delivering significant usability and stability enhancements to streamline your workflows. ![]()
Key Changes
Triton Inference Enhancements
Added functionality to retrieve and store model metadata during export (on_export_endcallback).
Triton Model Repository now dynamically includes metadata (config.pbtxt) for seamless configurations.
Enhanced TritonRemoteModelto support enriched customizability and traceability through metadata handling.
Set a default task (task=detect) for Triton Server models to assist users and minimise configuration errors.
Core Features & Fixes
- Dependency Adjustment: Switched dependency back to
lapfor better stability and compatibility.
Dynamic ONNX: Adjusted the dynamicsetting to only trigger based on image size (not batch size) for robust and efficient ONNX model handling.
Custom PyTorch Model Acceptance: Expanded AutoBackendto accept in-memory PyTorch models, improving workflow flexibility.
AMP Compatibility: Hardcoded failing GPUs (e.g., GTX 16 & Quadro T series) to prevent NaN losses during AMP training.
New Utility - empty_like: Added streamlined tensor creation to enhance code reuse and maintainability.
Segment Resampling Fix: Resolved segment-affecting issues to retain original points during resampling.
Purpose & Impact
Enriched Triton Support
- Improved metadata handling for smooth export and flexible deployment with Triton Inference Server.
- Reduces user errors by introducing intelligent default task settings for Triton integrations.
User Enhancements
- In-memory PyTorch model acceptance simplifies integration in unique workflows.
- Eliminates dependency issues for compatibility-sensitive workflows by reverting to
lap.
Stability & Performance Gains
- Addressed AMP-related issues for specific GPUs, ensuring stable training experiences.
- Dynamic ONNX tweaks make model exports more consistent and reliable.
Maintenance & Optimization
empty_likeutility reduces redundancy in tensor operations across the codebase.- Resampled segments now maintain fidelity to original data for enhanced accuracy.
This release represents a step forward in usability, stability, and technical sophistication for Ultralytics usersβreducing friction across Triton, model export, and core functionality! ![]()
Whatβs Changed
- Revert
lapxtolapby @Laughing-q - Preserve original points in
resample_segmentsby @Y-T-G - Hardcode failing GPUs for AMP checks by @Y-T-G
- Set
dynamicbased on images size for ONNX by @Y-T-G - Default
taskfor Triton inference by @Laughing-q - Update
weightsparameter in AutoBackend by @ye-yangshuo - Fix
empty_likefornpandtorchby @Laughing-q - Improve Triton metadata handling by @Y-T-G
New Contributors
Welcome @ye-yangshuo for their first contribution in #18059!
Explore the Full Changelog: Detail View
Get Involved!
We encourage you to try out v8.3.44 and share your feedback in our Discussions. Your input helps improve Ultralytics and its offerings for the entire community. ![]()
Thank you to all contributors and users for pushing the boundaries of innovation together! ![]()