Ultralytics v8.3.44 Release Notes
Summary
We are excited to announce the release of v8.3.44
. This update focuses on enhancing Triton Inference Server integration with improved metadata support, optimizing model export functionality, and delivering significant usability and stability enhancements to streamline your workflows.
Key Changes
Triton Inference Enhancements
- Added functionality to retrieve and store model metadata during export (
on_export_end
callback). - Triton Model Repository now dynamically includes metadata (
config.pbtxt
) for seamless configurations. - Enhanced
TritonRemoteModel
to support enriched customizability and traceability through metadata handling. - Set a default task (
task=detect
) for Triton Server models to assist users and minimise configuration errors.
Core Features & Fixes
- Dependency Adjustment: Switched dependency back to
lap
for better stability and compatibility. - Dynamic ONNX: Adjusted the
dynamic
setting to only trigger based on image size (not batch size) for robust and efficient ONNX model handling. - Custom PyTorch Model Acceptance: Expanded
AutoBackend
to accept in-memory PyTorch models, improving workflow flexibility. - AMP Compatibility: Hardcoded failing GPUs (e.g., GTX 16 & Quadro T series) to prevent NaN losses during AMP training.
- New Utility -
empty_like
: Added streamlined tensor creation to enhance code reuse and maintainability. - Segment Resampling Fix: Resolved segment-affecting issues to retain original points during resampling.
Purpose & Impact
Enriched Triton Support
- Improved metadata handling for smooth export and flexible deployment with Triton Inference Server.
- Reduces user errors by introducing intelligent default task settings for Triton integrations.
User Enhancements
- In-memory PyTorch model acceptance simplifies integration in unique workflows.
- Eliminates dependency issues for compatibility-sensitive workflows by reverting to
lap
.
Stability & Performance Gains
- Addressed AMP-related issues for specific GPUs, ensuring stable training experiences.
- Dynamic ONNX tweaks make model exports more consistent and reliable.
Maintenance & Optimization
empty_like
utility reduces redundancy in tensor operations across the codebase.- Resampled segments now maintain fidelity to original data for enhanced accuracy.
This release represents a step forward in usability, stability, and technical sophistication for Ultralytics usersβreducing friction across Triton, model export, and core functionality!
Whatβs Changed
- Revert
lapx
tolap
by @Laughing-q - Preserve original points in
resample_segments
by @Y-T-G - Hardcode failing GPUs for AMP checks by @Y-T-G
- Set
dynamic
based on images size for ONNX by @Y-T-G - Default
task
for Triton inference by @Laughing-q - Update
weights
parameter in AutoBackend by @ye-yangshuo - Fix
empty_like
fornp
andtorch
by @Laughing-q - Improve Triton metadata handling by @Y-T-G
New Contributors
- Welcome @ye-yangshuo for their first contribution in #18059!
Explore the Full Changelog: Detail View
Get Involved!
We encourage you to try out v8.3.44 and share your feedback in our Discussions. Your input helps improve Ultralytics and its offerings for the entire community.
Thank you to all contributors and users for pushing the boundaries of innovation together!