
[DEP-291] NMS params passed at runtime for NMS-fused ONNX graph#2195

Draft
dkosowski87 wants to merge 20 commits into main from
DEP-291/configurable-NMS-threshold-for-YOLO

Conversation

@dkosowski87
Contributor

Work in progress ...

Task: DEP-291

roboflow-train related PR

…ration.py, utilizing new environment variable retrieval functions for strings and comma-separated lists.
…le input names and validating against declared fused NMS input names. Update forward method to dynamically build input tensors based on configuration, improving flexibility and error handling.
… and logging. Update input name checks for fused NMS and modify forward method to conditionally build input tensors based on provided parameters, improving flexibility in model configuration.
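Conditionally building input tensors against the model's declared fused-NMS input names could be sketched as below. The input names, dtypes, and the `build_nms_feed` helper are assumptions for illustration; the real names come from the ONNX graph's declared inputs:

```python
import numpy as np

# Hypothetical fused-NMS input names and dtypes; actual names would be read
# from the model's declared inputs.
NMS_INPUT_DTYPES = {
    "conf_thresh": np.float32,
    "iou_thresh": np.float32,
    "max_detections": np.int64,
}

def build_nms_feed(image_tensor, declared_inputs, conf=None, iou=None, max_det=None):
    """Build the inference feed dict, adding each NMS parameter as a
    single-element tensor only when the graph declares that input and the
    caller supplied a value."""
    feed = {"images": image_tensor}
    supplied = {"conf_thresh": conf, "iou_thresh": iou, "max_detections": max_det}
    for name, value in supplied.items():
        if value is None:
            continue
        if name not in declared_inputs:
            raise ValueError(f"model does not declare fused-NMS input {name!r}")
        feed[name] = np.asarray([value], dtype=NMS_INPUT_DTYPES[name])
    return feed

feed = build_nms_feed(
    np.zeros((1, 3, 640, 640), dtype=np.float32),
    declared_inputs={"images", "conf_thresh", "iou_thresh", "max_detections"},
    conf=0.25, iou=0.45, max_det=300,
)
```

Validating against the declared names up front turns a silent ONNX Runtime shape/name mismatch into an explicit configuration error.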
…he script includes command-line options for specifying image and model paths, as well as parameters for confidence, IOU threshold, and maximum detections. It handles image loading, model initialization, and outputs detection results with bounding box coordinates and confidence scores.
…inference method to utilize the new post_process_nms_fused_model_output function for improved detection results when fused NMS is enabled.
…mmand-line parameters for specifying the number of benchmark iterations and warmup runs, allowing users to measure inference latency with mean, median, and standard deviation statistics. Enhanced the main function to handle these new options while maintaining existing inference functionality.
…ns to generate and write latency and prediction reports in JSON format, including detailed statistics on inference latencies. Updated command-line options to specify target directory for output files, improving usability and organization of results.
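A benchmark loop with warmup runs and mean/median/standard-deviation latency reporting in JSON, as described in the commits above, might look like this minimal sketch (the `benchmark` helper and its report keys are assumptions, not the PR's actual code):

```python
import json
import statistics
import time

def benchmark(run_fn, iterations=50, warmup=5):
    """Time run_fn, discarding warmup runs, and return latency stats in ms."""
    for _ in range(warmup):
        run_fn()
    latencies = []
    for _ in range(iterations):
        start = time.perf_counter()
        run_fn()
        latencies.append((time.perf_counter() - start) * 1000.0)
    return {
        "iterations": iterations,
        "warmup": warmup,
        "mean_ms": statistics.mean(latencies),
        "median_ms": statistics.median(latencies),
        "stdev_ms": statistics.stdev(latencies) if len(latencies) > 1 else 0.0,
    }

# Stand-in workload; in the script this would be a model inference call.
report = benchmark(lambda: sum(range(1000)), iterations=10, warmup=2)
print(json.dumps(report, indent=2))
```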
…MS inference script. Updated output file paths to include run name as a subdirectory, enhancing organization of results and improving usability for reporting.
…troduced a new command-line option for selecting ONNX Runtime execution providers (cpu, cuda, tensorrt) and updated the model loading process to utilize the selected provider and device. Enhanced latency reporting to include execution provider details, improving configurability and transparency in inference performance.
…uced a new command-line option for specifying batch size, allowing users to duplicate the input image for batched inference. Updated the main function and related methods to handle batch inputs, enhancing performance measurement and flexibility in inference execution.
…ages. Updated command-line options to accept an image directory instead of a single image path, enabling batch processing of multiple images. Enhanced latency reporting with additional metrics and improved JSON output structure for better organization of results.
…ency reporting. Introduced a constant for test batch size, enhanced JSON output to include image paths, and updated command-line options for benchmark iterations and warmup runs. Refactored image loading logic to support dynamic batching based on model configuration.
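Batching a directory of preprocessed images to a fixed batch size, padding the final batch by duplicating the last image, could be sketched as follows (the `batch_images` helper and its padding strategy are assumptions for illustration):

```python
import numpy as np

def batch_images(images, batch_size):
    """Stack preprocessed CHW images into fixed-size NCHW batches,
    duplicating the last image to pad the final batch when needed."""
    batches = []
    for i in range(0, len(images), batch_size):
        chunk = images[i:i + batch_size]
        while len(chunk) < batch_size:
            chunk.append(chunk[-1])  # pad by repeating the last image
        batches.append(np.stack(chunk))
    return batches

imgs = [np.zeros((3, 640, 640), dtype=np.float32) for _ in range(5)]
batches = batch_images(imgs, batch_size=2)
```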
…-element tensors for confidence, IOU threshold, and max detections. This change improves compatibility with the input builders and streamlines the inference process.
…ludes command-line options for image and model paths, as well as parameters for confidence, IOU threshold, and maximum detections. Handles image loading, model initialization, and outputs detection results with bounding box coordinates and confidence scores.
…. Includes command-line options for run name, image directory, model path, target directory, and parameters for confidence, IOU threshold, and maximum detections. Implements latency reporting and JSON output for detailed performance metrics, enhancing usability and configurability for inference tasks.
… error messaging. Updated input validation to allow tensors with dimension 0 equal to 1 and enhanced error messages for incompatible batch sizes. Adjusted batch processing logic to ensure compatibility with dynamic input sizes in YOLOv8 model.
…. Introduced a new command-line option for selecting execution providers (cpu, cuda, tensorrt) and updated model loading to utilize the selected provider and device. Enhanced logging to include provider details for improved configurability and transparency in inference performance.
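Mapping the cpu/cuda/tensorrt command-line choice to an ONNX Runtime provider list might be sketched like this; the `PROVIDER_MAP` contents and fallback ordering are assumptions, though the provider name strings are ONNX Runtime's standard identifiers:

```python
# Map the CLI choice to ONNX Runtime provider lists. TensorRT falls back to
# CUDA, and CUDA falls back to CPU, mirroring ORT's usual fallback chain.
PROVIDER_MAP = {
    "cpu": ["CPUExecutionProvider"],
    "cuda": ["CUDAExecutionProvider", "CPUExecutionProvider"],
    "tensorrt": [
        "TensorrtExecutionProvider",
        "CUDAExecutionProvider",
        "CPUExecutionProvider",
    ],
}

def providers_for(choice: str) -> list[str]:
    try:
        return PROVIDER_MAP[choice]
    except KeyError:
        raise ValueError(f"unknown execution provider: {choice!r}") from None

# The session would then be created as (requires onnxruntime installed):
#   import onnxruntime as ort
#   session = ort.InferenceSession(model_path, providers=providers_for("cuda"))
```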
@dkosowski87 dkosowski87 mentioned this pull request Apr 7, 2026
