[DEP-291] NMS params passed at runtime for NMS-fused ONNX graph#2195
Draft
dkosowski87 wants to merge 20 commits intomainfrom
Draft
[DEP-291] NMS params passed at runtime for NMS-fused ONNX graph#2195dkosowski87 wants to merge 20 commits intomainfrom
dkosowski87 wants to merge 20 commits intomainfrom
Conversation
…ment variables in utils/environment.py
…ional default value in utils/environment.py
…ration.py, utilizing new environment variable retrieval functions for strings and comma-separated lists.
…le input names and validating against declared fused NMS input names. Update forward method to dynamically build input tensors based on configuration, improving flexibility and error handling.
… and logging. Update input name checks for fused NMS and modify forward method to conditionally build input tensors based on provided parameters, improving flexibility in model configuration.
…he script includes command-line options for specifying image and model paths, as well as parameters for confidence, IOU threshold, and maximum detections. It handles image loading, model initialization, and outputs detection results with bounding box coordinates and confidence scores.
…he script includes command-line options for specifying image and model paths, as well as parameters for confidence, IOU threshold, and maximum detections. It handles image loading, model initialization, and outputs detection results with bounding box coordinates and confidence scores.
…inference method to utilize the new post_process_nms_fused_model_output function for improved detection results when fused NMS is enabled.
…mmand-line parameters for specifying the number of benchmark iterations and warmup runs, allowing users to measure inference latency with mean, median, and standard deviation statistics. Enhanced the main function to handle these new options while maintaining existing inference functionality.
…ns to generate and write latency and prediction reports in JSON format, including detailed statistics on inference latencies. Updated command-line options to specify target directory for output files, improving usability and organization of results.
…MS inference script. Updated output file paths to include run name as a subdirectory, enhancing organization of results and improving usability for reporting.
…troduced a new command-line option for selecting ONNX Runtime execution providers (cpu, cuda, tensorrt) and updated the model loading process to utilize the selected provider and device. Enhanced latency reporting to include execution provider details, improving configurability and transparency in inference performance.
…uced a new command-line option for specifying batch size, allowing users to duplicate the input image for batched inference. Updated the main function and related methods to handle batch inputs, enhancing performance measurement and flexibility in inference execution.
…ages. Updated command-line options to accept an image directory instead of a single image path, enabling batch processing of multiple images. Enhanced latency reporting with additional metrics and improved JSON output structure for better organization of results.
…ency reporting. Introduced a constant for test batch size, enhanced JSON output to include image paths, and updated command-line options for benchmark iterations and warmup runs. Refactored image loading logic to support dynamic batching based on model configuration.
…-element tensors for confidence, IOU threshold, and max detections. This change improves compatibility with the input builders and streamlines the inference process.
…ludes command-line options for image and model paths, as well as parameters for confidence, IOU threshold, and maximum detections. Handles image loading, model initialization, and outputs detection results with bounding box coordinates and confidence scores.
…. Includes command-line options for run name, image directory, model path, target directory, and parameters for confidence, IOU threshold, and maximum detections. Implements latency reporting and JSON output for detailed performance metrics, enhancing usability and configurability for inference tasks.
… error messaging. Updated input validation to allow tensors with dimension 0 equal to 1 and enhanced error messages for incompatible batch sizes. Adjusted batch processing logic to ensure compatibility with dynamic input sizes in YOLOv8 model.
…. Introduced a new command-line option for selecting execution providers (cpu, cuda, tensorrt) and updated model loading to utilize the selected provider and device. Enhanced logging to include provider details for improved configurability and transparency in inference performance.
11 tasks
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Work in progress ...
Task: DEP-291
roboflow-train related PR