Skip to content

[A4X TensorRT Inference Benchmark] Update A4X GCSFuse storage recipe to use latest profiled-based configs#174

Open
lepan-google wants to merge 2 commits intoAI-Hypercomputer:mainfrom
lepan-google:a4x-tensorrt-storage-update
Open

[A4X TensorRT Inference Benchmark] Update A4X GCSFuse storage recipe to use latest profiled-based configs#174
lepan-google wants to merge 2 commits intoAI-Hypercomputer:mainfrom
lepan-google:a4x-tensorrt-storage-update

Conversation

@lepan-google
Copy link
Copy Markdown
Contributor

@lepan-google lepan-google commented Mar 28, 2026

GCSFuse released a latest fix for the TensorRT benchmark, they also finalized the profiled-based configs which are recommended to the GPU users.

Per test results, we update the existing recipe to use the configs with the best performance.

TESTED=local tests(manifests, logs)

Comment on lines +10 to +11
- [Single Host Model Serving with NVIDIA TensorRT-LLM (TRT-LLM) and Google Cloud Storage on A4X GKE Node Pool](#single-host-model-serving-with-nvidia-tensorrt-llm-trt-llm-and-google-cloud-storage-on-a4x-gke-node-pool)
- [Table of Contents](#table-of-contents)
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are these needed? https://screenshot.googleplex.com/4AaDwcrTZHxo288 (Looks a bit redundant with table of contents above)

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This table of contents is automatically generated and updated by our formatter. I have to revert it every time I modify the file, so I was thinking of just leaving it as is. However, I can revert the change again if we want to avoid those first two lines.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remove the first two titles and save without formatting. Could you please take a look again? Thank you!

@lepan-google lepan-google requested a review from mkmg March 31, 2026 19:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants