[A4X TensorRT Inference Benchmark] Update A4X GCSFuse storage recipe to use latest profiled-based configs by lepan-google · Pull Request #174 · AI-Hypercomputer/gpu-recipes

lepan-google · 2026-03-28T04:22:42Z

GCSFuse released a new fix for the TensorRT benchmark and finalized the profile-based configs, which are recommended for GPU users.

Based on the test results, this PR updates the existing recipe to use the best-performing configs.

TESTED=local tests(manifests, logs)

mkmg · 2026-03-30T22:14:08Z

inference/a4x/single-host-serving/tensorrt-llm-gcs/README.md

+- [Single Host Model Serving with NVIDIA TensorRT-LLM (TRT-LLM) and Google Cloud Storage on A4X GKE Node Pool](#single-host-model-serving-with-nvidia-tensorrt-llm-trt-llm-and-google-cloud-storage-on-a4x-gke-node-pool)
+  - [Table of Contents](#table-of-contents)


Are these needed? https://screenshot.googleplex.com/4AaDwcrTZHxo288 (Looks a bit redundant with table of contents above)

lepan-google · 2026-03-30

This table of contents is automatically generated and updated by our formatter. I have to revert it every time I modify the file, so I was thinking of just leaving it as is. However, I can revert the change again if we want to avoid those first two lines.

lepan-google · 2026-03-31

I removed the first two titles and saved the file without formatting. Could you please take another look? Thank you!

[A4X TensorRT Inference Benchmark] Update A4X GCSFuse storage recipe to use latest profiled-based configs

86d00cf

lepan-google closed this Mar 28, 2026

lepan-google reopened this Mar 28, 2026

Chris113113 requested review from Chris113113 and mkmg March 30, 2026 18:45

Chris113113 approved these changes Mar 30, 2026

mkmg reviewed Mar 30, 2026

Revert the auto formatting

0f7cb6a

lepan-google requested a review from mkmg March 31, 2026 19:04

lepan-google wants to merge 2 commits into AI-Hypercomputer:main from lepan-google:a4x-tensorrt-storage-update
