Skip to content

Optimize fused gemm+dequant kernel for ROCm, use it for batch sizes o…

6aeade4
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Open

[ROCm] Optimize kgemm_4bit_inference_naive for ROCm, use it for batch sizes other than 1 #1920

Optimize fused gemm+dequant kernel for ROCm, use it for batch sizes o…
6aeade4
Select commit
Loading
Failed to load commit list.

Annotations

1 warning
CPU (macos, 2.3.1) / build
succeeded Apr 13, 2026 in 20s