Added support for fp16 x fp8, bf16 x fp8, fp8 x fp16, and fp8 x bf16 for the V3 pipeline#6612

Open
SamiAario-AMD wants to merge 18 commits into develop from users/samaario/mixed-prec-fp16-bf16-x-fp8

@SamiAario-AMD SamiAario-AMD commented Apr 21, 2026

Motivation

Added support for fp16 x fp8, bf16 x fp8, fp8 x fp16, and fp8 x bf16 for the V3 pipeline

Technical Details

  • Added support for mixed precision transposed loads for the gfx950 architecture
  • Added PackNumAccess as a template parameter of WarpGemmAttributeMfma and WarpGemmAttributeMfmaIterateK, so that a compatible warp tile distribution encoding can be chosen explicitly when converting between precisions during a transposed load
  • Replaced the existing type conversion logic in the universal GEMM with pattern matching on the input types, which should make the conversion logic easier to follow
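
As a rough illustration of the last point, pattern matching on input types can be expressed in C++ with template specialization. The sketch below is not taken from this PR: the tag types, `ConversionRule`, `Widen`, and `needs_conversion` are hypothetical names, and it assumes that the fp8 operand is the one converted to the 16-bit operand's type. It does not model PackNumAccess or the warp tile distribution encoding.

```cpp
#include <type_traits>

// Hypothetical stand-in tags for the element types named in this PR.
// They are NOT the library's real fp16/bf16/fp8 types.
struct fp16_tag {};
struct bf16_tag {};
struct fp8_tag {};

// Which operand, if any, must be converted before the multiply.
enum class Widen { None, A, B };

// The primary template is left undefined on purpose: an unsupported
// (A, B) pair fails to compile instead of silently converting.
template <typename A, typename B>
struct ConversionRule;

// Same type on both sides: nothing to convert.
template <typename T>
struct ConversionRule<T, T>
{
    using compute_type = T;
    static constexpr Widen widen = Widen::None;
};

// Assumption: the fp8 operand is widened to the 16-bit operand's type.
template <>
struct ConversionRule<fp16_tag, fp8_tag>
{
    using compute_type = fp16_tag;
    static constexpr Widen widen = Widen::B;
};

template <>
struct ConversionRule<bf16_tag, fp8_tag>
{
    using compute_type = bf16_tag;
    static constexpr Widen widen = Widen::B;
};

template <>
struct ConversionRule<fp8_tag, fp16_tag>
{
    using compute_type = fp16_tag;
    static constexpr Widen widen = Widen::A;
};

template <>
struct ConversionRule<fp8_tag, bf16_tag>
{
    using compute_type = bf16_tag;
    static constexpr Widen widen = Widen::A;
};

// Convenience query: true when one of the operands needs a conversion.
template <typename A, typename B>
constexpr bool needs_conversion = ConversionRule<A, B>::widen != Widen::None;
```

With this layout each supported combination is one visible specialization, and a pair that is not listed (for example fp16 x bf16) is rejected at compile time because the primary template has no definition.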

Test Plan

Added test cases to the V3 universal GEMM tests for all data type and layout combinations that were added

Test Result

Added tests pass validation on the gfx950 platform

Submission Checklist

@SamiAario-AMD
Contributor Author

Note that the first three commits are also included in this PR: #5510
I added them here as well so that the approval process for that PR won't stall the approval of this more important one.

SamiAario-AMD force-pushed the users/samaario/mixed-prec-fp16-bf16-x-fp8 branch from 6c12c0d to b532f32 on April 23, 2026 09:48
SamiAario-AMD force-pushed the users/samaario/mixed-prec-fp16-bf16-x-fp8 branch 3 times, most recently from 6826882 to 74751db on May 6, 2026 08:18
SamiAario-AMD force-pushed the users/samaario/mixed-prec-fp16-bf16-x-fp8 branch from 32d7de5 to 753b287 on May 8, 2026 12:08