Create col-major and tma-aligned x_scale for deep_gemm.gemm_fp8_fp8_bf16_nt (#4515)

Co-authored-by: Zhang Kaihong <zhangkaihong.zkh@alibaba-inc.com>
This commit is contained in:
strgrb
2025-03-19 15:02:43 +08:00
committed by GitHub
parent 90532b7627
commit f9c53cbb42
3 changed files with 28 additions and 9 deletions