Create col-major and tma-aligned x_scale for deep_gemm.gemm_fp8_fp8_bf16_nt (#4515)
Co-authored-by: Zhang Kaihong <zhangkaihong.zkh@alibaba-inc.com>
This commit is contained in:
2
sgl-kernel/3rdparty/deepgemm
vendored
2
sgl-kernel/3rdparty/deepgemm
vendored
Submodule sgl-kernel/3rdparty/deepgemm updated: bd2a775528...3b3783d06c
Reference in New Issue
Block a user