hlu1
|
7a16db9bd9
|
Make sm100 fp8 kernels available on sm103 (#9789)
Signed-off-by: Hao Lu <14827759+hlu1@users.noreply.github.com>
|
2025-08-28 23:47:29 -07:00 |
|
Yuan Luo
|
432f2053dd
|
[sgl-kernel] 1/N Refactor sglang cutlass 3x - gemm fp8 blockwise sm90 (#8913)
Co-authored-by: luoyuan.luo <luoyuan.luo@antgroup.com>
|
2025-08-14 10:55:54 -07:00 |
|
triple-mu
|
444013585d
|
Fix typos and unify size(s)/stride(s) API calls (#8799)
|
2025-08-08 00:18:08 -07:00 |
|
Elfie Guo
|
c23a7072b6
|
Upgrade CUTLASS 4.0 (#6336)
Co-authored-by: zhyncs <me@zhyncs.com>
|
2025-05-15 17:42:23 -07:00 |
|
Yineng Zhang
|
6f56614734
|
chore: upgrade cutlass 3.9.2 (#6004)
Co-authored-by: yizhang2077 <1109276519@qq.com>
|
2025-05-06 13:34:08 -07:00 |
|
Xiaoyu Zhang
|
5bb0accbcf
|
cutlass 3.9 supported to improve fp8_blockwise_gemm (#5820)
|
2025-04-28 21:52:36 -07:00 |
|
Yi Zhang
|
ebf495f013
|
sgl-kernel use cutlass latest version for fp8 blockwise gemm (#5207)
|
2025-04-09 11:47:04 -07:00 |
|
Elfie Guo
|
7c86671131
|
Support Blackwell Block Scale FP8 Gemm (#4278)
|
2025-03-12 14:17:11 -07:00 |
|
Lianmin Zheng
|
8abf74e3c9
|
Rename files in sgl kernel to avoid nested folder structure (#4213)
Co-authored-by: zhyncs <me@zhyncs.com>
|
2025-03-08 22:54:51 -08:00 |
|