Commit Graph

6 Commits

Author SHA1 Message Date
fzyzcjy
19995dd78e Tiny fix cutlass_mla_get_workspace_size stub incorrect signature (#7057) 2025-06-10 12:27:57 -07:00
JieXin Liang
18efb5e8e0 [perf][sgl-kernel] extend cutlass_mla_decode to support num_head < 128 (#6929) 2025-06-08 19:37:34 -07:00
Trevor Morris
0ab3f437ab Cutlass MLA: Disable split kv due to https://github.com/NVIDIA/cutlass/issues/2274 (#6101) 2025-05-08 18:44:30 -07:00
PGFLMG
f6f96b0521 [sgl-kernel] fix: fix cu118 compile error (#6123)
Co-authored-by: zhyncs <me@zhyncs.com>
2025-05-08 14:26:51 -07:00
Yineng Zhang
812e82f35e fix: solve cu118 issue for cutlass mla (#5331) 2025-04-12 12:51:09 -07:00
Trevor Morris
f65b8d5c89 Blackwell Cutlass MLA kernel (#5142) 2025-04-11 22:16:51 -07:00