Commit Graph

13 Commits

Author SHA1 Message Date
Yineng Zhang
e81d7f11de add tensorrt_llm moe_gemm as 3rdparty (#3217) 2025-01-30 23:49:14 +08:00
Yineng Zhang
222ce6f1da add tensorrt_llm common and cutlass_extensions as 3rdparty (#3216)
Co-authored-by: BBuf <35585791+BBuf@users.noreply.github.com>
2025-01-30 23:04:41 +08:00
Yineng Zhang
c38b5fb4f4 update 3rdparty and rms norm for sgl-kernel (#3213) 2025-01-30 19:32:21 +08:00
Byron Hsu
fb11a43981 [kernel] Integrate flashinfer's rope with higher precision and better perf (#3134) 2025-01-27 15:28:00 +08:00
Yineng Zhang
14e754a868 chore: bump v0.0.2.post17 for sgl-kernel (#3125) 2025-01-25 20:43:02 +08:00
Yineng Zhang
153b414e83 minor: sync flashinfer and add turbomind as 3rdparty (#3105) 2025-01-24 19:22:39 +08:00
Yineng Zhang
0da0989ad4 sync flashinfer and update sgl-kernel tests (#3081) 2025-01-23 21:13:55 +08:00
Yineng Zhang
bcda0c9ee6 sync the upstream updates of flashinfer (#3051) 2025-01-22 20:33:13 +08:00
Yineng Zhang
5a0d680a14 feat: add flashinfer as 3rdparty and use rmsnorm as example (#3033) 2025-01-21 20:44:49 +08:00
Yineng Zhang
d33cbb7e58 remove cub and add cccl (#2976) 2025-01-19 15:51:27 +08:00
Yineng Zhang
e2cdc8a5b5 upgrade cutlass v3.7.0 (#2967) 2025-01-18 23:37:42 +08:00
Xiaoyu Zhang
f005758f2b introduce CUB in sgl-kernel (#2887) 2025-01-14 19:48:59 +08:00
Ke Bao
b4403985d0 Add cutlass submodule for sgl-kernel (#2676) 2024-12-31 14:28:29 +08:00