yinfan98
|
4db29e82ec
|
[Feat] support deepgemm for cmake (#4864)
|
2025-03-28 10:51:44 -07:00 |
|
Yineng Zhang
|
8bf6d7f406
|
support cmake for sgl-kernel (#4706)
Co-authored-by: hebiao064 <hebiaobuaa@gmail.com>
Co-authored-by: yinfan98 <1106310035@qq.com>
|
2025-03-27 01:42:28 -07:00 |
|
Yineng Zhang
|
7596417732
|
minor: use bear for compilation database (#2919)
|
2025-01-16 18:39:11 +08:00 |
|
Xiaoyu Zhang
|
f005758f2b
|
introduce CUB in sgl-kernel (#2887)
|
2025-01-14 19:48:59 +08:00 |
|
Xiaoyu Zhang
|
e2b16c4716
|
add sampling_scaling_penalties kernel (#2846)
|
2025-01-12 19:38:17 -08:00 |
|
Ke Bao
|
0f3eb1d294
|
Support cutlass Int8 gemm (#2752)
|
2025-01-06 22:51:22 +08:00 |
|
Yineng Zhang
|
b6b57fc200
|
minor: cleanup sgl-kernel (#2679)
|
2024-12-31 14:52:00 +08:00 |
|
Ke Bao
|
b4403985d0
|
Add cutlass submodule for sgl-kernel (#2676)
|
2024-12-31 14:28:29 +08:00 |
|
Ke Bao
|
b02da24a5b
|
Refactor sgl-kernel build (#2642)
|
2024-12-30 18:07:01 +08:00 |
|
yizhang2077
|
e04d3f2897
|
adapt tensorrt llm custom all reduce to sgl-kernel (#2481)
Co-authored-by: Yineng Zhang <me@zhyncs.com>
|
2024-12-15 13:15:59 +08:00 |
|
Yineng Zhang
|
7301a39b13
|
fix: resolve CodeQL cpp issue (#2305)
|
2024-12-01 23:55:19 +08:00 |
|