Logo
Explore Help
Register Sign In
EngineX-Hygon/sglang
5
0
Fork 0
You've already forked sglang
Code Issues Pull Requests Actions 7 Projects Releases Wiki Activity
Files
b371f7cd3626cd1ef8fa0b61b5d34b3fa6c4d47e
sglang/sgl-kernel/csrc
History
Yineng Zhang 812e82f35e fix: solve cu118 issue for cutlass mla (#5331)
2025-04-12 12:51:09 -07:00
..
allreduce
sgl-kernel transfer custom allreduce from trt kernel to vllm kernel (#5079)
2025-04-05 14:23:20 -07:00
attention
fix: solve cu118 issue for cutlass mla (#5331)
2025-04-12 12:51:09 -07:00
cpu
Add optimized native kernels in sgl-kernel (#5150)
2025-04-08 09:37:46 -07:00
cutlass_extensions
sgl-kernel use cutlass latest version for fp8 blockwise gemm (#5207)
2025-04-09 11:47:04 -07:00
elementwise
Optimize rope in sgl kernel (#4267)
2025-03-10 10:07:45 -07:00
gemm
fix: remove cublas_grouped_gemm (#5307)
2025-04-11 16:22:37 -07:00
moe
reduce moe_align_block_size_kernel small batch mode overhead (#5086)
2025-04-09 17:59:35 -07:00
speculative
[ROCm] Enable MTP (NextN) on AMD GPU (#4631)
2025-03-23 22:58:05 -07:00
common_extension.cc
[Feat] Add sparse attn to sgl-kernel (#5327)
2025-04-12 11:36:36 -07:00
flash_extension.cc
[Fix] fix fa3 build at cu118 (#5036)
2025-04-03 11:52:35 -07:00
torch_extension_rocm.cc
update variable naming and comments for rocm (#5299)
2025-04-11 23:15:05 -07:00
Powered by Gitea Version: 1.24.3 Page: 113ms Template: 8ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API