EngineX-Hygon/sglang
bd75690f4eef6b3140162a8b03af1f0a96a5e358
sglang/sgl-kernel/csrc/gemm
Pavani Majety eb38c7d1ca [1/2] Add Kernel support for Cutlass based Fused FP4 MoE (#6093)
Signed-off-by: Pavani Majety <pmajety@nvidia.com>
2025-06-02 13:48:03 -07:00
awq_kernel.cu
fix sgl-kernel cu118 build (#4872)
2025-03-28 17:23:51 -07:00
bmm_fp8.cu
…
fp8_blockwise_gemm_kernel.cu
Upgrade CUTLASS 4.0 (#6336)
2025-05-15 17:42:23 -07:00
fp8_gemm_kernel.cu
…
int8_gemm_kernel.cu
Fix shared memory OOM on sm86 GPUs. (#4797)
2025-03-26 10:41:53 -07:00
nvfp4_expert_quant.cu
[1/2] Add Kernel support for Cutlass based Fused FP4 MoE (#6093)
2025-06-02 13:48:03 -07:00
nvfp4_quant_entry.cu
[1/2] Add Kernel support for Cutlass based Fused FP4 MoE (#6093)
2025-06-02 13:48:03 -07:00
nvfp4_quant_kernels.cu
fix sgl-kernel cu118 build (#4872)
2025-03-28 17:23:51 -07:00
nvfp4_scaled_mm_entry.cu
…
nvfp4_scaled_mm_kernels.cu
[Build] Fix cuda12.8 build error in nvfp4_scaled_mm_kernels.cu (#4953)
2025-03-31 12:00:34 -07:00
per_tensor_quant_fp8.cu
…
per_token_group_quant_8bit.cu
[sgl-kernel] per token group quant support COLUMN MAJOR (#4817)
2025-04-02 18:29:59 -07:00
per_token_quant_fp8.cu
…
qserve_w4a8_per_chn_gemm.cu
[1/2] Support Qserve (#6457)
2025-05-21 19:48:59 -07:00
qserve_w4a8_per_group_gemm.cu
[1/2] Support Qserve (#6457)
2025-05-21 19:48:59 -07:00