EngineX-Hygon/sglang
cbb5fc2edc3165a8745edf90e674c0e4ad6336eb
sglang/sgl-kernel/csrc/gemm
fzyzcjy 21337b22b9 Reland [1/2] Optimizations and refactors about quant kernel (#10312)
...
Co-authored-by: Yineng Zhang <me@zhyncs.com>
2025-10-11 15:59:03 +08:00
gptq
…
marlin
…
awq_kernel.cu
…
bmm_fp8.cu
Move rope and bmm into sgl-kernel (#4241)
2025-03-09 18:38:15 -07:00
dsv3_fused_a_gemm.cu
…
dsv3_router_gemm_bf16_out.cu
…
dsv3_router_gemm_entry.cu
…
dsv3_router_gemm_float_out.cu
[Kimi K2] dsv3_router_gemm supports NUM_EXPERTS == 384 (#8013)
2025-08-01 22:01:24 +08:00
fp8_blockwise_gemm_kernel.cu
…
fp8_gemm_kernel.cu
…
int8_gemm_kernel.cu
…
math.hpp
…
nvfp4_expert_quant.cu
…
nvfp4_quant_entry.cu
…
nvfp4_quant_kernels.cu
…
nvfp4_quant.cuh
…
nvfp4_scaled_mm_entry.cu
…
nvfp4_scaled_mm_kernels.cu
Optimize nvfp4 block scaled gemm kernel when M is small. (#10101)
2025-09-06 22:31:00 -07:00
per_tensor_quant_fp8.cu
…
per_token_group_quant_8bit_v2.cu
Reland [1/2] Optimizations and refactors about quant kernel (#10312)
2025-10-11 15:59:03 +08:00
per_token_group_quant_8bit.cu
Reland [1/2] Optimizations and refactors about quant kernel (#10312)
2025-10-11 15:59:03 +08:00
per_token_quant_fp8.cu
…
qserve_w4a8_per_chn_gemm.cu
…
qserve_w4a8_per_group_gemm.cu
…