This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX-Hygon
/
sglang
Watch
5
Star
0
Fork
0
You've already forked sglang
Code
Issues
Pull Requests
Actions
7
Projects
Releases
Wiki
Activity
Files
7bc5fb0d78c183186e352cdc8f614a01ac2829d4
sglang
/
sgl-kernel
/
csrc
/
moe
/
cutlass_moe
/
w4a8
History
Rain Jiang
2286e85e77
pass a_scale from fp8 quant result instead of hard code to 1.0f (
#10241
)
...
Co-authored-by: Yichen Wang <
yichen.wang@bytedance.com
> Co-authored-by: Jinwu Guo <
641876696@qq.com
>
2025-09-10 12:56:05 -07:00
..
scaled_mm_entry.cu
[1/n]: add cutlass W4A8 moe kernel for hopper architecture (
#7772
)
2025-07-04 20:50:12 -07:00
w4a8_get_group_starts.cuh
[feat] Support tp mode for DeepSeek-R1-W4AFP8 (
#8118
)
2025-09-01 22:17:26 -07:00
w4a8_grouped_mm_c3x.cu
[feat] Support tp mode for DeepSeek-R1-W4AFP8 (
#8118
)
2025-09-01 22:17:26 -07:00
w4a8_grouped_mm_c3x.cuh
pass a_scale from fp8 quant result instead of hard code to 1.0f (
#10241
)
2025-09-10 12:56:05 -07:00
w4a8_moe_data.cu
[1/n]: add cutlass W4A8 moe kernel for hopper architecture (
#7772
)
2025-07-04 20:50:12 -07:00