This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX-Hygon
/
sglang
Watch
5
Star
0
Fork
0
You've already forked sglang
Code
Issues
Pull Requests
Actions
7
Projects
Releases
Wiki
Activity
Files
f2a75a66c4e5bd007ce0eabd4fb16edade751e6c
sglang
/
sgl-kernel
/
csrc
/
moe
History
Elfie Guo
3e56f557fd
Add a CUDA kernel for fusing mapping and weighted sum for MoE. (
#6916
)
...
Co-authored-by: Elfie Guo <
elfiegxf@gmail.com
>
2025-06-07 15:24:39 -07:00
..
cutlass_moe_helper.cu
[1/2] Add FP8 Blockscale MoE CUTLASS kernel for Blackwell (
#5281
)
2025-04-22 22:28:20 -07:00
ep_moe_reorder_kernel.cu
[EP] Add cuda kernel for moe_ep_post_reorder (
#6837
)
2025-06-05 00:33:47 -07:00
fp8_blockwise_moe_kernel.cu
Add a CUDA kernel for fusing mapping and weighted sum for MoE. (
#6916
)
2025-06-07 15:24:39 -07:00
moe_align_kernel.cu
reduce torch.zeros overhead in moe align block size kernel (
#6369
)
2025-06-07 02:47:36 -07:00
moe_fused_gate.cu
Set
num_fused_shared_experts
as
num_shared_experts
when shared_experts fusion is not disabled (
#6736
)
2025-06-04 15:53:22 -07:00
moe_topk_softmax_kernels.cu
Add moe topk softmax templated from vllm (
#4302
)
2025-03-14 12:03:33 -07:00
nvfp4_blockwise_moe.cu
[1/2] Add Kernel support for Cutlass based Fused FP4 MoE (
#6093
)
2025-06-02 13:48:03 -07:00
prepare_moe_input.cu
Add a CUDA kernel for fusing mapping and weighted sum for MoE. (
#6916
)
2025-06-07 15:24:39 -07:00