sglang/moe at e1ce44cdb1e427c8dcbfd42f7181d08be8484b15 - sglang - Gitea: Git with a cup of tea

EngineX-Hygon/sglang

Files

History

Xiaoyu Zhang 8b5f83ed3b reduce torch.zeros overhead in moe align block size kernel (#6369 )

2025-06-07 02:47:36 -07:00

..

cutlass_moe_helper.cu

[1/2] Add FP8 Blockscale MoE CUTLASS kernel for Blackwell (#5281 )

2025-04-22 22:28:20 -07:00

ep_moe_reorder_kernel.cu

[EP] Add cuda kernel for moe_ep_post_reorder (#6837 )

2025-06-05 00:33:47 -07:00

fp8_blockwise_moe_kernel.cu

[2/2] Add python wrapper for CUTLASS FP8 Blockscale MoE Kernel. (#5694 )

2025-05-16 13:14:07 -07:00

moe_align_kernel.cu

reduce torch.zeros overhead in moe align block size kernel (#6369 )

2025-06-07 02:47:36 -07:00

moe_fused_gate.cu

Set num_fused_shared_experts as num_shared_experts when shared_experts fusion is not disabled (#6736 )

2025-06-04 15:53:22 -07:00

moe_topk_softmax_kernels.cu

Add moe topk softmax templated from vllm (#4302 )

2025-03-14 12:03:33 -07:00

nvfp4_blockwise_moe.cu

[1/2] Add Kernel support for Cutlass based Fused FP4 MoE (#6093 )

2025-06-02 13:48:03 -07:00

prepare_moe_input.cu

[1/2] Add Kernel support for Cutlass based Fused FP4 MoE (#6093 )

2025-06-02 13:48:03 -07:00