sglang

Rain Jiang 2286e85e77 pass a_scale from fp8 quant result instead of hard code to 1.0f (#10241)

Co-authored-by: Yichen Wang <yichen.wang@bytedance.com>
Co-authored-by: Jinwu Guo <641876696@qq.com>

2025-09-10 12:56:05 -07:00

2025-09-10 12:56:05 -07:00

2025-08-28 10:18:03 -07:00

cutlass_moe_helper.cu          2025-07-30 19:49:35 -07:00
fp8_blockwise_moe_kernel.cu    2025-08-28 23:47:29 -07:00
moe_align_kernel.cu            2025-08-18 16:53:44 -07:00
moe_fused_gate.cu              2025-08-12 20:12:38 -07:00
moe_topk_softmax_kernels.cu    2025-08-28 10:18:03 -07:00
nvfp4_blockwise_moe.cu         2025-06-02 13:48:03 -07:00
prepare_moe_input.cu           2025-07-04 23:23:30 -07:00