sglang

Files

Pavani Majety eb38c7d1ca [1/2] Add Kernel support for Cutlass based Fused FP4 MoE (#6093)

Signed-off-by: Pavani Majety <pmajety@nvidia.com>

2025-06-02 13:48:03 -07:00

awq_kernel.cu                  2025-03-28 17:23:51 -07:00
bmm_fp8.cu                     2025-03-09 18:38:15 -07:00
fp8_blockwise_gemm_kernel.cu   2025-05-15 17:42:23 -07:00
fp8_gemm_kernel.cu             2025-03-20 12:40:28 -07:00
int8_gemm_kernel.cu            2025-03-26 10:41:53 -07:00
nvfp4_expert_quant.cu          2025-06-02 13:48:03 -07:00
nvfp4_quant_entry.cu           2025-06-02 13:48:03 -07:00
nvfp4_quant_kernels.cu         2025-03-28 17:23:51 -07:00
nvfp4_scaled_mm_entry.cu       2025-03-24 19:50:23 -07:00
nvfp4_scaled_mm_kernels.cu     2025-03-31 12:00:34 -07:00
per_tensor_quant_fp8.cu        2025-03-22 00:37:57 -07:00
per_token_group_quant_8bit.cu  2025-04-02 18:29:59 -07:00
per_token_quant_fp8.cu         2025-03-22 00:37:57 -07:00
qserve_w4a8_per_chn_gemm.cu    2025-05-21 19:48:59 -07:00
qserve_w4a8_per_group_gemm.cu  2025-05-21 19:48:59 -07:00