sglang/kernels at 9eefe2c0b71c995383cd94ddc81f2016a4e93ace - sglang - Gitea: Git with a cup of tea

EngineX-Hygon/sglang

Files

History

Cheng Wan 3c06b673af [8/N] MoE Refactor: deprecate EPMoE (#11211 )

2025-10-07 21:51:41 -07:00

..

[Feat] Support Torch Symm Mem AllReduce (#10571 )

2025-10-05 13:55:19 -07:00

decoding_attention_triton

[CI] Remove unused imports with Ruff to pre-commit config, only to benchmarks/docs/examples folder (#3969 )

2025-03-27 19:45:02 -07:00

Support tuning DeepEP configs (#6742 )

2025-05-29 08:12:22 -07:00

Restruct sgl-kernel benchmark (#10861 )

2025-09-25 07:45:25 +08:00

[sgl-kernel] Optimize concat_mla_k kernel (#10543 )

2025-09-28 23:04:22 +08:00

flashinfer_allreduce_fusion

[benchmark] add flashinfer_allreduce_fusion benchmark (#9937 )

2025-09-03 16:31:01 +08:00

fused_moe_triton

[sgl-kernel] Support float64 moe_sum_reduce cuda kernel (#11068 )

2025-10-07 14:31:11 +00:00

minmax-text-01-lightning_attention

[CI] Remove unused imports with Ruff to pre-commit config, only to benchmarks/docs/examples folder (#3969 )

2025-03-27 19:45:02 -07:00

[NVIDIA] [2/N] Optimize silu_and_mul_scaled_fp4_grouped_quant perf (#9556 )

2025-08-29 17:17:03 -07:00

scheduler_batch

[test] add ut and bm for get_last_loc (#6746 )

2025-05-29 11:47:21 -07:00

sliding_window_attention_triton

Optimize triton swa kernel by skipping computation (#8860 )

2025-08-06 21:37:50 +08:00