sglang/kernels at c4e81e64fb4233fbd76b5bb24e5fe6dd2a87832e - sglang - Gitea: Git with a cup of tea

EngineX-Hygon/sglang

Files

History

Cheng Wan 5b214b50b6 [Refactor] move deep_gemm_wrapper out of quantization (#11784 )

2025-10-17 18:57:54 -07:00

..

[Feat] Support Torch Symm Mem AllReduce (#10571 )

2025-10-05 13:55:19 -07:00

decoding_attention_triton

[CI] Remove unused imports with Ruff to pre-commit config, only to benchmarks/docs/examples folder (#3969 )

2025-03-27 19:45:02 -07:00

Support tuning DeepEP configs (#6742 )

2025-05-29 08:12:22 -07:00

Restruct sgl-kernel benchmark (#10861 )

2025-09-25 07:45:25 +08:00

[sgl-kernel] Optimize concat_mla_k kernel (#10543 )

2025-09-28 23:04:22 +08:00

flashinfer_allreduce_fusion

[benchmark] add flashinfer_allreduce_fusion benchmark (#9937 )

2025-09-03 16:31:01 +08:00

fused_moe_triton

[sgl-kernel] Support float64 moe_sum_reduce cuda kernel (#11068 )

2025-10-07 14:31:11 +00:00

minmax-text-01-lightning_attention

[CI] Remove unused imports with Ruff to pre-commit config, only to benchmarks/docs/examples folder (#3969 )

2025-03-27 19:45:02 -07:00

[Refactor] move deep_gemm_wrapper out of quantization (#11784 )

2025-10-17 18:57:54 -07:00

scheduler_batch

[test] add ut and bm for get_last_loc (#6746 )

2025-05-29 11:47:21 -07:00

sliding_window_attention_triton

Optimize triton swa kernel by skipping computation (#8860 )

2025-08-06 21:37:50 +08:00