sglang/kernels at v0.5.4

EngineX-Hygon/sglang

Zhengyi Lai 81fd2b0ee0 fix(deepep): resolve benchmark failure on 4×IB-card setup by aligning tuning config with DeepEP commit bdd119f8 (#11965)

2025-10-22 21:20:54 -07:00

[Feat] Support Torch Symm Mem AllReduce (#10571)

2025-10-05 13:55:19 -07:00

decoding_attention_triton

[CI] Remove unused imports with Ruff to pre-commit config, only to benchmarks/docs/examples folder (#3969)

2025-03-27 19:45:02 -07:00

fix(deepep): resolve benchmark failure on 4×IB-card setup by aligning tuning config with DeepEP commit bdd119f8 (#11965)

2025-10-22 21:20:54 -07:00

Restruct sgl-kernel benchmark (#10861)

2025-09-25 07:45:25 +08:00

[sgl-kernel] Optimize concat_mla_k kernel (#10543)

2025-09-28 23:04:22 +08:00

flashinfer_allreduce_fusion

[benchmark] add flashinfer_allreduce_fusion benchmark (#9937)

2025-09-03 16:31:01 +08:00

fused_moe_triton

[sgl-kernel] Support float64 moe_sum_reduce cuda kernel (#11068)

2025-10-07 14:31:11 +00:00

minmax-text-01-lightning_attention

[lint] improve ruff check (#11922)

2025-10-22 11:32:50 +08:00

[Refactor] move deep_gemm_wrapper out of quantization (#11784)

2025-10-17 18:57:54 -07:00

scheduler_batch

[test] add ut and bm for get_last_loc (#6746)

2025-05-29 11:47:21 -07:00

sliding_window_attention_triton

Optimize triton swa kernel by skipping computation (#8860)

2025-08-06 21:37:50 +08:00