sglang/kernels at 64480df4950a2b120c905917bfedc785ef4c50eb - sglang

EngineX-Hygon/sglang

yiakwy-xpu-ml-framework-team 64480df495 [BUG] fix moe benchmark when bs*seq is small (#3382)

2025-02-08 15:39:44 +08:00

decoding_attention_triton

benchmark decoding attention kernel with cudnn (#2467)

2024-12-17 03:31:57 -08:00

fused_moe_triton

[BUG] fix moe benchmark when bs*seq is small (#3382)

2025-02-08 15:39:44 +08:00

minmax-text-01-lightning_attention

support lightning_attention_decode in sgl-kernel for MiniMax-Text-01 (#3030)

2025-01-23 15:29:20 +08:00

add tuning block wise fp8 (#3242)

2025-02-01 03:58:18 +08:00

[Benchmark] add a benchmark for hf/vllm/sglang rmsnorm (#2486)

2024-12-15 13:52:08 +08:00

scheduler_batch

[kernel optimize] benchmark write_req_to_token_pool_triton and optimize kernel (#2509)

2024-12-22 02:31:02 -08:00