sglang/kernels at 0f2a2e3c19efadd7b46f393a73f40ac6464f8f08

EngineX-Hygon/sglang

yych0745 6a02b32d07 Add A100 tuning configs for DeepSeek R1/V3 channel-wise INT8 (#4287)

Co-authored-by: HandH1998 <1335248067@qq.com>

2025-03-11 00:49:06 -07:00

decoding_attention_triton

benchmark decoding attention kernel with cudnn (#2467)

2024-12-17 03:31:57 -08:00

Optimize Triton Kernel of Group GEMM in DeepGEMM Benchmark (#4014)

2025-03-02 23:29:55 -08:00

fused_moe_triton

Add A100 tuning configs for DeepSeek R1/V3 channel-wise INT8 (#4287)

2025-03-11 00:49:06 -07:00

minmax-text-01-lightning_attention

support lightning_attention_decode in sgl-kernel for MiniMax-Text-01 (#3030)

2025-01-23 15:29:20 +08:00

Tuning Script for Feature DeepSeek V3/R1 INT8 Quantization (block-wise) (#3922)

2025-02-27 10:59:46 +00:00

[Benchmark] add a benchmark for hf/vllm/sglang rmsnorm (#2486)

2024-12-15 13:52:08 +08:00

scheduler_batch

[kernel optimize] benchmark write_req_to_token_pool_triton and optimize kernel (#2509)

2024-12-22 02:31:02 -08:00