sglang/kernels at 31dfff7da7ade6703303a67bfe6ef52ead97640a

EngineX-Hygon/sglang

Chunan Zeng 14269198e3 [Benchmark] tilelang vs deepgemm vs w8a8_block_fp8_matmul (#4735)

2025-03-24 20:56:31 -07:00

decoding_attention_triton

benchmark decoding attention kernel with cudnn (#2467)

2024-12-17 03:31:57 -08:00

[Benchmark] tilelang vs deepgemm vs w8a8_block_fp8_matmul (#4735)

2025-03-24 20:56:31 -07:00

fused_moe_triton

Correcting default configuration when benchmarking fused_moe (#4665)

2025-03-22 00:52:34 -07:00

minmax-text-01-lightning_attention

[Fix] use torch.cat instead of torch.concat to prevent entering the Autograd backends. (#4466)

2025-03-16 00:02:47 -07:00

Tuning Script for Feature DeepSeek V3/R1 INT8 Quantization (block-wise) (#3922)

2025-02-27 10:59:46 +00:00

[Benchmark] add a benchmark for hf/vllm/sglang rmsnorm (#2486)

2024-12-15 13:52:08 +08:00

scheduler_batch

[kernel optimize] benchmark write_req_to_token_pool_triton and optimize kernel (#2509)

2024-12-22 02:31:02 -08:00