EngineX-Hygon/sglang
c9565e49e71d2fb5ddf439da34cafdcd2f317656
sglang/benchmark/kernels
yigex fdf04a1426 [ROCm] Add ROCm tuning config to block gemm and Re-tune for AMD Radeon Graphics (#3418)
Co-authored-by: Bruce Xue <yigex@xilinx.com>
Co-authored-by: HAI <hixiao@gmail.com>
2025-02-10 23:55:04 -08:00
Name                                Last commit                                                                                      Date
decoding_attention_triton           benchmark decoding attention kernel with cudnn (#2467)                                           2024-12-17 03:31:57 -08:00
fused_moe_triton                    refine some typo (#3473)                                                                         2025-02-10 23:35:44 +08:00
minmax-text-01-lightning_attention  support lightning_attention_decode in sgl-kernel for MiniMax-Text-01 (#3030)                     2025-01-23 15:29:20 +08:00
quantization                        [ROCm] Add ROCm tuning config to block gemm and Re-tune for AMD Radeon Graphics (#3418)          2025-02-10 23:55:04 -08:00
rmsnorm                             [Benchmark] add a benchmark for hf/vllm/sglang rmsnorm (#2486)                                   2024-12-15 13:52:08 +08:00
scheduler_batch                     [kernel optimize] benchmark write_req_to_token_pool_triton and optimize kernel (#2509)           2024-12-22 02:31:02 -08:00