sglang/benchmark at eefcbdd3533b065b950276ce23c8ab7a4f69bd99 - sglang - Gitea: Git with a cup of tea

EngineX-Hygon/sglang

Files

History

Xiaoyu Zhang bb418ced80 optimize per token group quant fp8 (#3490 )

2025-02-11 22:19:05 +08:00

..

bench_fp8_gemm.py

support w8a8 fp8 kernel with CUTLASS (#3047 )

2025-01-26 15:46:51 +08:00

bench_int8_gemm.py

Add shapes for int8 gemm benchmark (#3093 )

2025-01-24 12:27:30 +08:00

bench_lightning_attention_decode.py

support lightning_attention_decode in sgl-kernel for MiniMax-Text-01 (#3030 )

2025-01-23 15:29:20 +08:00

bench_per_token_group_quant_fp8.py

optimize per token group quant fp8 (#3490 )

2025-02-11 22:19:05 +08:00