This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX-Hygon
/
sglang
Watch
5
Star
0
Fork
0
You've already forked sglang
Code
Issues
Pull Requests
Actions
7
Projects
Releases
Wiki
Activity
2,284
Commits
8
Branches
0
Tags
ffa1b3e318c9d1342a5e430eb04df609e22a3775
Commit Graph
3 Commits
Author
SHA1
Message
Date
Stefan He
95085d65e9
[Refactor] Reducing code duplication across FP8 CUDA quantization kernels (
#4163
)
2025-03-06 22:58:52 -08:00
Xiaoyu Zhang
55a7ec388f
use warp shuffle style reduce and flashinfer vectorize (
#3628
)
2025-02-19 20:53:51 +08:00
Xiaoyu Zhang
bb418ced80
optimize per token group quant fp8 (
#3490
)
2025-02-11 22:19:05 +08:00