EngineX-Hygon/sglang
23632d350ce6df7b8d5bc6e3816031522c5022c9
sglang/sgl-kernel/python/sgl_kernel
Yineng Zhang e53df7c009 chore: bump sgl-kernel v0.3.12 (#10732) 2025-09-22 14:39:25 -07:00
testing
…
__init__.py
[sgl-kernel] Support moe_sum_reduce cuda kernel (#10321)
2025-09-19 14:12:09 +08:00
_fa4_interface.py
support using fa4 on deepseek on blackwell (#9928)
2025-09-16 16:16:06 -07:00
allreduce.py
…
attention.py
…
cutlass_moe.py
…
elementwise.py
[1/2] Speed up trtllm_mla attention backend (>10% e2e) (#10473)
2025-09-15 11:53:21 -07:00
flash_attn.py
Revert "Fix FA4 import cause moe_fused_gate output be illegal memory" (#10432)
2025-09-14 19:03:27 -07:00
fused_moe.py
…
gemm.py
…
grammar.py
…
kvcacheio.py
…
mamba.py
…
marlin.py
…
memory.py
…
moe.py
[sgl-kernel] Support moe_sum_reduce cuda kernel (#10321)
2025-09-19 14:12:09 +08:00
sampling.py
…
scalar_type.py
…
sparse_flash_attn.py
…
spatial.py
…
speculative.py
[Feature] Speculative decoding support lookahead (#9873)
2025-09-18 16:42:41 -07:00
top_k.py
…
utils.py
…
version.py
chore: bump sgl-kernel v0.3.12 (#10732)
2025-09-22 14:39:25 -07:00