EngineX-Hygon/sglang
sglang/sgl-kernel/csrc/attention (at 2f7420bc84beacb9b742dfd62eed29a7fb00a86b)
Trevor Morris | 0ab3f437ab | Cutlass MLA: Disable split kv due to https://github.com/NVIDIA/cutlass/issues/2274 (#6101) | 2025-05-08 18:44:30 -07:00
cascade.cu | feat: adapt merge_state (#5337) | 2025-04-12 21:14:04 -07:00
cutlass_mla_kernel.cu | Cutlass MLA: Disable split kv due to https://github.com/NVIDIA/cutlass/issues/2274 (#6101) | 2025-05-08 18:44:30 -07:00
lightning_attention_decode_kernel.cu | support cmake for sgl-kernel (#4706) | 2025-03-27 01:42:28 -07:00
merge_attn_states.cu | bugfix: fix merge_state_v2 cuda graph (#5419) | 2025-04-15 10:18:47 -07:00
vertical_slash_index.cu | [Feat] QWen-1M context support[1/2]: Update block sparse attention backend utils kernel (#5847) | 2025-04-28 11:03:17 -07:00