This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX-Hygon
/
sglang
Watch
5
Star
0
Fork
0
You've already forked sglang
Code
Issues
Pull Requests
Actions
7
Projects
Releases
Wiki
Activity
Files
6d55f60e7794de3859137340259782372236010f
sglang
/
sgl-kernel
/
csrc
/
attention
History
hlu1
4c22ebe2e8
Disable kernel cutlass_mla_decode on SM103 (
#10058
)
...
Signed-off-by: Hao Lu <
14827759+hlu1@users.noreply.github.com
>
2025-09-06 01:35:18 -07:00
..
cutlass_sm100_mla
[perf][sgl-kernel] extend cutlass_mla_decode to support num_head < 128 (
#6929
)
2025-06-08 19:37:34 -07:00
cascade.cu
feat: adapt merge_state (
#5337
)
2025-04-12 21:14:04 -07:00
cutlass_mla_kernel.cu
Disable kernel cutlass_mla_decode on SM103 (
#10058
)
2025-09-06 01:35:18 -07:00
lightning_attention_decode_kernel.cu
support cmake for sgl-kernel (
#4706
)
2025-03-27 01:42:28 -07:00
merge_attn_states.cu
bugfix: fix merge_state_v2 cuda graph (
#5419
)
2025-04-15 10:18:47 -07:00
vertical_slash_index.cu
[bugifx] QWen-1M context support[2/3] using current cuda stream in the DCA's kernel for bugfix. (
#8611
)
2025-07-31 22:41:39 +08:00