This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX-Hygon
/
sglang
Watch
5
Star
0
Fork
0
You've already forked sglang
Code
Issues
Pull Requests
Actions
7
Projects
Releases
Wiki
Activity
Files
229d2b95f19573ece9c1c5d6b357df9874e04f59
sglang
/
python
/
sglang
/
srt
/
model_executor
History
li-kesen
2bc61dd194
Remove hybrid_linear_attn attention backend and refactor attention registry (
#10816
)
...
Co-authored-by: Yi Zhang <
1109276519@qq.com
>
2025-09-30 10:16:16 +08:00
..
cpu_graph_runner.py
Add graph runner support with torch compile on CPU (
#7843
)
2025-09-07 21:33:58 -07:00
cuda_graph_runner.py
[Profile] dump memory trace when cuda graph profile is enabled (
#11083
)
2025-09-29 17:36:48 -07:00
forward_batch_info.py
Fix cutlass moe accuracy drop caused by attention UB from DP padding mode (
#10414
)
2025-09-13 22:29:09 -07:00
model_runner.py
Remove hybrid_linear_attn attention backend and refactor attention registry (
#10816
)
2025-09-30 10:16:16 +08:00
npu_graph_runner.py
[Ascend]optimize Qwen3 on Ascend (
#10574
)
2025-09-22 17:18:36 -07:00