EngineX-Hygon/sglang
efbae697b370a128a4a0354776f66057c205c994
sglang/python/sglang/srt/model_executor
Baizhou Zhang  efbae697b3  [Revision] Replace enable_flashinfer_mla argument with attention_backend (#5052)  2025-04-05 01:23:02 -07:00
cuda_graph_runner.py   [Fix] avoid stream sync and torch compile in prefill for fa3 backend (#4932)   2025-03-30 13:53:44 -07:00
forward_batch_info.py  FA3 Spec Decoding to support top k = 1 and add cuda graph support (#5050)   2025-04-04 23:03:59 -07:00
model_runner.py        [Revision] Replace enable_flashinfer_mla argument with attention_backend (#5052)   2025-04-05 01:23:02 -07:00