EngineX-Hygon/sglang
Commit: e4db4e5ba58c6e5c9850327fff8b34e5366dd925
Path: sglang/python/sglang/srt/layers
Latest commit: bbc07c4197 by Lianmin Zheng, "Move sampling logits to float32 (#773)", 2024-07-27 17:30:12 -07:00
Name                             Last commit                                       Date
quantization                     refactor model loader: initial refactor (#655)    2024-07-19 09:27:06 -07:00
context_flashattention_nopad.py  Remove cached triton launcher (#656)              2024-07-18 23:28:40 -07:00
extend_attention.py              Remove cached triton launcher (#656)              2024-07-18 23:28:40 -07:00
fused_moe.py                     Format (#593)                                     2024-07-05 10:06:17 -07:00
linear.py                        refactor model loader: initial refactor (#655)    2024-07-19 09:27:06 -07:00
logits_processor.py              Move sampling logits to float32 (#773)            2024-07-27 17:30:12 -07:00
radix_attention.py               Reduce hardcoded logic of kernel usage (#707)     2024-07-23 16:42:21 -07:00
token_attention.py               Remove cached triton launcher (#656)              2024-07-18 23:28:40 -07:00