EngineX-Hygon/sglang
0f2a2e3c19efadd7b46f393a73f40ac6464f8f08
sglang/python/sglang/srt/layers/moe
Ximingwang-09 | 0f2a2e3c19 | Add H20 tuning configs support DeepSeek V3/R1 INT8(block-wise) (#4220) | 2025-03-11 12:32:33 -07:00
Co-authored-by: ximing.wxm <ximing.wxm@antgroup.com>
ep_moe | Improve code styles (#4021) | 2025-03-03 03:20:23 -08:00
fused_moe_triton | Add H20 tuning configs support DeepSeek V3/R1 INT8(block-wise) (#4220) | 2025-03-11 12:32:33 -07:00
fused_moe_native.py | Support penalty in overlap mode; return logprob with chunked prefill; improve benchmark scripts (#3988) | 2025-03-03 00:12:04 -08:00
topk.py | feat: update grouped_topk to support softmax and sigmoid (#3680) | 2025-02-21 16:30:15 +08:00