EngineX-Hygon/sglang
ca75741e8605e1c823b7ccc8035ba0fa226a1676
sglang/python/sglang/srt/layers/moe
fzyzcjy  ca75741e86  Support async in DeepEP (#4610)
Co-authored-by: Cheng Wan <cwan39@gatech.edu>
2025-03-22 22:39:56 -07:00
ep_moe  Support async in DeepEP (#4610)  2025-03-22 22:39:56 -07:00
fused_moe_triton  Support serving DeepSeek-R1-Channel-INT8 with 32 L40S. (#4418)  2025-03-17 00:03:43 -07:00
fused_moe_native.py  Support penalty in overlap mode; return logprob with chunked prefill; improve benchmark scripts (#3988)  2025-03-03 00:12:04 -08:00
router.py  Add some fused elementwise kernels for grok-1 (#4398)  2025-03-13 13:39:10 -07:00
topk.py  Revert "feat: update grouped_topk to support softmax and sigmoid" (#4505)  2025-03-17 11:30:26 -07:00