EngineX-Hygon/sglang
sglang/python/sglang/srt/layers/moe/fused_moe_triton at commit 515ef4facbc89cd7c093c198386a8817fce856d6
Xiaoyu Zhang | 515ef4facb | Fuse routed scaling factor in topk_reduce kernel (#6220) | 2025-06-07 11:06:50 -07:00
configs | Add triton version as a fused_moe_triton config search key to avoid performance decrease in different Triton versions (#5955) | 2025-06-07 02:43:50 -07:00
__init__.py | Reorg moe code (#2563) | 2024-12-24 01:10:22 +08:00
fused_moe.py | Fuse routed scaling factor in topk_reduce kernel (#6220) | 2025-06-07 11:06:50 -07:00
layer.py | Fuse routed scaling factor in topk_reduce kernel (#6220) | 2025-06-07 11:06:50 -07:00