EngineX-Hygon/sglang
b4408e6098ca06026d8ff0a56fa86492c0e27b99
sglang/python/sglang/srt/layers/quantization/quark
Bowen Bao  cd4b39a900  [quantization] Properly ignore quantization for layers excluded in quant_config (#11205)  2025-10-07 14:06:05 -07:00
schemes       Optimized deepseek-v3/r1 model performance on mxfp4 run (#10008)                                   2025-09-04 15:11:22 -07:00
__init__.py   Support OCP MXFP4 quantization on AMD GPUs (#8255)                                                 2025-08-04 18:14:52 -07:00
quark_moe.py  [quantization] Enable aiter mxfp4 fused_moe for Quark (#10048)                                     2025-10-05 19:51:34 -07:00
quark.py      [quantization] Properly ignore quantization for layers excluded in quant_config (#11205)           2025-10-07 14:06:05 -07:00
utils.py      Optimized deepseek-v3/r1 model performance on mxfp4 run (#10008)                                   2025-09-04 15:11:22 -07:00