This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX-Hygon
/
sglang
Watch
5
Star
0
Fork
0
You've already forked sglang
Code
Issues
Pull Requests
Actions
7
Projects
Releases
Wiki
Activity
Files
1b2ff4fb7f05ec82128765c366e6f75f4e3f05f7
sglang
/
python
/
sglang
/
srt
/
layers
/
quantization
/
quark
History
Yineng Zhang
1b2ff4fb7f
Revert "Optimized deepseek-v3/r1 model performance on mxfp4 run (
#9671
)" (
#9959
)
2025-09-03 00:50:04 -07:00
..
schemes
Revert "Optimized deepseek-v3/r1 model performance on mxfp4 run (
#9671
)" (
#9959
)
2025-09-03 00:50:04 -07:00
__init__.py
Support OCP MXFP4 quantization on AMD GPUs (
#8255
)
2025-08-04 18:14:52 -07:00
quark_moe.py
Combine fp4.py and mxfp4.py into one file and support dynamic mxfp4 quantization in mxfp4.py (
#9049
)
2025-08-16 19:01:54 -07:00
quark.py
Combine fp4.py and mxfp4.py into one file and support dynamic mxfp4 quantization in mxfp4.py (
#9049
)
2025-08-16 19:01:54 -07:00
utils.py
Revert "Optimized deepseek-v3/r1 model performance on mxfp4 run (
#9671
)" (
#9959
)
2025-09-03 00:50:04 -07:00