This website requires JavaScript.
Explore
Help
Register
Sign In
EngineX-Hygon
/
sglang
Watch
5
Star
0
Fork
0
You've already forked sglang
Code
Issues
Pull Requests
Actions
7
Projects
Releases
Wiki
Activity
Files
287427e2e66aef4e4d857cfd666fe849e9f73617
sglang
/
python
/
sglang
/
srt
/
layers
/
quantization
History
Zhiyu
287427e2e6
Enable Nvidia's ModelOpt fp8 quantized models (
#2535
)
2025-01-06 14:54:52 -08:00
..
configs
Update Triton configs for block fp8 kernels (
#2641
)
2024-12-29 22:53:47 +08:00
__init__.py
Enable Nvidia's ModelOpt fp8 quantized models (
#2535
)
2025-01-06 14:54:52 -08:00
base_config.py
fix black in pre-commit (
#1940
)
2024-11-08 07:42:47 +08:00
fp8_kernel.py
Update Triton configs for block fp8 kernels (
#2641
)
2024-12-29 22:53:47 +08:00
fp8_utils.py
AMD: set weights and scaling numbers properly for block FP8 (
#2637
)
2024-12-29 03:23:39 -08:00
fp8.py
fix typo in python/sglang/srt/layers/quantization/fp8.py (
#2655
)
2024-12-29 23:45:02 -08:00