Yun Dai | 9731eca77b | [modelopt] automatically inspect if model is ModelOpt quantized and set quantization method (#5145) | 2025-04-07 22:12:11 -07:00
fzyzcjy | 15ddd84322 | Add retry for flaky tests in CI (#4755) | 2025-03-25 16:53:12 -07:00
Adarsh Shirawalmath | a2cc62a6db | [CI fix] test skipping modelopt on AMD (#4677) | 2025-03-22 12:36:02 -07:00
Yun Dai | 8cd4250401 | [quantization] fix channelwise conversion with scalar weight scale (#4596) | 2025-03-22 00:47:52 -07:00
Stefan He | ef3c2dd08e | Support Online Quantization for W8A8 (#4485) | 2025-03-17 00:28:56 -07:00
Lianmin Zheng | 2c4f5ccac1 | Fix minor style (#4460) | 2025-03-15 21:51:12 -07:00
HandH1998 | 2ac189edc8 | Amd test fp8 (#4261) | 2025-03-10 10:12:09 -07:00