Support OCP MXFP4 quantization on AMD GPUs (#8255)
Co-authored-by: wunhuang <wunhuang@amd.com> Co-authored-by: Hubert Lu <Hubert.Lu@amd.com>
This commit is contained in:
@@ -813,6 +813,7 @@ class ServerArgs:
|
||||
"moe_wna16",
|
||||
"qoq",
|
||||
"w4afp8",
|
||||
"mxfp4",
|
||||
],
|
||||
help="The quantization method.",
|
||||
)
|
||||
|
||||
Reference in New Issue
Block a user