Apply sgl w8a8 fp8 kernel (#3148)
This commit is contained in:
@@ -405,6 +405,7 @@ class ServerArgs:
|
||||
"gguf",
|
||||
"modelopt",
|
||||
"w8a8_int8",
|
||||
"w8a8_fp8",
|
||||
],
|
||||
help="The quantization method.",
|
||||
)
|
||||
|
||||
Reference in New Issue
Block a user