[Feature] Support AWQ MoE W4A16 Quantization (#142)
Signed-off-by: tangshiwen <tangshiwen@baidu.com> Co-authored-by: Li Wei <liwei.109@outlook.com>
This commit is contained in:
@@ -162,7 +162,7 @@ class KunlunFusedMoE(FusedMoE):
|
||||
if (self.quant_config is None) or (
|
||||
should_ignore_layer(
|
||||
prefix,
|
||||
ignore=self.quant_config.ignore,
|
||||
ignore=getattr(self.quant_config, "ignore", tuple()),
|
||||
fused_mapping=self.quant_config.packed_modules_mapping,
|
||||
)
|
||||
):
|
||||
|
||||
Reference in New Issue
Block a user