Files
Qwen3-32B.w8a8/recipe.yaml

9 lines
199 B
YAML
Raw Permalink Normal View History

default_stage:
default_modifiers:
SmoothQuantModifier: {smoothing_strength: 0.7}
GPTQModifier:
targets: [Linear]
ignore: [lm_head]
scheme: W8A8
dampening_frac: 0.01