xc-llm-ascend

Files

linfeng-yuan deceefd305 [releases/v0.18.0][bugfix][eplb] remove unnecessary weight_scale wrap behaviour (#7732 )

### What this PR does / why we need it?
This PR simplifies the apply method in w8a8_dynamic.py by removing the
conditional logic that used fused_w1_scale and fused_w2_scale based on
the fused_scale_flag. This redundant wrap behavior leads to EPLB break
in int8 quantization scenarios.

Cherry-picked from #7188. Note that only bugfix lines in that PR are
picked.

Signed-off-by: linfeng-yuan <1102311262@qq.com>

2026-03-30 16:16:03 +08:00

methods

[releases/v0.18.0][bugfix][eplb] remove unnecessary weight_scale wrap behaviour (#7732 )

2026-03-30 16:16:03 +08:00

__init__.py

[refactor] replace scattered business kwargs with typed request objects and explicit stage boundaries (#7024 )

2026-03-20 23:23:57 +08:00

compressed_tensors_config.py

[CI] Add pre-commit check for patch logger (#7446 )

2026-03-19 16:53:20 +08:00

method_adapters.py

[refactor] replace scattered business kwargs with typed request objects and explicit stage boundaries (#7024 )