xc-llm-ascend

22dimensions 0942d9aaab [3/N][Refactor][Quantization]remove packed_modules_mapping from models (#3021)

### What this PR does / why we need it?

Some custom models in vllm-ascend define their own `packed_modules_mapping`,
which prevents them from sharing the same model classes as the vLLM community.
This PR moves these custom `packed_modules_mapping` definitions to quant
utils.py. After this PR, some custom models can be removed.
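
As a rough illustration of the idea described above, the following minimal sketch keeps the packed-module mappings in one central registry keyed by model type, instead of on each model class. Everything in it is an assumption made for illustration: the registry name `PACKED_MODULES_MAPPING`, the model type keys, the mapping contents, and both helper functions are hypothetical and are not taken from the actual vllm-ascend code.

```python
from typing import Dict, List

# Hypothetical central registry (illustrative contents only).
# Each entry maps a fused (packed) module name to the names of the
# separate checkpoint modules that are packed into it.
PACKED_MODULES_MAPPING: Dict[str, Dict[str, List[str]]] = {
    "example_dense": {
        "qkv_proj": ["q_proj", "k_proj", "v_proj"],
        "gate_up_proj": ["gate_proj", "up_proj"],
    },
    "example_moe": {
        "gate_up_proj": ["gate_proj", "up_proj"],
    },
}


def get_packed_modules_mapping(model_type: str) -> Dict[str, List[str]]:
    """Return the mapping for a model type, or an empty dict if unknown."""
    return PACKED_MODULES_MAPPING.get(model_type, {})


def expand_packed_name(model_type: str, prefix: str) -> List[str]:
    """Expand a fused module path into the checkpoint module paths it packs.

    Paths whose last component is not a packed module are returned unchanged.
    """
    mapping = get_packed_modules_mapping(model_type)
    parent, _, leaf = prefix.rpartition(".")
    if leaf not in mapping:
        return [prefix]
    return [f"{parent}.{shard}" if parent else shard for shard in mapping[leaf]]
```

With a lookup like this, quantization code can resolve the per-shard names for a fused layer without the model class carrying its own mapping, which is what would allow a custom model class to be dropped in favor of the upstream one.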

### Does this PR introduce _any_ user-facing change?

tested by CI

### How was this patch tested?

tested by CI

- vLLM version: v0.10.2
- vLLM main: 5089fd749c

Signed-off-by: 22dimensions <waitingwind@foxmail.com>

2025-09-19 20:50:14 +08:00

| File | Last commit | Date |
| --- | --- | --- |
| test_quant_config.py | [3/N][Refactor][Quantization]remove packed_modules_mapping from models (#3021) | 2025-09-19 20:50:14 +08:00 |
| test_utils.py | [1/N][Refactor][Quantization] remove redundant quantizer class (#2680) | 2025-09-04 11:35:14 +08:00 |
| test_w4a8_dynamic.py | [feat]: oproj tensor parallelism in pure DP and graph-mode scenarios. (#2167) | 2025-09-07 10:31:32 +08:00 |
| test_w8a8_dynamic.py | [feat]: oproj tensor parallelism in pure DP and graph-mode scenarios. (#2167) | 2025-09-07 10:31:32 +08:00 |
| test_w8a8.py | [main] [refactor] refactor common_fused_moe.py (#2706) | 2025-09-08 20:09:50 +08:00 |