xc-llm-ascend

Files

Icey c5fe179cef [0.11.0] [Cherry-pick #4058 ] Fixes Qwen3-Next enable nz accuracy problem (#4056 )

### What this PR does / why we need it?
- Fixes Qwen3-Next enable nz accuracy problem

---------

Signed-off-by: wxsIcey <1790571317@qq.com>
Signed-off-by: Icey <1790571317@qq.com>

2025-11-10 20:56:39 +08:00

test_quant_config.py

[Feat] Unquantized Linear to nz and control all nz-cast (#3356 )

2025-10-14 17:39:26 +08:00

test_utils.py

[1/N][Refactor][Quantization] remove redundant quantizer class (#2680 )

2025-09-04 11:35:14 +08:00

test_w4a4_flatquant_dynamic.py

[Refactor] Clean up w4a4_flatquant_dynamic implementation (#3440 )

2025-10-17 23:53:19 +08:00

test_w4a8_dynamic.py

[0.11.0] [Cherry-pick #4058 ] Fixes Qwen3-Next enable nz accuracy problem (#4056 )

2025-11-10 20:56:39 +08:00

test_w8a8_dynamic.py

[feat]: oproj tensor parallelism in pure DP and graph-mode scenarios. (#2167 )

2025-09-07 10:31:32 +08:00

test_w8a8.py

Remove unused row_idx in token_dispatcher (#3442 )

2025-10-15 09:08:31 +08:00