[3/N][refactor] refactor quantization (#2504)
### What this PR does / why we need it?
Move the torchair-related quantization code into the torchair directory to make the code clearer. The next step is to remove all torchair-related code outside of torchair quantization.

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
- vLLM version: main
- vLLM main: ab9f2cfd19
- vLLM version: v0.10.1.1
- vLLM main: 959783fb99

Signed-off-by: hust17yixuan <303660421@qq.com>
1016
vllm_ascend/torchair/quantization/torchair_w8a8_dynamic.py
Normal file
File diff suppressed because it is too large