[Doc]modify the quantization user guide and add a quantization adaptation developer guide (#5554)

### What this PR does / why we need it?
This PR makes the following modifications:
1.delete the `user_guide/feature_guide/quantization-llm-compressor.md`
and merge it into `user_guide/feature_guide/quantization.md`.
2.update the content of `user_guide/feature_guide/quantization.md`.
3.add guidance `developer_guide/feature_guide/quantization.md' on the
adaptation of quantization algorithms and quantized models.
### Does this PR introduce _any_ user-facing change?
N/A
### How was this patch tested?

- vLLM version: v0.13.0
- vLLM main:
7157596103

---------

Signed-off-by: IncSec <1790766300@qq.com>
Signed-off-by: InSec <1790766300@qq.com>

This commit is contained in:

InSec

2026-01-05 09:12:11 +08:00

committed by

GitHub

parent 96775a27a8

commit 7cf65d0581

10 changed files with 204 additions and 116 deletions

BIN
docs/source/assets/quantization/quant_algorithm_overview.png Normal file

View File

Binary file not shown.

After

Width: | Height: | Size: 28 KiB

[Doc]modify the quantization user guide and add a quantization adaptation developer guide (#5554)

BIN docs/source/assets/quantization/quant_algorithm_overview.png Normal file View File

BIN
docs/source/assets/quantization/quant_algorithm_overview.png Normal file

View File