[Doc] Support kimi-k2-w8a8 (#2162)
### What this PR does / why we need it?
In fact, the kimi-k2 model is similar to the deepseek model, and we only
need to make a few changes to support it. what does this pr do:
1. Add kimi-k2-w8a8 deployment doc
2. Update quantization doc
3. Upgrade torchair support list
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
- vLLM version: v0.10.0
- vLLM main:
9edd1db02b
---------
Signed-off-by: wangli <wangli858794774@gmail.com>
This commit is contained in:
Binary file not shown.
|
Before Width: | Height: | Size: 115 KiB |
BIN
docs/source/assets/multi_node_dp_deepseek.png
Normal file
BIN
docs/source/assets/multi_node_dp_deepseek.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 90 KiB |
BIN
docs/source/assets/multi_node_dp_kimi.png
Normal file
BIN
docs/source/assets/multi_node_dp_kimi.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 129 KiB |
Reference in New Issue
Block a user