[Doc] Support kimi-k2-w8a8 (#2162)

### What this PR does / why we need it? In fact, the kimi-k2 model is similar to the deepseek model, and we only need to make a few changes to support it. what does this pr do: 1. Add kimi-k2-w8a8 deployment doc 2. Update quantization doc 3. Upgrade torchair support list ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.10.0 - vLLM main: 9edd1db02b --------- Signed-off-by: wangli <wangli858794774@gmail.com>
2025-08-06 19:28:47 +08:00
parent 875a86cbe9
commit bf84f2dbfa
8 changed files with 194 additions and 40 deletions
--- a/docs/source/tutorials/index.md
+++ b/docs/source/tutorials/index.md
@@ -13,4 +13,5 @@ multi_npu_qwen3_moe
 multi_npu_quantization
 single_node_300i
 multi_node
+multi_node_kimi
 :::