[Lint]Style: reformat markdown files via markdownlint (#5884)

### What this PR does / why we need it? reformat markdown files via markdownlint - vLLM version: v0.13.0 - vLLM main: bde38c11df --------- Signed-off-by: root <root@LAPTOP-VQKDDVMG.localdomain> Signed-off-by: MrZ20 <2609716663@qq.com> Co-authored-by: root <root@LAPTOP-VQKDDVMG.localdomain>
2026-01-15 09:06:01 +08:00
parent 96edd4673f
commit 4811ba62e0
75 changed files with 711 additions and 308 deletions
--- a/docs/source/user_guide/feature_guide/ucm_deployment.md
+++ b/docs/source/user_guide/feature_guide/ucm_deployment.md
@@ -89,6 +89,7 @@ INFO:     Application startup complete.
 Congratulations, you have successfully started the vLLM server with UCM connector!

 ## Evaluating UCM Prefix Caching Performance
+
 After launching the vLLM server with `UCMConnector` enabled, the easiest way to observe the prefix caching effect is to run the built-in `vllm bench` CLI. Executing the following command **twice** in a separate terminal shows the improvement clearly.

 ```bash
@@ -109,32 +110,34 @@ vllm bench serve \
 ```

 ### After the first execution
+
 The `vllm bench` terminal prints the benchmark result:

-```
+```shell
 ---------------Time to First Token----------------
 Mean TTFT (ms):                           15323.87
 ```

 Inspecting the vLLM server logs reveals entries like:

-```
+```shell
 INFO ucm_connector.py:228: request_id: xxx, total_blocks_num: 125, hit hbm: 0, hit external: 0
 ```

 This indicates that for the first inference request, UCM did not hit any cached KV blocks. As a result, the full 16K-token prefill must be computed, leading to a relatively large TTFT.

 ### After the second execution
+
 Running the same benchmark again produces:

-```
+```shell
 ---------------Time to First Token----------------
 Mean TTFT (ms):                            1920.68
 ```

 The vLLM server logs now contain similar entries:

-```
+```shell
 INFO ucm_connector.py:228: request_id: xxx, total_blocks_num: 125, hit hbm: 0, hit external: 125
 ```