### What this PR does / why we need it? Using the cache load operator to replace the index select operator. - vLLM version: v0.14.1 - vLLM main: dc917cceb8 --------- Signed-off-by: liziyu <liziyu16@huawei.com>
dc917cceb8
hf_config
hf_text_config
envs
envs_ascend