Song Zhixin
|
216fc0e8e4
|
[feature] Prompt Embeddings Support for v1 Engine (#3026)
### What this PR does / why we need it?
this PR based on
[19746](https://github.com/vllm-project/vllm/issues/19746), support
Prompt Embeddings for v1 engine on NPU
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
```python
python examples/prompt_embed_inference.py
```
- vLLM version: v0.11.0
- vLLM main:
https://github.com/vllm-project/vllm/commit/releases/v0.11.1
---------
Signed-off-by: jesse <szxfml@gmail.com>
|
2025-10-30 17:15:57 +08:00 |
|