enginex-mlu370-vllm/vllm-v0.6.2/vllm/model_executor
Chranos ebdc6fed03 fix: pass lm_head to LogitsProcessor instead of calling forward()
In vLLM v0.6.2, ParallelLMHead.forward() raises RuntimeError because its
weights are meant to be applied via lm_head.linear_method.apply() inside
LogitsProcessor. Pass lm_head as the first argument when calling
LogitsProcessor, which handles the hidden_states -> logits projection
internally.
2026-02-06 14:21:14 +08:00
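
The pattern described in the commit message can be sketched roughly as follows. This is a minimal, dependency-free illustration, not the actual vLLM code: ToyLinearMethod, ToyLMHead, ToyLogitsProcessor, and both compute_logits_* helpers are hypothetical stand-ins, plain Python lists replace tensors, and the real LogitsProcessor takes further arguments (such as sampling metadata) that are omitted here.

```python
# Toy stand-ins (hypothetical names), NOT the real vLLM classes.

class ToyLinearMethod:
    """Projects hidden states with a layer's weights: logits = h @ W^T."""

    def apply(self, layer, hidden_states):
        return [
            [sum(h * w for h, w in zip(row, w_row)) for w_row in layer.weight]
            for row in hidden_states
        ]


class ToyLMHead:
    """Holds the output projection weights but refuses direct calls."""

    def __init__(self, weight):
        self.weight = weight  # shape: [vocab_size, hidden_size]
        self.linear_method = ToyLinearMethod()

    def forward(self, hidden_states):
        # Mirrors the behaviour described in the commit message.
        raise RuntimeError("LM head weights should be used via the logits processor.")

    def __call__(self, hidden_states):
        return self.forward(hidden_states)


class ToyLogitsProcessor:
    """Receives lm_head as its first argument and does the projection itself."""

    def __call__(self, lm_head, hidden_states):
        return lm_head.linear_method.apply(lm_head, hidden_states)


def compute_logits_broken(lm_head, logits_processor, hidden_states):
    # Old pattern: calls lm_head directly, which raises RuntimeError.
    return lm_head(hidden_states)


def compute_logits_fixed(lm_head, logits_processor, hidden_states):
    # Fixed pattern: hand lm_head to the logits processor instead.
    return logits_processor(lm_head, hidden_states)


if __name__ == "__main__":
    head = ToyLMHead(weight=[[1, 0], [0, 1], [1, 1]])  # vocab=3, hidden=2
    processor = ToyLogitsProcessor()
    print(compute_logits_fixed(head, processor, [[2, 3]]))
```

In this sketch the model never calls the head's forward(); it only passes the head object along, so the component that owns the projection decides how the weights are applied.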